StrucTexT: Structured Text Understanding with Multi-Modal Transformers Y Li, Y Qian, Y Yu, X Qin, C Zhang, Y Liu, K Yao, J Han, J Liu, E Ding arXiv preprint arXiv:2108.02923, 2021 | 96 | 2021 |
Robust match fusion using optimization X Qin, J Shen, X Mao, X Li, Y Jia IEEE transactions on cybernetics 45 (8), 1549-1560, 2014 | 47 | 2014 |
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction ED He guo, Xiameng Qin, Jiaming Liu, Junyu Han, Jingtuo Liu ICDAR 2019, 2019 | 45* | 2019 |
Structextv2: Masked visual-textual prediction for document image pre-training Y Yu, Y Li, C Zhang, X Zhang, Z Guo, X Qin, K Yao, J Han, E Ding, J Wang arXiv preprint arXiv:2303.00289, 2023 | 26 | 2023 |
Bilateral cross-modality graph matching attention for feature fusion in visual question answering J Cao, X Qin, S Zhao, J Shen IEEE Transactions on Neural Networks and Learning Systems, 2022 | 19 | 2022 |
Vehicle color recognition based on license plate color Y Dong, M Pei, X Qin 2014 tenth international conference on computational intelligence and …, 2014 | 14 | 2014 |
A real-time system for 3D recovery of dynamic scene with multiple RGBD imagers D Yong, C Lei, W Yucheng, Y Min, Q Xiameng, H Shaoyang, J Yunde CVPR 2011 WORKSHOPS, 1-8, 2011 | 14 | 2011 |
Structured-patch optimization for dense correspondence X Qin, J Shen, X Mao, X Li, Y Jia IEEE Transactions on Multimedia 17 (3), 295-306, 2015 | 10 | 2015 |
An embedded calibration stereovision system J Yunde, Q Xiameng 2012 IEEE Intelligent Vehicles Symposium, 1072-1077, 2012 | 6 | 2012 |
Fast-structext: An efficient hourglass transformer with modality-guided dynamic token merge for document understanding M Zhai, Y Li, X Qin, C Yi, Q Xie, C Zhang, K Yao, Y Wu, Y Jia arXiv preprint arXiv:2305.11392, 2023 | 5 | 2023 |
Stereo camera calibration with an embedded calibration device and scene features X Qin, J Yang, W Liang, M Pei, Y Jia 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2306 …, 2012 | 3 | 2012 |
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision Y Zhai, X Zhang, X Qin, S Zhao, X Dong, J Shen Machine Intelligence Research, 1-14, 2024 | 2 | 2024 |
MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary B Dai, Q Xie, Y Li, X Qin, C Zhang, K Yao, J Han arXiv preprint arXiv:2307.12571, 2023 | 2 | 2023 |
A Unified Probabilistic Framework for Real-Time Depth Map Fusion. Y Duan, M Pei, Y Wang, M Yang, I Qin, Y Jia J. Inf. Sci. Eng. 31 (4), 1309-1327, 2015 | 2 | 2015 |
Collaborative Position Reasoning Network for Referring Image Segmentation J Cao, B Dai, Y Li, X Qin, J Wang arXiv preprint arXiv:2401.11775, 2024 | | 2024 |
Image-based information extraction model, method, and apparatus, device, and storage medium QIN Xiameng, Y Li, X Zhang, J Huang, XIE Qunyi, K Yao US Patent App. 18/113,178, 2024 | | 2024 |
Character detection method and apparatus, model training method and apparatus, device and storage medium J Huang, X Zhang, QIN Xiameng, C Zhang, K Yao US Patent App. 18/168,089, 2023 | | 2023 |
Text extraction method, text extraction model training method, electronic device and storage medium QIN Xiameng, X Zhang, J Huang, Y Li, XIE Qunyi, K Yao, J Han US Patent App. 18/059,362, 2023 | | 2023 |
Method and platform of generating document, electronic device and storage medium XIE Qunyi, QIN Xiameng, M En, D Zhang, J HUANG, Y Xu, Y Chen, K Yao US Patent App. 17/974,183, 2023 | | 2023 |
Method for training model, device, and storage medium Y Xu, XIE Qunyi, Y Chen, QIN Xiameng, C Zhang, K Yao US Patent App. 17/972,253, 2023 | | 2023 |