Multi-task collaborative network for joint referring expression comprehension and segmentation G Luo, Y Zhou, X Sun, L Cao, C Wu, C Deng, R Ji Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2020 | 76 | 2020 |
Improving image captioning by leveraging intra-and inter-layer global representation in transformer network J Ji, Y Luo, X Sun, F Chen, G Luo, Y Wu, Y Gao, R Ji Proceedings of the AAAI conference on artificial intelligence 35 (2), 1655-1663, 2021 | 49 | 2021 |
Cascade grouped attention network for referring expression segmentation G Luo, Y Zhou, R Ji, X Sun, J Su, CW Lin, Q Tian Proceedings of the 28th ACM International Conference on Multimedia, 1274-1282, 2020 | 18 | 2020 |
A real-time global inference network for one-stage referring expression comprehension Y Zhou, R Ji, G Luo, X Sun, J Su, X Ding, CW Lin, Q Tian IEEE Transactions on Neural Networks and Learning Systems, 2021 | 15 | 2021 |
K-armed bandit based multi-modal network architecture search for visual question answering Y Zhou, R Ji, X Sun, G Luo, X Hong, J Su, X Ding, L Shao Proceedings of the 28th ACM International Conference on Multimedia, 1245-1254, 2020 | 7 | 2020 |
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks G Luo, Y Zhou, X Sun, Y Wang, L Cao, Y Wu, F Huang, R Ji IEEE Transactions on Image Processing 31, 3386-3398, 2022 | 1 | 2022 |
SeqTR: A Simple yet Universal Network for Visual Grounding C Zhu, Y Zhou, Y Shen, G Luo, X Pan, M Lin, C Chen, L Cao, X Sun, R Ji arXiv preprint arXiv:2203.16265, 2022 | 1 | 2022 |
Towards Language-guided Visual Recognition via Dynamic Convolutions G Luo, Y Zhou, X Sun, X Ding, Y Wu, F Huang, Y Gao, R Ji arXiv preprint arXiv:2110.08797, 2021 | 1 | 2021 |
No-reference image sharpness Algorithm based on gradient shape J Ni, G Luo, T Yu, NC Li 2016 9th International Congress on Image and Signal Processing, BioMedical …, 2016 | 1 | 2016 |
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning J Ji, X Huang, X Sun, Y Zhou, G Luo, L Cao, J Liu, L Shao, R Ji IEEE Transactions on Multimedia, 2022 | | 2022 |
What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study G Luo, Y Zhou, J Sun, S Huang, X Sun, Q Ye, Y Wu, R Ji arXiv preprint arXiv:2204.07913, 2022 | | 2022 |
Active Teacher for Semi-Supervised Object Detection P Mi, J Lin, Y Zhou, Y Shen, G Luo, X Sun, L Cao, R Fu, Q Xu, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | | 2022 |