Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ... European Conference on Computer Vision, 38-55, 2025 | 1400 | 2025 |
Pixel-bert: Aligning image pixels with text by deep multi-modal transformers Z Huang, Z Zeng, B Liu, D Fu, J Fu arXiv preprint arXiv:2004.00849, 2020 | 459 | 2020 |
Seeing out of the box: End-to-end pre-training for vision-language representation learning Z Huang, Z Zeng, Y Huang, B Liu, D Fu, J Fu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 294 | 2021 |
Grounded sam: Assembling open-world models for diverse visual tasks T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen, X Huang, Y Chen, F Yan, ... arXiv preprint arXiv:2401.14159, 2024 | 191 | 2024 |
Wsod2: Learning bottom-up and top-down objectness distillation for weakly-supervised object detection Z Zeng, B Liu, J Fu, H Chao, L Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 179 | 2019 |
Active contrastive learning of audio-visual video representations S Ma, Z Zeng, D McDuff, Y Song arXiv preprint arXiv:2009.09805, 2020 | 116 | 2020 |
Mind the discriminability: Asymmetric adversarial domain adaptation J Yang, H Zou, Y Zhou, Z Zeng, L Xie Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 57 | 2020 |
Contrastive learning of global and local video representations Z Zeng, D McDuff, Y Song Advances in Neural Information Processing Systems 34, 7025-7040, 2021 | 56 | 2021 |
GarbageNet: a unified learning framework for robust garbage classification J Yang, Z Zeng, K Wang, H Zou, L Xie IEEE Transactions on Artificial Intelligence 2 (4), 372-380, 2021 | 54 | 2021 |
Smp challenge: An overview of social media prediction challenge 2019 B Wu, WH Cheng, P Liu, B Liu, Z Zeng, J Luo Proceedings of the 27th ACM International Conference on Multimedia, 2667-2671, 2019 | 42 | 2019 |
Suppressing mislabeled data via grouping and self-attention X Peng, K Wang, Z Zeng, Q Li, J Yang, Y Qiao Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 40 | 2020 |
Reference-based defect detection network Z Zeng, B Liu, J Fu, H Chao IEEE Transactions on Image Processing 30, 6637-6647, 2021 | 36 | 2021 |
Detection transformer with stable matching S Liu, T Ren, J Chen, Z Zeng, H Zhang, F Li, H Li, J Huang, H Su, J Zhu, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 32 | 2023 |
T-rex2: Towards generic object detection via text-visual prompt synergy Q Jiang, F Li, Z Zeng, T Ren, S Liu, L Zhang European Conference on Computer Vision, 38-57, 2025 | 18 | 2025 |
Dfa3d: 3d deformable attention for 2d-to-3d feature lifting H Li, H Zhang, Z Zeng, S Liu, F Li, T Ren, L Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 16 | 2023 |
detrex: Benchmarking detection transformers T Ren, S Liu, F Li, H Zhang, A Zeng, J Yang, X Liao, D Jia, H Li, H Cao, ... arXiv preprint arXiv:2306.07265, 2023 | 13 | 2023 |
Activitynet 2019 task 3: Exploring contexts for dense captioning events in videos S Chen, Y Song, Y Zhao, Q Jin, Z Zeng, B Liu, J Fu, A Hauptmann arXiv preprint arXiv:1907.05092, 2019 | 13 | 2019 |
Pixel-bert: Aligning image pixels with text by deep multi-modal transformers. arXiv 2020 Z Huang, Z Zeng, B Liu, D Fu, J Fu arXiv preprint arXiv:2004.00849, 2020 | 12 | 2020 |
Taptr: Tracking any point with transformers as detection H Li, H Zhang, S Liu, Z Zeng, T Ren, F Li, L Zhang European Conference on Computer Vision, 57-75, 2025 | 11 | 2025 |
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection T Ren, Q Jiang, S Liu, Z Zeng, W Liu, H Gao, H Huang, Z Ma, X Jiang, ... arXiv preprint arXiv:2405.10300, 2024 | 11 | 2024 |