K-net: Towards unified image segmentation W Zhang, J Pang, K Chen, CC Loy Advances in Neural Information Processing Systems 34, 10326-10338, 2021 | 274 | 2021 |
MMDetection3D: OpenMMLab Next-Generation Platform for General 3D Object Detection M Contributors | 245 | 2020 |
Seesaw loss for long-tailed instance segmentation J Wang, W Zhang, Y Zang, Y Cao, J Pang, T Gong, K Chen, Z Liu, CC Loy, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 241 | 2021 |
Robust multi-modality multi-object tracking W Zhang, H Zhou, S Sun, Z Wang, J Shi, CC Loy Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 223 | 2019 |
Mmrotate: A rotated object detection benchmark using pytorch Y Zhou, X Yang, G Zhang, J Wang, Y Liu, L Hou, X Jiang, X Liu, J Yan, ... Proceedings of the 30th ACM International Conference on Multimedia, 7331-7334, 2022 | 171 | 2022 |
Rtmdet: An empirical study of designing real-time object detectors C Lyu, W Zhang, H Huang, Y Zhou, Y Wang, Y Liu, S Zhang, K Chen arXiv preprint arXiv:2212.07784, 2022 | 161 | 2022 |
Side-aware boundary localization for more precise object detection J Wang, W Zhang, Y Cao, K Chen, J Pang, T Gong, J Shi, CC Loy, D Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 153 | 2020 |
Multimodal-gpt: A vision and language model for dialogue with humans T Gong, C Lyu, S Zhang, Y Wang, M Zheng, Q Zhao, K Liu, W Zhang, ... arXiv preprint arXiv:2305.04790, 2023 | 144 | 2023 |
Econas: Finding proxies for economical neural architecture search D Zhou, X Zhou, W Zhang, CC Loy, S Yi, X Zhang, W Ouyang Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2020 | 131 | 2020 |
Internlm: A multilingual language model with progressively enhanced capabilities ILM Team 2023-01-06)[2023-09-27]. https://github. com/InternLM/InternLM, 2023 | 120 | 2023 |
Gpt4roi: Instruction tuning large language model on region-of-interest S Zhang, P Sun, S Chen, M Xiao, W Shao, W Zhang, K Chen, P Luo arXiv preprint arXiv:2307.03601, 2023 | 83 | 2023 |
Video k-net: A simple, strong, and unified baseline for video segmentation X Li, W Zhang, J Pang, K Chen, G Cheng, Y Tong, CC Loy Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 71 | 2022 |
Exploring data augmentation for multi-modality 3d object detection W Zhang, Z Wang, CC Loy arXiv preprint arXiv:2012.12741, 2020 | 67 | 2020 |
MMOCR: a comprehensive toolbox for text detection, recognition and understanding Z Kuang, H Sun, Z Li, X Yue, TH Lin, J Chen, H Wei, Y Zhu, T Gao, ... Proceedings of the 29th ACM International Conference on Multimedia, 3791-3794, 2021 | 63 | 2021 |
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, XDB Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, S Zhang, ... arXiv preprint arXiv:2309.15112, 2023 | 62 | 2023 |
Opencompass: A universal evaluation platform for foundation models OC Contributors GitHub repository, 2023 | 61 | 2023 |
Transformer-based visual segmentation: A survey X Li, H Ding, H Yuan, W Zhang, J Pang, G Cheng, K Chen, Z Liu, CC Loy arXiv preprint arXiv:2304.09854, 2023 | 47 | 2023 |
Dense distinct query for end-to-end object detection S Zhang, X Wang, J Wang, J Pang, C Lyu, W Zhang, P Luo, K Chen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 47 | 2023 |
Aligning bag of regions for open-vocabulary object detection S Wu, W Zhang, S Jin, W Liu, CC Loy Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 44 | 2023 |
Robo3d: Towards robust and reliable 3d perception against corruptions L Kong, Y Liu, X Li, R Chen, W Zhang, J Ren, L Pan, K Chen, Z Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 40 | 2023 |