Follow
Jiaqi Wang
Jiaqi Wang
Shanghai AI Laboratory
Verified email at pjlab.org.cn - Homepage
Title
Cited by
Cited by
Year
MMDetection: Open mmlab detection toolbox and benchmark
K Chen, J Wang, J Pang, Y Cao, Y Xiong, X Li, S Sun, W Feng, Z Liu, J Xu, ...
arXiv preprint arXiv:1906.07155, 2019
33622019
Hybrid task cascade for instance segmentation
K Chen, J Pang, J Wang, Y Xiong, X Li, S Sun, W Feng, Z Liu, J Shi, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern ¡K, 2019
15902019
Region proposal by guided anchoring
J Wang, K Chen, S Yang, C Change Loy, D Lin
Proceedings of the IEEE Conference on Computer Vision and Pattern ¡K, 2019
7622019
CARAFE: Content-Aware ReAssembly of FEatures
J Wang, K Chen, R Xu, Z Liu, CC Loy, D Lin
Proceedings of the IEEE International Conference on Computer Vision, 2019
7032019
Mmbench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
European Conference on Computer Vision, 216-233, 2025
6012025
Sharegpt4v: Improving large multi-modal models with better captions
L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin
European Conference on Computer Vision, 370-387, 2025
3622025
Lavt: Language-aware vision transformer for referring image segmentation
Z Yang, J Wang, Y Tang, K Chen, H Zhao, PHS Torr
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡K, 2022
3092022
Seesaw Loss for Long-Tailed Instance Segmentation
J Wang, W Zhang, Y Zang, Y Cao, J Pang, T Gong, K Chen, Z Liu, CC Loy, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2902021
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
2382024
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
1732024
Side-aware boundary localization for more precise object detection
J Wang, W Zhang, Y Cao, K Chen, J Pang, T Gong, J Shi, CC Loy, D Lin
Proceedings of the European Conference on Computer Vision (ECCV), 2020
1712020
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...
arXiv preprint arXiv:2309.15112, 2023
1632023
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
1592024
Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation
T Wu, J Zhang, X Fu, Y Wang, J Ren, L Pan, W Wu, L Yang, J Wang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡K, 2023
1572023
Pyskl: Towards good practices for skeleton action recognition
H Duan, J Wang, K Chen, D Lin
Proceedings of the 30th ACM International Conference on Multimedia, 7351-7354, 2022
1392022
Optimizing video object detection via a scale-time lattice
K Chen, J Wang, S Yang, X Zhang, Y Xiong, CC Loy, D Lin
Proceedings of the IEEE conference on computer vision and pattern ¡K, 2018
1382018
Dense distinct query for end-to-end object detection
S Zhang, X Wang, J Wang, J Pang, C Lyu, W Zhang, P Luo, K Chen
Proceedings of the IEEE/CVF conference on computer vision and pattern ¡K, 2023
1282023
Are We on the Right Way for Evaluating Large Vision-Language Models?
L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ...
arXiv preprint arXiv:2403.20330, 2024
1052024
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation
Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡K, 2024
1012024
Few-shot object detection via association and discrimination
Y Cao, J Wang, Y Jin, T Wu, K Chen, Z Liu, D Lin
Advances in neural information processing systems 34, 16570-16581, 2021
1012021
The system can't perform the operation now. Try again later.
Articles 1–20