Jianjian Sun
Researcher of Megvii Technology
Verified email at megvii.com
Title
Cited by
Year
Bevdepth: Acquisition of reliable depth for multi-view 3d object detection
Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li
Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1477-1485, 2023
Cited by: 320, Year: 2023
Bevstereo: Enhancing depth estimation in multi-view 3d object detection with temporal stereo
Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li
Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1486-1494, 2023
Cited by: 114, Year: 2023
Autoencoders as cross-modal teachers: Can pretrained 2d image transformers help 3d representation learning?
R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma
arXiv preprint arXiv:2212.08320, 2022
Cited by: 51, Year: 2022
Dreamllm: Synergistic multimodal comprehension and creation
R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ...
arXiv preprint arXiv:2309.11499, 2023
Cited by: 43, Year: 2023
Exploring recurrent long-term temporal fusion for multi-view 3d perception
C Han, J Sun, Z Ge, J Yang, R Dong, H Zhou, W Mao, Y Peng, X Zhang
arXiv preprint arXiv:2303.05970, 2023
Cited by: 31, Year: 2023
Reversible column networks
Y Cai, Y Zhou, Q Han, J Sun, X Kong, J Li, X Zhang
arXiv preprint arXiv:2212.11696, 2022
Cited by: 31, Year: 2022
Cross modal transformer via coordinates encoding for 3d object detection
J Yan, Y Liu, J Sun, F Jia, S Li, T Wang, X Zhang
arXiv preprint arXiv:2301.01283, 2023
Cited by: 27, Year: 2023
Chatspot: Bootstrapping multimodal llms via precise referring instruction tuning
L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ...
arXiv preprint arXiv:2307.09474, 2023
Cited by: 22, Year: 2023
Cross modal transformer: Towards fast and robust 3d object detection
J Yan, Y Liu, J Sun, F Jia, S Li, T Wang, X Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 21, Year: 2023
Vary: Scaling up the vision vocabulary for large vision-language models
H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang
arXiv preprint arXiv:2312.06109, 2023
Cited by: 13, Year: 2023
Small Language Model Meets with Reinforced Vision Vocabulary
H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang
arXiv preprint arXiv:2401.12503, 2024
Cited by: 6, Year: 2024
The 1st-place solution for cvpr 2023 openlane topology in autonomous driving challenge
D Wu, F Jia, J Chang, Z Li, J Sun, C Han, S Li, Y Liu, Z Ge, T Wang
arXiv preprint arXiv:2306.09590, 2023
Cited by: 5, Year: 2023
Bevstereo++: Accurate depth estimation in multi-view 3d object detection via dynamic temporal stereo
Y Li, J Yang, J Sun, H Bao, Z Ge, L Xiao
arXiv preprint arXiv:2304.04185, 2023
Cited by: 2, Year: 2023
OneChart: Purify the Chart Structural Extraction via One Auxiliary Token
J Chen, L Kong, H Wei, C Liu, Z Ge, L Zhao, J Sun, C Han, X Zhang
arXiv preprint arXiv:2404.09987, 2024
Year: 2024
First Place Solution to the 3D Object Detection of the SSLAD2022 Challenge
T Huang, Z Yao, L Liu, B Wang, T Jiang, J Sun, X Wang, Z Li, H Yao