Bevdepth: Acquisition of reliable depth for multi-view 3d object detection Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1477-1485, 2023 | 320 | 2023 |
Bevstereo: Enhancing depth estimation in multi-view 3d object detection with temporal stereo Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1486-1494, 2023 | 114 | 2023 |
Autoencoders as cross-modal teachers: Can pretrained 2d image transformers help 3d representation learning? R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma arXiv preprint arXiv:2212.08320, 2022 | 51 | 2022 |
Dreamllm: Synergistic multimodal comprehension and creation R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ... arXiv preprint arXiv:2309.11499, 2023 | 43 | 2023 |
Exploring recurrent long-term temporal fusion for multi-view 3d perception C Han, J Sun, Z Ge, J Yang, R Dong, H Zhou, W Mao, Y Peng, X Zhang arXiv preprint arXiv:2303.05970, 2023 | 31 | 2023 |
Reversible column networks Y Cai, Y Zhou, Q Han, J Sun, X Kong, J Li, X Zhang arXiv preprint arXiv:2212.11696, 2022 | 31 | 2022 |
Cross modal transformer via coordinates encoding for 3d object dectection J Yan, Y Liu, J Sun, F Jia, S Li, T Wang, X Zhang arXiv preprint arXiv:2301.01283 2 (3), 4, 2023 | 27 | 2023 |
Chatspot: Bootstrapping multimodal llms via precise referring instruction tuning L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ... arXiv preprint arXiv:2307.09474, 2023 | 22 | 2023 |
Cross modal transformer: Towards fast and robust 3d object detection J Yan, Y Liu, J Sun, F Jia, S Li, T Wang, X Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 21 | 2023 |
Vary: Scaling up the vision vocabulary for large vision-language models H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang arXiv preprint arXiv:2312.06109, 2023 | 13 | 2023 |
Small Language Model Meets with Reinforced Vision Vocabulary H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang arXiv preprint arXiv:2401.12503, 2024 | 6 | 2024 |
The 1st-place solution for cvpr 2023 openlane topology in autonomous driving challenge D Wu, F Jia, J Chang, Z Li, J Sun, C Han, S Li, Y Liu, Z Ge, T Wang arXiv preprint arXiv:2306.09590, 2023 | 5 | 2023 |
Bevstereo++: Accurate depth estimation in multi-view 3d object detection via dynamic temporal stereo Y Li, J Yang, J Sun, H Bao, Z Ge, L Xiao arXiv preprint arXiv:2304.04185, 2023 | 2 | 2023 |
OneChart: Purify the Chart Structural Extraction via One Auxiliary Token J Chen, L Kong, H Wei, C Liu, Z Ge, L Zhao, J Sun, C Han, X Zhang arXiv preprint arXiv:2404.09987, 2024 | | 2024 |
Bevstereo: Enhancing depth estimation in multi-view 3d object detection with temporal stereo Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1486-1494, 2023 | | 2023 |
First Place Solution to the 3D Object Detection of the SSLAD2022 Challenge T Huang, Z Yao, L Liu, B Wang, T Jiang, J Sun, X Wang, Z Li, H Yao | | |