Safety-enhanced autonomous driving using interpretable sensor fusion transformer H Shao, L Wang, R Chen, H Li, Y Liu 6th Annual Conference on Robot Learning, 2022 | 218 | 2022 |
Lmdrive: Closed-loop end-to-end driving with large language models H Shao, Y Hu, L Wang, G Song, SL Waslander, Y Liu, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 101 | 2024 |
Temporal interlacing network H Shao, S Qian, Y Liu Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 11966 …, 2020 | 98 | 2020 |
Sphinx-x: Scaling data and parameters for a family of multi-modal large language models D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ... arXiv preprint arXiv:2402.05935, 2024 | 88* | 2024 |
Reasonnet: End-to-end driving with temporal and global reasoning H Shao, L Wang, R Chen, SL Waslander, H Li, Y Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 79 | 2023 |
Visual cot: Advancing multi-modal language models with a comprehensive dataset and benchmark for chain-of-thought reasoning H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li The Thirty-eight Conference on Neural Information Processing Systems …, 2024 | 40* | 2024 |
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors L Wang, J Liu, H Shao, W Wang, R Chen, Y Liu, SL Waslander Robotics: Science and Systems (RSS 2023), 2023 | 31 | 2023 |
Mova: Adapting mixture of vision experts to multimodal context Z Zong, B Ma, D Shen, G Song, H Shao, D Jiang, H Li, Y Liu Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024 | 24 | 2024 |
Blending anti-aliasing into vision transformer S Qian, H Shao, Y Zhu, M Li, J Jia Advances in Neural Information Processing Systems 34, 5416-5429, 2021 | 20 | 2021 |
DI-drive: OpenDILab decision intelligence platform for autonomous driving simulation DI Contributors | 10 | 2021 |
SmartRefine: An Scenario-Adaptive Refinement Framework for Efficient Motion Prediction Y Zhou*, H Shao*, L Wang, SL Waslander, H Li, Y Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 9 | 2024 |
1st place solution for ava-kinetics crossover in acitivitynet challenge 2020 S Chen, J Pan, G Song, M Zhang, H Shao, Z Lin, J Shao, H Li, Y Liu arXiv preprint arXiv:2006.09116, 2020 | 6 | 2020 |
Top-1 solution of multi-moments in time challenge 2019 M Zhang, H Shao, G Song, Y Liu, J Yan arXiv preprint arXiv:2003.05837, 2020 | 2 | 2020 |
Self-supervised temporal learning H Shao, Y Liu, H Li | 1 | 2021 |
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping H Shao, S Wang, Y Zhou, G Song, D He, S Qin, Z Zong, B Ma, Y Liu, H Li arXiv preprint arXiv:2412.11279, 2024 | | 2024 |
Easyref: Omni-generalized group image reference for diffusion models via multimodal llm Z Zong, D Jiang, B Ma, G Song, H Shao, D Shen, Y Liu, H Li arXiv preprint arXiv:2412.09618, 2024 | | 2024 |
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction Y Zhou*, H Shao*, L Wang, SL Waslander, H Li, Y Liu arXiv preprint arXiv:2410.08669, 2024 | | 2024 |
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020 H Su, J Feng, H Shao, Z Jiang, M Zhang, W Wu, Y Liu, H Li, J Yan arXiv preprint arXiv:2007.09883, 2020 | | 2020 |
Team Efficient Multi-Moments in Time Challenge 2019 Technical Report M Zhang, H Shao, G Song, Y Liu, J Yan | | |