Lewei Lu
Research Director (We're Hiring, luotto@sensetime.com) @ SenseTime Research
Verified email at sensetime.com
Title | Cited by | Year
Deformable DETR: Deformable Transformers for End-to-End Object Detection
X Zhu, W Su, L Lu, B Li, X Wang, J Dai
The International Conference on Learning Representations (ICLR), 2021
Cited by: 4631 | Year: 2021
VL-BERT: Pre-Training of Generic Visual-Linguistic Representations
W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai
The International Conference on Learning Representations (ICLR), 2020
Cited by: 1751 | Year: 2020
InternImage: Exploring large-scale vision foundation models with deformable convolutions
W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 491 | Year: 2023
Planning-oriented autonomous driving
Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 326 | Year: 2023
BEVFormer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision
C Yang, Y Chen, H Tian, C Tao, X Zhu, Z Zhang, G Huang, H Li, Y Qiao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 164 | Year: 2023
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
R Liu, H Deng, Y Huang, X Shi, L Lu, W Sun, X Wang, J Dai, H Li
International Conference on Computer Vision (ICCV), 2021
Cited by: 130 | Year: 2021
Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe
H Li, C Sima, J Dai, W Wang, L Lu, H Wang, J Zeng, Z Li, J Yang, H Deng, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Cited by: 87 | Year: 2023
Scene as occupancy
W Tong, C Sima, T Wang, L Chen, S Wu, H Deng, Y Gu, L Lu, P Luo, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 60 | Year: 2023
Decoupled spatial-temporal transformer for video inpainting
R Liu, H Deng, Y Huang, X Shi, L Lu, W Sun, X Wang, J Dai, H Li
arXiv preprint arXiv:2104.06637, 2021
Cited by: 54 | Year: 2021
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
Cited by: 51 | Year: 2024
Ghost in the Minecraft: Generally capable agents for open-world environments via large language models with text-based knowledge and memory
X Zhu, Y Chen, H Tian, C Tao, W Su, C Yang, G Huang, B Li, L Lu, ...
arXiv preprint arXiv:2305.17144, 2023
Cited by: 44 | Year: 2023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, M Zhong, Q Zhang, X Zhu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 40 | Year: 2024
Towards all-in-one pre-training via maximizing multi-modal mutual information
W Su, X Zhu, C Tao, L Lu, B Li, G Huang, Y Qiao, X Wang, J Zhou, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 35 | Year: 2023
DriveMLM: Aligning multi-modal large language models with behavioral planning states for autonomous driving
W Wang, J Xie, CY Hu, H Zou, J Fan, W Tong, Y Wen, S Wu, H Deng, Z Li, ...
arXiv preprint arXiv:2312.09245, 2023
Cited by: 32 | Year: 2023
MM-Interleaved: Interleaved image-text generative modeling via multi-modal feature synchronizer
C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ...
arXiv preprint arXiv:2401.10208, 2024
Cited by: 20 | Year: 2024
ControlLLM: Augment language models with tools by searching on graphs
Z Liu, Z Lai, Z Gao, E Cui, Z Li, X Zhu, L Lu, Q Chen, Y Qiao, J Dai, ...
arXiv preprint arXiv:2310.17796, 2023
Cited by: 20 | Year: 2023
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask
J Tan, G Zhang, H Deng, C Wang, L Lu, Q Li, J Dai
European Conference on Computer Vision Workshops (ECCVW), 2020
Cited by: 18 | Year: 2020
Scene as occupancy
C Sima, W Tong, T Wang, L Chen, S Wu, H Deng, Y Gu, L Lu, P Luo, ...
arXiv preprint arXiv:2306.02851, 2023
Cited by: 15 | Year: 2023
Efficient deformable convnets: Rethinking dynamic and sparse operator for vision applications
Y Xiong, Z Li, Y Chen, F Wang, X Zhu, J Luo, W Wang, T Lu, H Li, Y Qiao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 14 | Year: 2024
Vision-RWKV: Efficient and scalable visual perception with RWKV-like architectures
Y Duan, W Wang, Z Chen, X Zhu, L Lu, T Lu, Y Qiao, H Li, J Dai, W Wang
arXiv preprint arXiv:2403.02308, 2024
Cited by: 13 | Year: 2024
Articles 1–20