Weijie Su
Title
Cited by
Year
Deformable DETR: Deformable Transformers for End-to-End Object Detection
X Zhu, W Su, L Lu, B Li, X Wang, J Dai
International Conference on Learning Representations (ICLR), 2021
Cited by: 4011; Year: 2021
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai
International Conference on Learning Representations (ICLR), 2020
Cited by: 1652; Year: 2020
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
X Zhu, Y Chen, H Tian, C Tao, W Su, C Yang, G Huang, B Li, L Lu, ...
arXiv preprint arXiv:2305.17144, 2023
Cited by: 105*; Year: 2023
Siamese image modeling for self-supervised vision representation learning
C Tao, X Zhu, W Su, G Huang, B Li, J Zhou, Y Qiao, X Wang, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 52; Year: 2023
Towards all-in-one pre-training via maximizing multi-modal mutual information
W Su, X Zhu, C Tao, L Lu, B Li, G Huang, Y Qiao, X Wang, J Zhou, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 26; Year: 2023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
arXiv preprint arXiv:2312.14238, 2023
Cited by: 9; Year: 2023