Follow
Chunyuan Li
Chunyuan Li
Microsoft Research, Redmond
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Visual instruction tuning
H Liu*, C Li*, Q Wu, YJ Lee
NeurIPS, 2023
43132023
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
X Li, X Yin, C Li, X Hu, P Zhang, L Zhang, L Wang, H Hu, L Dong, F Wei, ...
European Conference on Computer Vision (ECCV), 2020
2320*2020
Improved baselines with visual instruction tuning
H Liu, C Li, Y Li, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
14392024
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ...
European Conference on Computer Vision, 38-55, 2025
13932025
Grounded Language-Image Pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
CVPR, 2022
10752022
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Y Pu, Z Gan, R Henao, X Yuan, C Li, A Stevens, L Carin
Neural Information Processing Systems (NIPS), 2016
10142016
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
9032021
Instruction tuning with gpt-4
B Peng, C Li, P He, M Galley, J Gao
arXiv preprint arXiv:2304.03277, 2023
8102023
Focal self-attention for local-global interactions in vision transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
arXiv preprint arXiv:2107.00641, 2021
632*2021
Mimic-it: Multi-modal in-context instruction tuning
B Li, Y Zhang, L Chen, J Wang, F Pu, J Yang, C Li, Z Liu
arXiv preprint arXiv:2306.05425, 2023
5922023
Gligen: Open-set grounded text-to-image generation
Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
5712023
RegionCLIP: Region-based Language-Image Pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
CVPR, 2022
5442022
Joint Embedding of Words and Labels for Text Classification
G Wang, C Li, W Wang, Y Zhang, D Shen, X Zhang, R Henao, L Carin
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
5362018
Llava-med: Training a large language-and-vision assistant for biomedicine in one day
C Li*, C Wong*, S Zhang*, N Usuyama, H Liu, J Yang, T Naumann, ...
NeurIPS, 2023
4852023
Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms
D Shen, G Wang, W Wang, MR Min, Q Su, Y Zhang, C Li, R Henao, ...
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
4582018
Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing
H Fu*, C Li*, X Liu, J Gao, A Celikyilmaz, L Carin
NAACL, 2019
4462019
Llava-next: Improved reasoning, ocr, and world knowledge
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024
440*2024
Measuring the intrinsic dimension of objective landscapes
C Li, H Farkhoor, R Liu, J Yosinski
arXiv preprint arXiv:1804.08838, 2018
4122018
Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural Networks
C Li, C Chen, D Carlson, L Carin
AAAI Conference on Artificial Intelligence, 2016
3932016
Soloist: Few-shot task-oriented dialog with a single pretrained auto-regressive model
B Peng, C Li, J Li, S Shayandeh, L Liden, J Gao
arXiv preprint arXiv:2005.05298 3, 2020
332*2020
The system can't perform the operation now. Try again later.
Articles 1–20