追蹤
Chunyuan Li
Chunyuan Li
Microsoft Research, Redmond
在 microsoft.com 的電子郵件地址已通過驗證 - 首頁
標題
引用次數
引用次數
年份
Visual instruction tuning
H Liu*, C Li*, Q Wu, YJ Lee
NeurIPS, 2023
41862023
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
X Li, X Yin, C Li, X Hu, P Zhang, L Zhang, L Wang, H Hu, L Dong, F Wei, ...
European Conference on Computer Vision (ECCV), 2020
2301*2020
Improved baselines with visual instruction tuning
H Liu, C Li, Y Li, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
13832024
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ...
European Conference on Computer Vision, 38-55, 2025
13622025
Grounded Language-Image Pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
CVPR, 2022
10602022
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Y Pu, Z Gan, R Henao, X Yuan, C Li, A Stevens, L Carin
Neural Information Processing Systems (NIPS), 2016
10102016
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
8962021
Instruction tuning with gpt-4
B Peng, C Li, P He, M Galley, J Gao
arXiv preprint arXiv:2304.03277, 2023
8012023
Focal self-attention for local-global interactions in vision transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
arXiv preprint arXiv:2107.00641, 2021
631*2021
Mimic-it: Multi-modal in-context instruction tuning
B Li, Y Zhang, L Chen, J Wang, F Pu, J Yang, C Li, Z Liu
arXiv preprint arXiv:2306.05425, 2023
5862023
Gligen: Open-set grounded text-to-image generation
Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
5552023
RegionCLIP: Region-based Language-Image Pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
CVPR, 2022
5372022
Joint Embedding of Words and Labels for Text Classification
G Wang, C Li, W Wang, Y Zhang, D Shen, X Zhang, R Henao, L Carin
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
5342018
Llava-med: Training a large language-and-vision assistant for biomedicine in one day
C Li*, C Wong*, S Zhang*, N Usuyama, H Liu, J Yang, T Naumann, ...
NeurIPS, 2023
4692023
Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms
D Shen, G Wang, W Wang, MR Min, Q Su, Y Zhang, C Li, R Henao, ...
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
4572018
Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing
H Fu*, C Li*, X Liu, J Gao, A Celikyilmaz, L Carin
NAACL, 2019
4402019
Llava-next: Improved reasoning, ocr, and world knowledge
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024
430*2024
Measuring the intrinsic dimension of objective landscapes
C Li, H Farkhoor, R Liu, J Yosinski
arXiv preprint arXiv:1804.08838, 2018
4122018
Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural Networks
C Li, C Chen, D Carlson, L Carin
AAAI Conference on Artificial Intelligence, 2016
3932016
Soloist: Few-shot task-oriented dialog with a single pretrained auto-regressive model
B Peng, C Li, J Li, S Shayandeh, L Liden, J Gao
arXiv preprint arXiv:2005.05298 3, 2020
330*2020
系統目前無法執行作業,請稍後再試。
文章 1–20