Seeing is not always believing: Benchmarking human and model perception of ai-generated images Z Lu, D Huang, L Bai, J Qu, C Wu, X Liu, W Ouyang Advances in Neural Information Processing Systems 36, 25435-25447, 2023 | 56 | 2023 |
Llama pro: Progressive llama with block expansion C Wu, Y Gan, Y Ge, Z Lu, J Wang, Y Feng, P Luo, Y Shan ACL 2024, main conference, 2024 | 51 | 2024 |
-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation C Wu, T Wang, Y Ge, Z Lu, R Zhou, Y Shan, P Luo International Conference on Machine Learning, 37713-37727, 2023 | 33 | 2023 |
Fit: Flexible vision transformer for diffusion model Z Lu, Z Wang, D Huang, C Wu, X Liu, W Ouyang, L Bai ICML 2024, 2024 | 32 | 2024 |
NTIRE 2022 image inpainting challenge: Report A Romero, A Castillo, J Abril-Nova, R Timofte, R Das, S Hira, Z Pan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 28 | 2022 |
Janus: Decoupling visual encoding for unified multimodal understanding and generation C Wu, X Chen, Z Wu, Y Ma, X Liu, Z Pan, W Liu, Z Xie, X Yu, C Ruan, ... arXiv preprint arXiv:2410.13848, 2024 | 25 | 2024 |
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots C Wu, Y Ge, Q Guo, J Wang, Z Liang, Z Lu, Y Shan, P Luo arXiv preprint arXiv:2405.07990, 2024 | 14 | 2024 |
Hierarchical diffusion autoencoders and disentangled image manipulation Z Lu, C Wu, X Chen, Y Wang, L Bai, Y Qiao, X Liu Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 9 | 2024 |
Generative data augmentation for non-iid problem in decentralized clinical machine learning Z Wang, S Duan, C Wu, W Lin, X Zha, P Han, C Liu 2022 4th International Conference on Data Intelligence and Security (ICDIS …, 2022 | 9 | 2022 |
Deepseek-vl2: Mixture-of-experts vision-language models for advanced multimodal understanding Z Wu, X Chen, Z Pan, X Liu, W Liu, D Dai, H Gao, Y Ma, C Wu, B Wang, ... arXiv preprint arXiv:2412.10302, 2024 | 8 | 2024 |
Automatic time series forecasting model design based on pruning C Wang, X Chen, C Wu, H Wang Applied Soft Computing, 111804, 2024 | 8* | 2024 |
Janusflow: Harmonizing autoregression and rectified flow for unified multimodal understanding and generation Y Ma, X Liu, X Chen, W Liu, C Wu, Z Wu, Z Pan, Z Xie, H Zhang, L Zhao, ... arXiv preprint arXiv:2411.07975, 2024 | 3 | 2024 |
Adapting llama decoder to vision transformer J Wang, W Shao, M Chen, C Wu, Y Liu, T Wu, K Zhang, S Zhang, K Chen, ... arXiv preprint arXiv:2404.06773, 2024 | 3 | 2024 |
Autoregressive Models in Vision: A Survey J Xiong, G Liu, L Huang, C Wu, T Wu, Y Mu, Y Yao, H Shen, Z Wan, ... arXiv preprint arXiv:2411.05902, 2024 | 1 | 2024 |
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation J Wang, N Kang, L Yao, M Chen, C Wu, S Zhang, S Xue, Y Liu, T Wu, ... arXiv preprint arXiv:2501.12976, 2025 | | 2025 |