Composer: Creative and controllable image synthesis with composable conditions L Huang, D Chen, Y Liu, Y Shen, D Zhao, J Zhou arXiv preprint arXiv:2302.09778, 2023 | 133 | 2023 |
AnyDoor: Zero-shot object-level image customization X Chen, L Huang, Y Liu, Y Shen, D Zhao, H Zhao arXiv preprint arXiv:2307.09481, 2023 | 66 | 2023 |
Self-supervised video representation learning by context and motion decoupling L Huang, Y Liu, B Wang, P Pan, Y Xu, R Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 56 | 2021 |
Cones: Concept neurons in diffusion models for customized generation Z Liu, R Feng, K Zhu, Y Zhang, K Zheng, Y Liu, D Zhao, J Zhou, Y Cao arXiv preprint arXiv:2303.05125, 2023 | 45 | 2023 |
Cones 2: Customizable image synthesis with multiple subjects Z Liu, Y Zhang, Y Shen, K Zheng, K Zhu, R Feng, Y Liu, D Zhao, J Zhou, ... arXiv preprint arXiv:2305.19327, 2023 | 34 | 2023 |
GeoAug: Data augmentation for few-shot NeRF with geometry constraints D Chen, Y Liu, L Huang, B Wang, P Pan European Conference on Computer Vision, 322-337, 2022 | 26 | 2022 |
Animating images to transfer CLIP for video-text retrieval Y Liu, H Chen, L Huang, D Chen, B Wang, P Pan, L Wang Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 12 | 2022 |
DreamVideo: Composing your dream videos with customized subject and motion Y Wei, S Zhang, Z Qing, H Yuan, Z Liu, Y Liu, Y Zhang, J Zhou, H Shan arXiv preprint arXiv:2312.04433, 2023 | 11 | 2023 |
LivePhoto: Real Image Animation with Text-guided Motion Control X Chen, Z Liu, M Chen, Y Feng, Y Liu, Y Shen, H Zhao arXiv preprint arXiv:2312.02928, 2023 | 9 | 2023 |
VideoLCM: Video latent consistency model X Wang, S Zhang, H Zhang, Y Liu, Y Zhang, C Gao, N Sang arXiv preprint arXiv:2312.09109, 2023 | 8 | 2023 |
Communication efficient SGD via gradient sampling with Bayes prior L Song, K Zhao, P Pan, Y Liu, Y Zhang, Y Xu, R Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 7 | 2021 |
Dimensionality-varying diffusion process H Zhang, R Feng, Z Yang, L Huang, Y Liu, Y Zhang, Y Shen, D Zhao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 6 | 2023 |
DiffGAR: Model-agnostic restoration from generative artifacts using image-to-image diffusion models Y Yin, L Huang, Y Liu, K Huang Proceedings of the 2022 6th International Conference on Computer Science and …, 2022 | 6 | 2022 |
Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning Y Liu, L Huang, P Pan, B Wang, Y Xu, R Jin Proceedings of the AAAI Conference on Artificial Intelligence, 2021 | 5 | 2021 |
Eliminating Lipschitz singularities in diffusion models Z Yang, R Feng, H Zhang, Y Shen, K Zhu, L Huang, Y Zhang, Y Liu, ... arXiv preprint arXiv:2306.11251, 2023 | 4 | 2023 |
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Y Feng, B Gong, D Chen, Y Shen, Y Liu, J Zhou arXiv preprint arXiv:2311.17002, 2023 | 3 | 2023 |
Once and for all: Self-supervised multi-modal co-training on one-billion videos at Alibaba L Huang, Y Liu, X Zhou, A You, M Li, B Wang, Y Zhang, P Pan, X Yinghui Proceedings of the 29th ACM International Conference on Multimedia, 1148-1156, 2021 | 3 | 2021 |
Enhancing textual cues in multi-modal transformers for VQA Y Liu, L Huang, L Song, B Wang, Y Zhang, P Pan VizWiz Challenge 2021, 2021 | 3 | 2021 |
CCM: Adding Conditional Controls to Text-to-Image Consistency Models J Xiao, K Zhu, H Zhang, Z Liu, Y Shen, Y Liu, X Fu, ZJ Zha arXiv preprint arXiv:2312.06971, 2023 | 2 | 2023 |
Efficient-VQGAN: Towards high-resolution image generation with efficient vision transformers S Cao, Y Yin, L Huang, Y Liu, X Zhao, D Zhao, K Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 2 | 2023 |