Semantic-aware implicit neural audio-driven video portrait generation X Liu, Y Xu, Q Wu, H Zhou, W Wu, B Zhou European Conference on Computer Vision, 2022, 2022 | 134 | 2022 |
Learning hierarchical cross-modal association for co-speech gesture generation X Liu, Q Wu, H Zhou, Y Xu, R Qian, X Lin, X Zhou, W Wu, B Dai, B Zhou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 114 | 2022 |
Taming diffusion models for audio-driven co-speech gesture generation X Liu, L Zhu, X Liu, R Qian, Z Liu, L Yu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 97 | 2023 |
Object-Compositional Neural Implicit Surfaces Q Wu, X Liu, Y Chen, K Li, C Zheng, J Cai, J Zheng European Conference on Computer Vision, 2022, 2022 | 76 | 2022 |
Monohuman: Animatable human neural field from monocular video Z Yu, W Cheng, X Liu, W Wu, KY Lin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 70 | 2023 |
Humangaussian: Text-driven 3d human generation with gaussian splatting X Liu, X Zhan, J Tang, Y Shan, G Zeng, D Lin, X Liu, Z Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 66 | 2024 |
Motion capture from internet videos J Dong, Q Shuai, Y Zhang, X Liu, X Zhou, H Bao Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 51 | 2020 |
Audio-Driven Co-Speech Gesture Video Generation X Liu, Q Wu, H Zhou, Y Du, W Wu, D Lin, Z Liu Advances in Neural Information Processing Systems, 21386--21399, 2022 | 39 | 2022 |
Hyperhuman: Hyper-realistic human generation with latent structural diffusion X Liu, J Ren, A Siarohin, I Skorokhodov, Y Li, D Lin, X Liu, Z Liu, ... The Twelfth International Conference on Learning Representations (ICLR), 2024, 2024 | 35 | 2024 |
Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis L Zhu, Z Xue, Z Jin, X Liu, J He, Z Liu, L Yu International Conference on Medical Image Computing and Computer-Assisted …, 2023 | 31 | 2023 |
Enhancing self-supervised video representation learning via multi-level feature optimization R Qian, Y Li, H Liu, J See, S Ding, X Liu, D Li, W Lin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 30 | 2021 |
Visual sound localization in the wild by cross-modal interference erasing X Liu, R Qian, H Zhou, D Hu, W Lin, Z Liu, B Zhou, X Zhou Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022), 2022 | 28 | 2022 |
Static and Dynamic Concepts for Self-supervised Video Representation Learning R Qian, S Ding, X Liu, D Lin European Conference on Computer Vision, 2022, 2022 | 25 | 2022 |
Tc4d: Trajectory-conditioned text-to-4d generation S Bahmani, X Liu, W Yifan, I Skorokhodov, V Rong, Z Liu, X Liu, JJ Park, ... European Conference on Computer Vision, 53-72, 2025 | 18 | 2025 |
Brushnet: A plug-and-play image inpainting model with decomposed dual-branch diffusion X Ju, X Liu, X Wang, Y Bian, Y Shan, Q Xu arXiv preprint arXiv:2403.06976, 2024 | 18 | 2024 |
ChemSpacE: Toward Steerable and Interpretable Chemical Space Exploration Y Du, X Liu, S Liu, J Zhang, B Zhou Transactions on Machine Learning Research (TMLR), 2023, 2022 | 16* | 2022 |
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos R Qian, S Ding, X Liu, D Lin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 14 | 2023 |
Textcraftor: Your text encoder can be image quality controller Y Li, X Liu, A Kag, J Hu, Y Idelbayev, D Sagar, Y Wang, S Tulyakov, J Ren Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 11 | 2024 |
T2v-compbench: A comprehensive benchmark for compositional text-to-video generation K Sun, K Huang, X Liu, Y Wu, Z Xu, Z Li, X Liu arXiv preprint arXiv:2407.14505, 2024 | 10 | 2024 |
Accelerating auto-regressive text-to-image generation with training-free speculative jacobi decoding Y Teng, H Shi, X Liu, X Ning, G Dai, Y Wang, Z Li, X Liu arXiv preprint arXiv:2410.01699, 2024 | 5 | 2024 |