Self-play fine-tuning converts weak language models to strong language models Z Chen, Y Deng, H Yuan, K Ji, Q Gu arXiv preprint arXiv:2401.01335, 2024 | 42 | 2024 |
SIPID: A deep learning framework for sinogram interpolation and image denoising in low-dose CT reconstruction H Yuan, J Jia, Z Zhu 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018 …, 2018 | 42 | 2018 |
A general framework for sample-efficient function approximation in reinforcement learning Z Chen, CJ Li, A Yuan, Q Gu, MI Jordan arXiv preprint arXiv:2209.15634, 2022 | 30 | 2022 |
Stochastic recursive momentum for policy gradient methods H Yuan, X Lian, J Liu, Y Zhou arXiv preprint arXiv:2003.04302, 2020 | 28 | 2020 |
Efficient smooth non-convex stochastic compositional optimization via stochastic recursive gradient descent W Hu, CJ Li, X Lian, J Liu, H Yuan Advances in Neural Information Processing Systems 32, 2019 | 25 | 2019 |
Stochastic recursive momentum method for non-convex compositional optimization H Yuan, W Hu arXiv preprint arXiv:2006.01688, 2020 | 14 | 2020 |
Stochastic recursive variance reduction for efficient smooth non-convex compositional optimization H Yuan, X Lian, J Liu arXiv preprint arXiv:1912.13515, 2019 | 11 | 2019 |
Nesterov meets optimism: rate-optimal separable minimax optimization CJ Li, H Yuan, G Gidel, Q Gu, M Jordan International Conference on Machine Learning, 20351-20383, 2023 | 8 | 2023 |
Stochastic modified equations for continuous limit of stochastic ADMM X Zhou, H Yuan, CJ Li, Q Sun arXiv preprint arXiv:2003.03532, 2020 | 8 | 2020 |
Differential inclusions for modeling nonsmooth ADMM variants: A continuous limit theory H Yuan, Y Zhou, CJ Li, Q Sun International Conference on Machine Learning, 7232-7241, 2019 | 8 | 2019 |
Object-oriented state abstraction in reinforcement learning for video games Y Chen, H Yuan, Y Li 2019 IEEE Conference on Games (CoG), 1-4, 2019 | 6 | 2019 |
Policy optimization via stochastic recursive gradient algorithm H Yuan, CJ Li, Y Tang, Y Zhou | 3 | 2018 |
Fast Sampling via De-randomization for Discrete Diffusion Models Z Chen, H Yuan, Y Li, Y Kou, J Zhang, Q Gu arXiv preprint arXiv:2312.09193, 2023 | 2 | 2023 |
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation H Yuan, Z Chen, K Ji, Q Gu arXiv preprint arXiv:2402.10210, 2024 | 1 | 2024 |
Self-Play Preference Optimization for Language Model Alignment Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu arXiv preprint arXiv:2405.00675, 2024 | | 2024 |
Protein Conformation Generation via Force-Guided SE (3) Diffusion Models Y Wang, L Wang, Y Shen, Y Wang, H Yuan, Y Wu, Q Gu arXiv preprint arXiv:2403.14088, 2024 | | 2024 |
Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure A Yuan, CJ Li, G Gidel, M Jordan, Q Gu, SS Du Advances in Neural Information Processing Systems 36, 2024 | | 2024 |
A Continuous Limit Theory for Nonsmooth ADMM Variants H Yuan, Y Zhou, CJ Li, Q Sun | | 2019 |