A general sample complexity analysis of vanilla policy gradient R Yuan, RM Gower, A Lazaric International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022 | 43 | 2022 |
Sketched Newton-Raphson R Yuan, A Lazaric, RM Gower SIAM Journal on Optimization 32 (3), 1555-1583, 2022 | 26* | 2022 |
Linear convergence of natural policy gradient methods with log-linear policies R Yuan, SS Du, RM Gower, A Lazaric, L Xiao The Eleventh International Conference on Learning Representations, 2022 | 25 | 2022 |
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence C Alfano, R Yuan, P Rebeschini Advances in Neural Information Processing Systems 36, 2024 | 9 | 2024 |
SAN: Stochastic Average Newton Algorithm for Minimizing Finite Sums J Chen, R Yuan, G Garrigos, RM Gower International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022 | 5 | 2022 |
Enhancing Policy Gradient with the Polyak Step-Size Adaption Y Li, R Yuan, C Fan, M Schmidt, S Horváth, RM Gower, M Takáč arXiv preprint arXiv:2404.07525, 2024 | | 2024 |
Understanding in-context learning in transformers S Rossi, R Yuan, T Hannagan The Third Blogpost Track at ICLR 2024, 2024 | | 2024 |
Méthodes du second d'ordre stochastiques et analysis de temps fini des méthodes de policie gradient R Yuan | | 2023 |
Stochastic Second Order Methods and Finite Time Analysis of Policy Gradient Methods R Yuan Institut polytechnique de Paris, 2023 | | 2023 |