Yu Bai

Citations

	All	Since 2019
Citations	2311	2249
h-index	23	23
i10-index	37	37

Citations per year

Year	Citations
2017	17
2018	45
2019	86
2020	166
2021	358
2022	491
2023	739
2024	406

Open-access publications

13 articles available
0 articles not available

Based on funders' open-access mandates

Co-authors

Song Mei, Assistant Professor at UC Berkeley (verified email at berkeley.edu)
Huan Wang, Salesforce Research (verified email at yale.edu)
Caiming Xiong, Salesforce Research (verified email at salesforce.com)
Chi Jin, Assistant Professor, Princeton University (verified email at princeton.edu)
Yu-Xiang Wang, Associate Professor of Computer Science, UC Santa Barbara (verified email at cs.ucsb.edu)
Tiancheng Yu, Two Sigma (verified email at mit.edu)
Nan Jiang, Assistant Professor of Computer Science, UIUC (verified email at illinois.edu)
Jason D. Lee, Associate Professor of Electrical Engineering and Computer Science, Princeton University (verified email at princeton.edu)
Tengyang Xie, University of Wisconsin-Madison, Microsoft Research (verified email at cs.wisc.edu)
Andrea Montanari, Professor of Statistics and Mathematics, Stanford University (verified email at stanford.edu)
Minshuo Chen, Princeton University (verified email at princeton.edu)
Qinghua Liu, Princeton University (verified email at princeton.edu)
Fan Chen, Massachusetts Institute of Technology (verified email at mit.edu)
Ming Yin, Princeton University (verified email at princeton.edu)
Ziang Song, Stanford University (verified email at stanford.edu)
Tuo Zhao, Assistant Professor, Georgia Tech (verified email at gatech.edu)
Sham M Kakade, Harvard University (verified email at seas.harvard.edu)
Edo Liberty, pinecone.io (verified email at edoliberty.com)
Andrej Risteski, Carnegie Mellon University (verified email at andrew.cmu.edu)
Tengyu MA, Stanford University (verified email at stanford.edu)

Yu Bai

Research Scientist, Salesforce Research

Verified email at salesforce.com

Machine Learning, Statistics


Title	Cited by	Year
The landscape of empirical risk for nonconvex losses. S Mei, Y Bai, A Montanari. The Annals of Statistics 46 (6A), 2747-2774, 2018	347	2018
Provable self-play algorithms for competitive reinforcement learning. Y Bai, C Jin. International Conference on Machine Learning, 551-560, 2020	164	2020
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning. T Xie, N Jiang, H Wang, C Xiong, Y Bai. Advances in Neural Information Processing Systems 34, 27395-27407, 2021	143	2021
A sharp analysis of model-based reinforcement learning with self-play. Q Liu, T Yu, Y Bai, C Jin. International Conference on Machine Learning, 7001-7010, 2021	138	2021
Near-Optimal Reinforcement Learning with Self-Play. Y Bai, C Jin, T Yu. Advances in Neural Information Processing Systems, 2020	131	2020
ProxQuant: Quantized neural networks via proximal operators. Y Bai, YX Wang, E Liberty. International Conference on Learning Representations (ICLR) 2019, 2018	120	2018
Beyond linearization: On quadratic and higher-order approximation of wide neural networks. Y Bai, JD Lee. International Conference on Learning Representations (ICLR) 2020, 2019	116	2019
Provably Efficient Q-Learning with Low Switching Cost. Y Bai, T Xie, N Jiang, YX Wang. Advances in Neural Information Processing Systems, 2019	98	2019
When can we learn general-sum Markov games with a large number of players sample-efficiently? Z Song, S Mei, Y Bai. International Conference on Learning Representations (ICLR) 2022, 2021	86	2021
Near-optimal provable uniform convergence in offline policy evaluation for reinforcement learning. M Yin, Y Bai, YX Wang. International Conference on Artificial Intelligence and Statistics, 1567-1575, 2021	86*	2021
Approximability of discriminators implies diversity in GANs. Y Bai, T Ma, A Risteski. International Conference on Learning Representations (ICLR) 2019, 2018	85	2018
How important is the train-validation split in meta-learning? Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong. International Conference on Machine Learning, 543-553, 2021	70	2021
Near-optimal offline reinforcement learning via double variance reduction. M Yin, Y Bai, YX Wang. Advances in Neural Information Processing Systems 34, 7677-7688, 2021	69	2021
Sample-efficient learning of Stackelberg equilibria in general-sum games. Y Bai, C Jin, H Wang, C Xiong. Advances in Neural Information Processing Systems 34, 25799-25811, 2021	65	2021
Transformers as statisticians: Provable in-context learning with in-context algorithm selection. Y Bai, F Chen, H Wang, C Xiong, S Mei. Advances in Neural Information Processing Systems 36, 2024	63	2024
Subgradient descent learns orthogonal dictionaries. Y Bai, Q Jiang, J Sun. International Conference on Learning Representations (ICLR) 2019, 2018	58	2018
Towards understanding hierarchical learning: Benefits of neural representations. M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher. Advances in Neural Information Processing Systems, 2020	50	2020
The role of coverage in online reinforcement learning. T Xie, DJ Foster, Y Bai, N Jiang, SM Kakade. arXiv preprint arXiv:2210.04157, 2022	44	2022
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification. Y Bai, S Mei, H Wang, C Xiong. International Conference on Machine Learning, 566-576, 2021	42	2021
Unified algorithms for RL with decision-estimation coefficients: No-regret, PAC, and reward-free learning. F Chen, S Mei, Y Bai. arXiv preprint arXiv:2209.11745, 2022	30	2022
