Yaqi Duan

Cited by

	All	Since 2019
Citations	535	533
h-index	10	10
i10-index	11	11

Citations per year: 2019: 7, 2020: 15, 2021: 78, 2022: 141, 2023: 180, 2024: 109

Public access (based on funding mandates): 7 articles available, 0 articles not available

Yaqi Duan

Department of Technology, Operations and Statistics at NYU Stern

Verified email at stern.nyu.edu


Title	Authors	Venue	Cited by	Year
Minimax-optimal off-policy evaluation with linear function approximation	Y Duan, M Wang	International Conference on Machine Learning, 2701-2709, 2020	161	2020
Near-optimal offline reinforcement learning with linear representation: Leveraging variance information with pessimism	M Yin, Y Duan, M Wang, YX Wang	International Conference on Learning Representations, 2022	70	2022
State aggregation learning from Markov transition data	Y Duan, T Ke, M Wang	Advances in Neural Information Processing Systems, 4486-4495, 2019	63	2019
Risk bounds and Rademacher complexity in batch reinforcement learning	Y Duan, C Jin, Z Li	International Conference on Machine Learning, 2892-2902, 2021	52	2021
Bootstrapping fitted Q-evaluation for off-policy inference	B Hao, X Ji, Y Duan, H Lu, C Szepesvári, M Wang	International Conference on Machine Learning, 4074-4084, 2021	38	2021
Optimal policy evaluation using kernel-based temporal difference methods	Y Duan, M Wang, MJ Wainwright	Annals of Statistics, 2024	37	2024
Sparse feature selection makes batch reinforcement learning more sample efficient	B Hao, Y Duan, T Lattimore, C Szepesvári, M Wang	International Conference on Machine Learning, 4063-4073, 2021	36	2021
Adaptive and robust multi-task learning	Y Duan, K Wang	Annals of Statistics 51 (5), 2015-2039, 2023	24	2023
Bootstrapping statistical inference for off-policy evaluation	B Hao, X Ji, Y Duan, H Lu, C Szepesvári, M Wang	arXiv preprint, arXiv:2102.03607, 2021	16	2021
Learning low-dimensional state embeddings and metastable clusters from time series data	Y Sun, Y Duan, H Gong, M Wang	Advances in Neural Information Processing Systems, 4561-4570, 2019	14	2019
Learning good state and action representations for Markov decision process via tensor decomposition	C Ni, Y Duan, M Dahleh, M Wang, AR Zhang	Journal of Machine Learning Research 24 (115), 1-53, 2023	10*	2023
Adaptive low-nonnegative-rank approximation for state aggregation of Markov chains	Y Duan, M Wang, Z Wen, Y Yuan	SIAM Journal on Matrix Analysis and Applications 41 (1), 244-278, 2020	9	2020
A finite-sample analysis of multi-step temporal difference estimates	Y Duan, MJ Wainwright	Learning for Dynamics and Control Conference, 612-624, 2023	4	2023
Policy evaluation from a single path: Multi-step methods, mixing and mis-specification	Y Duan, MJ Wainwright	arXiv preprint, arXiv:2211.03899, 2022	1	2022
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces	Y Duan, MJ Wainwright	arXiv preprint, arXiv:2401.05233, 2024		2024
