Chenlu Ye
Iterative preference learning from human feedback: Bridging theory and practice for RLHF under KL-constraint
W Xiong, H Dong, C Ye, Z Wang, H Zhong, H Ji, N Jiang, T Zhang
Forty-first International Conference on Machine Learning, 2024
Cited by: 46*
Corruption-robust algorithms with uncertainty weighting for nonlinear contextual bandits and Markov decision processes
C Ye, W Xiong, Q Gu, T Zhang
International Conference on Machine Learning, 39834-39863, 2023
Cited by: 19
A theoretical analysis of Nash learning from human feedback under general KL-regularized preference
C Ye, W Xiong, Y Zhang, N Jiang, T Zhang
arXiv preprint arXiv:2402.07314, 2024
Cited by: 15
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
C Ye, R Yang, Q Gu, T Zhang
Neural Information Processing Systems, 2023
Cited by: 9
Towards robust model-based reinforcement learning against adversarial corruption
C Ye, J He, Q Gu, T Zhang
arXiv preprint arXiv:2402.08991, 2024
Cited by: 2
Optimal sample selection through uncertainty estimation and its application in deep learning
Y Lin, C Liu, C Ye, Q Lian, Y Yao, T Zhang
arXiv preprint arXiv:2309.02476, 2023
Cited by: 2
Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks
J Fan, Z Wang, Z Yang, C Ye
arXiv preprint arXiv:2311.13180, 2023