Yangchen Pan
Title
Cited by
Year
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations (ICLR), 2020
Cited by 95, 2020
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Y Pan, K Banman, M White
International Conference on Learning Representations (ICLR), 2021
Cited by 75*, 2019
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Cited by 49, 2019
Accelerated gradient temporal difference learning
Y Pan, A White, M White
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by 26, 2017
Incremental truncated LSTD
C Gehring, Y Pan, M White
Proceedings of the Twenty-Fifth International Joint Conference on Artificial …, 2015
Cited by 15, 2015
Hill climbing on value estimates for search-control in Dyna
Y Pan, H Yao, A Farahmand, M White
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Cited by 14, 2019
Frequency-based Search-control in Dyna
Y Pan, J Mei, A Farahmand
International Conference on Learning Representations (ICLR), 2020
Cited by 10, 2020
Reinforcement learning with function-valued action spaces for partial differential equation control
Y Pan, A Farahmand, M White, S Nabi, P Grover, D Nikovski
International Conference on Machine Learning, 3986-3995, 2018
Cited by 9, 2018
Effective sketching methods for value function approximation
Y Pan, ES Azer, M White
Uncertainty in Artificial Intelligence (UAI), 2017
Cited by 9, 2017
Actor-expert: A framework for using action-value methods in continuous action spaces
S Lim, A Joseph, L Le, Y Pan, M White
arXiv preprint arXiv:1810.09103 22, 2018
Cited by 8, 2018
Actor-expert: A framework for using q-learning in continuous action spaces
S Lim, A Joseph, L Le, Y Pan, M White
arXiv preprint arXiv:1810.09103 9, 2018
Cited by 7, 2018
Adapting kernel representations online using submodular maximization
M Schlegel, Y Pan, J Chen, M White
International Conference on Machine Learning, 3037-3046, 2017
Cited by 7, 2017
An implicit function learning approach for parametric modal regression
Y Pan, E Imani, A Farahmand, M White
Advances in Neural Information Processing Systems 33, 2020
Cited by 6, 2020
Memory-efficient Reinforcement Learning with Knowledge Consolidation
Q Lan, Y Pan, J Luo, AR Mahmood
arXiv preprint arXiv:2205.10868, 2022
Cited by 3, 2022
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo
The 38th Conference on Uncertainty in Artificial Intelligence, 2022
Cited by 2, 2022
An Alternate Policy Gradient Estimator for Softmax Policies
S Garg, S Tosatto, Y Pan, M White, AR Mahmood
arXiv preprint arXiv:2112.11622, 2021
Cited by 1, 2021
Beyond prioritized replay: Sampling states in model-based RL via simulated priorities
J Mei, Y Pan, M White, A Farahmand, H Yao
Cited by 1, 2020
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
X Zhao, Y Pan, C Xiao, S Chandar, J Rajendran
arXiv preprint arXiv:2303.09032, 2023
2023
The In-Sample Softmax for Offline Reinforcement Learning
C Xiao, H Wang, Y Pan, A White, M White
arXiv preprint arXiv:2302.14372, 2023
2023
Label Alignment Regularization for Distribution Shift
E Imani, G Zhang, J Luo, P Poupart, Y Pan
arXiv preprint arXiv:2211.14960, 2022
2022