Maxmin Q-learning: Controlling the Estimation Bias of Q-learning Q Lan, Y Pan, A Fyshe, M White International Conference on Learning Representations (ICLR), 2020 | 95 | 2020 |
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online Y Pan, K Banman, M White International Conference on Learning Representations (ICLR), 2021 | 75* | 2019 |
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains Y Pan, M Zaheer, A White, A Patterson, M White International Joint Conference on Artificial Intelligence (IJCAI), 2019 | 49 | 2019 |
Accelerated gradient temporal difference learning Y Pan, A White, M White Thirty-First AAAI Conference on Artificial Intelligence, 2017 | 26 | 2017 |
Incremental truncated LSTD C Gehring, Y Pan, M White Proceedings of the Twenty-Fifth International Joint Conference on Artificial …, 2015 | 15 | 2015 |
Hill climbing on value estimates for search-control in Dyna Y Pan, H Yao, A Farahmand, M White International Joint Conference on Artificial Intelligence (IJCAI), 2019 | 14 | 2019 |
Frequency-based Search-control in Dyna Y Pan, J Mei, A Farahmand International Conference on Learning Representations (ICLR), 2020 | 10 | 2020 |
Reinforcement learning with function-valued action spaces for partial differential equation control Y Pan, A Farahmand, M White, S Nabi, P Grover, D Nikovski International Conference on Machine Learning, 3986-3995, 2018 | 9 | 2018 |
Effective sketching methods for value function approximation Y Pan, ES Azer, M White Uncertainty in Artificial Intelligence (UAI), 2017 | 9 | 2017 |
Actor-expert: A framework for using action-value methods in continuous action spaces S Lim, A Joseph, L Le, Y Pan, M White arXiv preprint arXiv:1810.09103 22, 2018 | 8 | 2018 |
Actor-expert: A framework for using Q-learning in continuous action spaces S Lim, A Joseph, L Le, Y Pan, M White arXiv preprint arXiv:1810.09103 9, 2018 | 7 | 2018 |
Adapting kernel representations online using submodular maximization M Schlegel, Y Pan, J Chen, M White International Conference on Machine Learning, 3037-3046, 2017 | 7 | 2017 |
An implicit function learning approach for parametric modal regression Y Pan, E Imani, A Farahmand, M White Advances in Neural Information Processing Systems 33, 2020 | 6 | 2020 |
Memory-efficient Reinforcement Learning with Knowledge Consolidation Q Lan, Y Pan, J Luo, AR Mahmood arXiv preprint arXiv:2205.10868, 2022 | 3 | 2022 |
Understanding and Mitigating the Limitations of Prioritized Experience Replay Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo The 38th Conference on Uncertainty in Artificial Intelligence, 2022 | 2 | 2022 |
An Alternate Policy Gradient Estimator for Softmax Policies S Garg, S Tosatto, Y Pan, M White, AR Mahmood arXiv preprint arXiv:2112.11622, 2021 | 1 | 2021 |
Beyond prioritized replay: Sampling states in model-based RL via simulated priorities J Mei, Y Pan, M White, A Farahmand, H Yao | 1 | 2020 |
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning X Zhao, Y Pan, C Xiao, S Chandar, J Rajendran arXiv preprint arXiv:2303.09032, 2023 | | 2023 |
The In-Sample Softmax for Offline Reinforcement Learning C Xiao, H Wang, Y Pan, A White, M White arXiv preprint arXiv:2302.14372, 2023 | | 2023 |
Label Alignment Regularization for Distribution Shift E Imani, G Zhang, J Luo, P Poupart, Y Pan arXiv preprint arXiv:2211.14960, 2022 | | 2022 |