Follow
Zhizhou Ren
Title
Cited by
Cited by
Year
QPLEX: Duplex Dueling Multi-Agent Q-Learning
J Wang, Z Ren, T Liu, Y Yu, C Zhang
Ninth International Conference on Learning Representations (ICLR 2021), 2021
4012021
Exploration via Hindsight Goal Generation
Z Ren, K Dong, Y Zhou, Q Liu, J Peng
Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019), 2019
812019
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
J Wang, Z Ren, B Han, J Ye, C Zhang
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021
44*2021
Generalizable Episodic Memory for Deep Reinforcement Learning
H Hu, J Ye, G Zhu, Z Ren, C Zhang
Thirty-eighth International Conference on Machine Learning (ICML 2021), 2021
392021
Proximal Exploration for Model-guided Protein Sequence Design
Z Ren, J Li, F Ding, Y Zhou, J Ma, J Peng
Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022
292022
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Z Ren, R Guo, Y Zhou, J Peng
Tenth International Conference on Learning Representations (ICLR 2022 Spotlight), 2022
252022
Off-Policy Reinforcement Learning with Delayed Rewards
B Han, Z Ren, Z Wu, Y Zhou, J Peng
Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022
212022
On the Estimation Bias in Double Q-Learning
Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021
122021
Self-Organized Polynomial-Time Coordination Graphs
Q Yang, W Dong, Z Ren, J Wang, T Wang, C Zhang
Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022
102022
Object-Oriented Dynamics Learning through Multi-Level Abstraction
G Zhu, J Wang, Z Ren, Z Lin, C Zhang
Thirty-fourth AAAI Conference on Artificial Intelligence (AAAI 2020), 2020
102020
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Z Ren, A Liu, Y Liang, J Peng, J Ma
Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), 2022
72022
The system can't perform the operation now. Try again later.
Articles 1–11