Cityflow: A multi-agent reinforcement learning environment for large scale city traffic scenario H Zhang, S Feng, C Liu, Y Ding, Y Zhu, Z Zhou, W Zhang, Y Yu, H Jin, Z Li The world wide web conference, 3620-3624, 2019 | 249 | 2019 |
Evolutionary population curriculum for scaling multi-agent reinforcement learning Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang arXiv preprint arXiv:2003.10423, 2020 | 104 | 2020 |
Continuously discovering novel strategies via reward-switching policy optimization Z Zhou, W Fu, B Zhang, Y Wu arXiv preprint arXiv:2204.02246, 2022 | 27 | 2022 |
Replan: Robotic replanning with perception and language models M Skreta, Z Zhou, JL Yuan, K Darvish, A Aspuru-Guzik, A Garg arXiv preprint arXiv:2401.04157, 2024 | 2 | 2024 |
Learning achievement structure for structured exploration in domains with sparse reward Z Zhou, A Garg arXiv preprint arXiv:2305.00508, 2023 | 2 | 2023 |
Temporal Induced Self-Play for Stochastic Bayesian Games W Chen, Z Zhou, Y Wu, F Fang arXiv preprint arXiv:2108.09444, 2021 | 2 | 2021 |
Image based review text generation with emotional guidance X Sun, Z Zhou, Y Fan arXiv preprint arXiv:1901.04140, 2019 | 1 | 2019 |
Approximated Temporal-Induced Neural Self-Play for Finitely Repeated Bayesian Games Z Zhou, ZR Shi, Y Wu, F Fang | | 2020 |