Zheng Tian
Thinking fast and slow with deep learning and tree search
T Anthony, Z Tian, D Barber
Advances in Neural Information Processing Systems, 5360-5370, 2017
SMARTS: Scalable multi-agent reinforcement learning training school for autonomous driving
M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ...
arXiv preprint arXiv:2010.09776, 2020
A regularized opponent model with maximum entropy objective
Z Tian, Y Wen, Z Gong, F Punakkath, S Zou, J Wang
arXiv preprint arXiv:1905.08087, 2019
Learning to Communicate Implicitly by Actions
Z Tian, S Zou, I Davies, T Warr, L Wu, H Bou-Ammar, J Wang
AAAI, 7261-7268, 2020
Online double oracle
C Le Dinh, Y Yang, Z Tian, N Perez Nieves, O Slumbers, DH Mguni, H Bou-Ammar, ...
Multi-Agent Constrained Policy Optimisation
S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang
arXiv preprint arXiv:2110.02793, 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang
arXiv preprint arXiv:2106.06828, 2021
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving
M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ...
Conference on Robot Learning, 264-285, 2021
Opponent Modelling in Multi-Agent Systems
Z Tian
UCL (University College London), 2021
Learning to Safely Exploit a Non-Stationary Opponent
Z Tian, H Ren, Y Yang, Y Sun, Z Han, I Davies, J Wang
Online Double Oracle
C Le Dinh, Y Yang, Z Tian, N Perez Nieves, O Slumbers, DH Mguni, ...
arXiv preprint arXiv:2103.07780, 2021
Learning to Model Opponent Learning (Student Abstract)
I Davies, Z Tian, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (10), 13771 …, 2020
Joint Perception and Control as Inference with an Object-based Implementation
M Li, Z Tian, P Nashikkar, I Davies, Y Wen, J Wang
arXiv preprint arXiv:1903.01385, 2019