Exponentially weighted imitation learning for batched historical data Q Wang, J Xiong, L Han, H Liu, T Zhang Advances in Neural Information Processing Systems 31, 2018 | 111 | 2018 |
TStarBots: Defeating the cheating level builtin AI in StarCraft II in the full game P Sun, X Sun, L Han, J Xiong, Q Wang, B Li, Y Zheng, J Liu, Y Liu, H Liu, ... arXiv preprint arXiv:1809.07193, 2018 | 78 | 2018 |
Discerning tactical patterns for professional soccer teams: an enhanced topic model with applications Q Wang, H Zhu, W Hu, Z Shen, Y Yao Proceedings of the 21th ACM SIGKDD International Conference on Knowledge …, 2015 | 73 | 2015 |
Divergence-augmented policy optimization Q Wang, Y Li, J Xiong, T Zhang Advances in Neural Information Processing Systems 32, 2019 | 12 | 2019 |
Arena: a toolkit for multi-agent reinforcement learning Q Wang, J Xiong, L Han, M Fang, X Sun, Z Zheng, P Sun, Z Zhang arXiv preprint arXiv:1907.09467, 2019 | 5 | 2019 |