Parametrized deep q-networks learning: Reinforcement learning with discrete-continuous hybrid action space J Xiong, Q Wang, Z Yang, P Sun, L Han, Y Zheng, H Fu, T Zhang, J Liu, ... arXiv preprint arXiv:1810.06394, 2018 | 225 | 2018 |
Exponentially weighted imitation learning for batched historical data Q Wang, J Xiong, L Han, H Liu, T Zhang Advances in Neural Information Processing Systems 31, 2018 | 122 | 2018 |
Tstarbots: Defeating the cheating level builtin ai in starcraft ii in the full game P Sun, X Sun, L Han, J Xiong, Q Wang, B Li, Y Zheng, J Liu, Y Liu, H Liu, ... arXiv preprint arXiv:1809.07193, 2018 | 84 | 2018 |
Sparse recovery via differential inclusions S Osher, F Ruan, J Xiong, Y Yao, W Yin Applied and Computational Harmonic Analysis 41 (2), 436-469, 2016 | 77 | 2016 |
Grid-wise control for multi-agent reinforcement learning in video game ai L Han, P Sun, Y Du, J Xiong, Q Wang, X Sun, H Liu, T Zhang International Conference on Machine Learning, 2576-2585, 2019 | 64 | 2019 |
Robust subjective visual property prediction from crowdsourced pairwise labels Y Fu, TM Hospedales, T Xiang, J Xiong, S Gong, Y Wang, Y Yao IEEE transactions on pattern analysis and machine intelligence 38 (3), 563-577, 2015 | 63 | 2015 |
Robust evaluation for quality of experience in crowdsourcing Q Xu, J Xiong, Q Huang, Y Yao Proceedings of the 21st ACM international conference on Multimedia, 43-52, 2013 | 37 | 2013 |
Tstarbot-x: An open-sourced and comprehensive study for efficient league training in starcraft ii full game L Han, J Xiong, P Sun, X Sun, M Fang, Q Guo, Q Chen, T Shi, H Yu, X Wu, ... arXiv preprint arXiv:2011.13729, 2020 | 33 | 2020 |
Stochastic non-convex ordinal embedding with stabilized barzilai-borwein step size K Ma, J Zeng, J Xiong, Q Xu, X Cao, W Liu, Y Yao Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 25 | 2018 |
Split LBI: An iterative regularization path with structural sparsity C Huang, X Sun, J Xiong, Y Yao Advances In Neural Information Processing Systems 29, 2016 | 23 | 2016 |
Tleague: A framework for competitive self-play based distributed multi-agent reinforcement learning P Sun, J Xiong, L Han, X Sun, S Li, J Xu, M Fang, Z Zhang arXiv preprint arXiv:2011.12895, 2020 | 20 | 2020 |
Online HodgeRank on random graphs for crowdsourceable QoE evaluation Q Xu, J Xiong, Q Huang, Y Yao IEEE Transactions on Multimedia 16 (2), 373-386, 2013 | 20 | 2013 |
Exploring outliers in crowdsourced ranking for qoe Q Xu, M Yan, C Huang, J Xiong, Q Huang, Y Yao Proceedings of the 25th ACM international conference on Multimedia, 1540-1548, 2017 | 19 | 2017 |
Boosting with structural sparsity: A differential inclusion approach C Huang, X Sun, J Xiong, Y Yao Applied and Computational Harmonic Analysis 48 (1), 1-45, 2020 | 14 | 2020 |
Hodgerank with information maximization for crowdsourced pairwise ranking aggregation Q Xu, J Xiong, X Chen, Q Huang, Y Yao Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 14 | 2018 |
Zeroth-order supervised policy improvement H Sun, Z Xu, Y Song, M Fang, J Xiong, B Dai, B Zhou arXiv preprint arXiv:2006.06600, 2020 | 13 | 2020 |
From social to individuals: A parsimonious path of multi-level models for crowdsourced preference aggregation Q Xu, J Xiong, X Cao, Q Huang, Y Yao IEEE transactions on pattern analysis and machine intelligence 41 (4), 844-856, 2018 | 13 | 2018 |
Greedy when sure and conservative when uncertain about the opponents H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ... International Conference on Machine Learning, 6829-6848, 2022 | 12 | 2022 |
Divergence-augmented policy optimization Q Wang, Y Li, J Xiong, T Zhang Advances in Neural Information Processing Systems 32, 2019 | 12 | 2019 |
Analysis of crowdsourced sampling strategies for hodgerank with sparse random graphs B Osting, J Xiong, Q Xu, Y Yao Applied and Computational Harmonic Analysis 41 (2), 540-560, 2016 | 12 | 2016 |