关注
Jiechao Xiong
Jiechao Xiong
Tencent AI Lab
在 tencent.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Parametrized deep q-networks learning: Reinforcement learning with discrete-continuous hybrid action space
J Xiong, Q Wang, Z Yang, P Sun, L Han, Y Zheng, H Fu, T Zhang, J Liu, ...
arXiv preprint arXiv:1810.06394, 2018
2252018
Exponentially weighted imitation learning for batched historical data
Q Wang, J Xiong, L Han, H Liu, T Zhang
Advances in Neural Information Processing Systems 31, 2018
1222018
Tstarbots: Defeating the cheating level builtin ai in starcraft ii in the full game
P Sun, X Sun, L Han, J Xiong, Q Wang, B Li, Y Zheng, J Liu, Y Liu, H Liu, ...
arXiv preprint arXiv:1809.07193, 2018
842018
Sparse recovery via differential inclusions
S Osher, F Ruan, J Xiong, Y Yao, W Yin
Applied and Computational Harmonic Analysis 41 (2), 436-469, 2016
772016
Grid-wise control for multi-agent reinforcement learning in video game ai
L Han, P Sun, Y Du, J Xiong, Q Wang, X Sun, H Liu, T Zhang
International Conference on Machine Learning, 2576-2585, 2019
642019
Robust subjective visual property prediction from crowdsourced pairwise labels
Y Fu, TM Hospedales, T Xiang, J Xiong, S Gong, Y Wang, Y Yao
IEEE transactions on pattern analysis and machine intelligence 38 (3), 563-577, 2015
632015
Robust evaluation for quality of experience in crowdsourcing
Q Xu, J Xiong, Q Huang, Y Yao
Proceedings of the 21st ACM international conference on Multimedia, 43-52, 2013
372013
Tstarbot-x: An open-sourced and comprehensive study for efficient league training in starcraft ii full game
L Han, J Xiong, P Sun, X Sun, M Fang, Q Guo, Q Chen, T Shi, H Yu, X Wu, ...
arXiv preprint arXiv:2011.13729, 2020
332020
Stochastic non-convex ordinal embedding with stabilized barzilai-borwein step size
K Ma, J Zeng, J Xiong, Q Xu, X Cao, W Liu, Y Yao
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
252018
Split LBI: An iterative regularization path with structural sparsity
C Huang, X Sun, J Xiong, Y Yao
Advances In Neural Information Processing Systems 29, 2016
232016
Tleague: A framework for competitive self-play based distributed multi-agent reinforcement learning
P Sun, J Xiong, L Han, X Sun, S Li, J Xu, M Fang, Z Zhang
arXiv preprint arXiv:2011.12895, 2020
202020
Online HodgeRank on random graphs for crowdsourceable QoE evaluation
Q Xu, J Xiong, Q Huang, Y Yao
IEEE Transactions on Multimedia 16 (2), 373-386, 2013
202013
Exploring outliers in crowdsourced ranking for qoe
Q Xu, M Yan, C Huang, J Xiong, Q Huang, Y Yao
Proceedings of the 25th ACM international conference on Multimedia, 1540-1548, 2017
192017
Boosting with structural sparsity: A differential inclusion approach
C Huang, X Sun, J Xiong, Y Yao
Applied and Computational Harmonic Analysis 48 (1), 1-45, 2020
142020
Hodgerank with information maximization for crowdsourced pairwise ranking aggregation
Q Xu, J Xiong, X Chen, Q Huang, Y Yao
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
142018
Zeroth-order supervised policy improvement
H Sun, Z Xu, Y Song, M Fang, J Xiong, B Dai, B Zhou
arXiv preprint arXiv:2006.06600, 2020
132020
From social to individuals: A parsimonious path of multi-level models for crowdsourced preference aggregation
Q Xu, J Xiong, X Cao, Q Huang, Y Yao
IEEE transactions on pattern analysis and machine intelligence 41 (4), 844-856, 2018
132018
Greedy when sure and conservative when uncertain about the opponents
H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ...
International Conference on Machine Learning, 6829-6848, 2022
122022
Divergence-augmented policy optimization
Q Wang, Y Li, J Xiong, T Zhang
Advances in Neural Information Processing Systems 32, 2019
122019
Analysis of crowdsourced sampling strategies for hodgerank with sparse random graphs
B Osting, J Xiong, Q Xu, Y Yao
Applied and Computational Harmonic Analysis 41 (2), 540-560, 2016
122016
系统目前无法执行此操作,请稍后再试。
文章 1–20