Tianpei Yang
Title
Cited by
Year
Exploration in deep reinforcement learning: From single-agent to multiagent domain
J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2023
Cited by 225*, 2023
From few to more: Large-scale dynamic multiagent curriculum learning
W Wang*, T Yang*, Y Liu*, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7293-7300, 2020
Cited by 121, 2020
A survey on interpretable reinforcement learning
C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu
Machine Learning, 1-44, 2024
Cited by 95, 2024
A deep bayesian policy reuse approach against non-stationary agents
Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan
Advances in neural information processing systems 31, 2018
Cited by 92, 2018
Towards efficient detection and optimal response against sophisticated opponents
T Yang, Z Meng, J Hao, C Zhang, Y Zheng, Z Zheng
Proceedings of the 28th International Joint Conference on Artificial …, 2018
Cited by 48, 2018
An efficient transfer learning framework for multiagent reinforcement learning
T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...
Advances in neural information processing systems 34, 17037-17048, 2021
Cited by 38*, 2021
Efficient deep reinforcement learning via adaptive policy transfer
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Cheng, C Fan, W Wang, W Liu, ...
Proceedings of the Twenty-Ninth International Joint Conference on Artificial …, 2020
Cited by 38, 2020
Action semantics network: Considering the effects of actions in multiagent systems
W Wang*, T Yang*, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
Proceedings of the 8th International Conference on Learning Representations, 2019
Cited by 38, 2019
Human-in-the-loop reinforcement learning: A survey and position on requirements, challenges, and opportunities
CO Retzlaff, S Das, C Wayllace, P Mousavi, M Afshari, T Yang, A Saranti, ...
Journal of Artificial Intelligence Research 79, 359-415, 2024
Cited by 36, 2024
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
P Li, H Tang, T Yang, X Hao, T Sang, Y Zheng, J Hao, ME Taylor, Z Wang
International Conference on Machine Learning 162, 12979-12997, 2022
Cited by 32, 2022
Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
C Zhang, Y Tian, Z Zhang, W Xue, X Xie, T Yang, X Ge, R Chen
IEEE Transactions on Intelligent Transportation Systems 23 (12), 25157-25168, 2022
Cited by 25, 2022
Learning action-transferable policy with action embedding
Y Chen, Y Chen, Z Hu, T Yang, C Fan, Y Yu, J Hao
arXiv preprint arXiv:1909.02291, 2019
Cited by 19, 2019
Efficient policy detecting and reusing for non-stationarity in markov games
Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan
Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021
Cited by 18, 2021
Accelerating Norm Emergence Through Hierarchical Heuristic Learning.
T Yang, Z Meng, J Hao, S Sen, C Yu
Proceedings of 22nd European Conference on Artificial Intelligence (ECAI …, 2016
Cited by 17, 2016
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
Y Cao, Z Li, T Yang, H Zhang, Y Zheng, Y Li, J Hao, Y Liu
Advances in Neural Information Processing Systems 35, 19930-19943, 2022
Cited by 16, 2022
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
T Yang, J Hao, Z Meng, Y Zheng, C Zhang, Z Zheng
AAMAS, 2282-2284, 2019
Cited by 15, 2019
Cross-domain Adaptive Transfer Reinforcement Learning Based on State-Action Correspondence
H You, T Yang, Y Zheng, J Hao, ME Taylor
The 38th Conference on Uncertainty in Artificial Intelligence, 2022
Cited by 13, 2022
Efficient Deep Reinforcement Learning through Policy Transfer.
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, Z Wang, ...
AAMAS, 2053-2055, 2020
Cited by 13, 2020
Advertising impression resource allocation strategy with multi-level budget constraint dqn in real-time bidding
C Zhang, K Zheng, Y Tian, W Xue, T Yang, D An, Y Pi, R Chen
Neurocomputing 488, 647-656, 2022
Cited by 10, 2022
Learning to shape rewards using a game of two partners
D Mguni, T Jafferjee, J Wang, N Perez-Nieves, W Song, F Tong, M Taylor, ...
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11604 …, 2023
Cited by 6, 2023
Articles 1–20