关注
Haifeng Zhang
标题
引用次数
引用次数
年份
Learn to navigate: cooperative path planning for unmanned surface vehicles using deep reinforcement learning
X Zhou, P Wu, H Zhang, W Guo, Y Liu
Ieee Access 7, 165262-165278, 2019
1402019
Improving knowledge tracing via pre-training question embeddings
Y Liu, Y Yang, X Chen, J Shen, H Zhang, Y Yu
arXiv preprint arXiv:2012.05031, 2020
1342020
Bi-level actor-critic for multi-agent coordination
H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7325-7332, 2020
982020
Offline pre-trained multi-agent decision transformer
L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ...
Machine Intelligence Research 20 (2), 233-248, 2023
732023
Learning correlated communication topology in multi-agent reinforcement learning
Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
692021
Settling the variance of multi-agent policy gradients
JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang
Advances in Neural Information Processing Systems 34, 13458-13470, 2021
622021
User response learning for directly optimizing campaign performance in display advertising
K Ren, W Zhang, Y Rong, H Zhang, Y Yu, J Wang
Proceedings of the 25th acm international on conference on information and …, 2016
502016
GCS: Graph-based coordination strategy for multi-agent reinforcement learning
J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu
arXiv preprint arXiv:2201.06257, 2022
402022
Token-level Direct Preference Optimization
Y Zeng, G Liu, W Ma, N Yang, H Zhang, J Wang
arXiv preprint arXiv:2404.11999, 2024
312024
Large language models play starcraft ii: Benchmarks and a chain of summarization approach
W Ma, Q Mi, Y Zeng, X Yan, Y Wu, R Lin, H Zhang, J Wang
arXiv preprint arXiv:2312.11865, 2023
302023
Large sequence models for sequential decision-making: a survey
M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ...
Frontiers of Computer Science 17 (6), 176349, 2023
302023
Offline pre-trained multi-agent decision transformer: One big sequence model tackles all smac tasks
L Meng, M Wen, Y Yang, C Le, X Li, W Zhang, Y Wen, H Zhang, J Wang, ...
arXiv preprint arXiv:2112.02845, 2021
292021
Botzone: an online multi-agent competitive platform for ai education
H Zhou, H Zhang, Y Zhou, X Wang, W Li
Proceedings of the 23rd Annual ACM Conference on Innovation and Technology …, 2018
282018
A review: machine learning for combinatorial optimization problems in energy areas
X Yang, Z Wang, H Zhang, N Ma, N Yang, H Liu, H Zhang, L Yang
Algorithms 15 (6), 205, 2022
272022
Layout design for intelligent warehouse by evolution with fitness approximation
H Zhang, Z Guo, W Zhang, H Cai, C Wang, Y Yu, W Li, J Wang
IEEE Access 7, 166310-166317, 2019
222019
Learning to design games: Strategic environments in reinforcement learning
H Zhang, J Wang, Z Zhou, W Zhang, Y Wen, Y Yu, W Li
Proceedings of the 27th international joint conference on Artificial …, 2017
202017
A game-theoretic approach for improving generalization ability of TSP solvers
C Wang, Y Yang, O Slumbers, C Han, T Guo, H Zhang, J Wang
arXiv preprint arXiv:2110.15105, 2021
152021
Managing risk of bidding in display advertising
H Zhang, W Zhang, Y Rong, K Ren, W Li, J Wang
Proceedings of the Tenth ACM International Conference on Web Search and Data …, 2017
152017
Estimating -Rank from A Few Entries with Low Rank Matrix Completion
Y Du, X Yan, X Chen, J Wang, H Zhang
International Conference on Machine Learning, 2870-2879, 2021
122021
Contextual transformer for offline meta reinforcement learning
R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang
arXiv preprint arXiv:2211.08016, 2022
102022
系统目前无法执行此操作,请稍后再试。
文章 1–20