关注
Xidong Feng
标题
引用次数
引用次数
年份
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ...
NeurIPS 2022, 2022
662022
Vehicle trajectory prediction using intention-based conditional variational autoencoder
X Feng, Z Cen, J Hu, Y Zhang
2019 IEEE Intelligent Transportation Systems Conference (ITSC), 3514-3519, 2019
492019
Neural Auto-Curricula
X Feng*, O Slumbers*, Y Yang, Z Wan, B Liu, S McAleer, Y Wen, J Wang
NeurIPS 2021, 2021
46*2021
Towards effective context for meta-reinforcement learning: an approach based on contrastive learning
H Fu, H Tang, J Hao, C Chen, X Feng, D Li, W Liu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7457-7465, 2021
462021
Mri reconstruction with interpretable pixel-wise operations using reinforcement learning
W Li*, X Feng*, H An, XY Ng, YJ Zhang
Proceedings of the AAAI conference on artificial intelligence 34 (01), 792-799, 2020
282020
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative marl
JG Kuba, X Feng, S Ding, H Dong, J Wang, Y Yang
JMLR, 2022
25*2022
Cmml: Contextual modulation meta learning for cold-start recommendation
X Feng, C Chen, D Li, M Zhao, J Hao, J Wang
Proceedings of the 30th ACM International Conference on Information …, 2021
222021
Alphazero-like tree-search can guide large language model decoding and training
X Feng*, Z Wan*, M Wen, Y Wen, W Zhang, J Wang
ICML 2024, 2023
202023
ChessGPT: Bridging Policy Learning and Language Modeling
X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang
Advances in Neural Information Processing Systems 36, 2024
152024
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
X Feng*, B Liu*, J Ren, L Mai, R Zhu, J Wang, Y Yang
NeurIPS 2022, 2021
12*2021
Autonomous lane change decision making using different deep reinforcement learning methods
X Feng, J Hu, Y Huo, Y Zhang
CICTP 2019, 5563-5575, 2019
92019
Pangu-agent: A fine-tunable generalist agent with structured reasoning
F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ...
arXiv preprint arXiv:2312.14878, 2023
82023
Contextual Transformer for Offline Meta Reinforcement Learning
R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang
NeurIPS2022 FMDM workshop, 2022
7*2022
Torchopt: An efficient library for differentiable optimization
J Ren*, X Feng*, B Liu*, X Pan*, Y Fu, L Mai, Y Yang
JMLR Open Source Software, 2022
72022
MANSA: learning fast and slow in multi-agent systems
DH Mguni, H Chen, T Jafferjee, J Wang, L Yue, X Feng, SM Mcaleer, ...
International Conference on Machine Learning, 24631-24658, 2023
32023
Natural Language Reinforcement Learning
X Feng, Z Wan, M Yang, Z Wang, GA Koushiks, Y Du, Y Wen, J Wang
arXiv preprint arXiv:2402.07157, 2024
12024
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu, J He, PW Koh, B Hooi
arXiv preprint arXiv:2402.03271, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–17