关注
Mihai Anca
Mihai Anca
在 bristol.ac.uk 的电子邮件经过验证
标题
引用次数
引用次数
年份
Twin delayed hierarchical actor-critic
M Anca, M Studley
2021 7th International Conference on Automation, Robotics and Applications …, 2021
62021
Multi-lingual agents through multi-headed neural networks
JD Thomas, R Santos-Rodriguez, R Piechocki, M Anca
arXiv preprint arXiv:2111.11129, 2021
32021
Achieving Goals using Reward Shaping and Curriculum Learning
M Anca, JD Thomas, D Pedamonti, M Hansen, M Studley
Proceedings of the Future Technologies Conference, 316-331, 2023
12023
Effects of reward shaping on curriculum learning in goal conditioned tasks
M Anca, M Studley, M Hansen, JD Thomas, D Pedamonti
arXiv preprint arXiv:2206.02462, 2022
12022
Learning Long Chain of Actions through Hierarchical Reinforcement Learning
M Anca, M Anca
distances 1, 1, 2024
2024
Modular Hierarchical Reinforcement Learning for Robotics: Improving Scalability and Generalizability
M Anca, MF Hansen, M Studley
ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–6