关注
Xiao Hu
Xiao Hu
在 mails.tsinghua.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Fault diagnosis using novel AdaBoost based discriminant locality preserving projection with resamples
YL He, Y Zhao, X Hu, XN Yan, QX Zhu, Y Xu
Engineering Applications of Artificial Intelligence 91, 103631, 2020
562020
Mind the gap: Offline policy optimization for imperfect rewards
J Li*, X Hu*, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2023, 2023
152023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang
arXiv preprint arXiv:2305.15669, 2023
82023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
X Hu, J Li, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2024, 2023
32023
Novel L2-Discriminant Locality Preserving Projection Integrated with Adaboost and Its Application to Fault Diagnosis
X Hu, Y Zhao, Y Xu, YL He, QX Zhu
2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS …, 2020
22020
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
S Huang, Q Gallouédec, F Felten, A Raffin, RFJ Dossa, Y Zhao, ...
arXiv preprint arXiv:2402.03046, 2024
12024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
J Li, J Zheng, Y Zheng, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ...
arXiv preprint arXiv:2402.18137, 2024
2024
Vehicle Extreme Control based on Offline Reinforcement Leaning
S Zhao, J Li, X Hu, J Zhang, C He
2022 China Automation Congress (CAC), 4539-4543, 2022
2022
面向数据中心绿色可靠运行的强化学习方法
贾庆山, 唐静娴, 吴俊杰, 胡潇, 林依挺, 夏恒
智能科学与技术学报 2 (4), 341-347, 0
系统目前无法执行此操作,请稍后再试。
文章 1–9