Hongyi Guo
Other names: 郭洪一
Verified email at u.northwestern.edu
Title
Cited by
Year
Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates
Y Liu, H Guo
International Conference on Machine Learning, 2020
Cited by: 214; Year: 2020
Provably efficient offline reinforcement learning for partially observable Markov decision processes
H Guo, Q Cai, Y Zhang, Z Yang, Z Wang
International Conference on Machine Learning (Spotlight), 8016-8038, 2022
Cited by: 15; Year: 2022
Policy learning using weak supervision
J Wang*, H Guo*, Z Zhu*, Y Liu
Advances in Neural Information Processing Systems 34, 19960-19973, 2021
Cited by: 10; Year: 2021
Reason for future, act for now: A principled framework for autonomous LLM agents with provable sample efficiency
Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang
arXiv preprint arXiv:2309.17382, 2023
Cited by: 8; Year: 2023
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games
H Guo, Z Fu, Z Yang, Z Wang
International Conference on Machine Learning (Spotlight), 3899-3909, 2021
Cited by: 8; Year: 2021
Signal instructed coordination in cooperative multi-agent reinforcement learning
L Chen, H Guo, Y Du, F Fang, H Zhang, W Zhang, Y Yu
Distributed Artificial Intelligence: Third International Conference, DAI …, 2022
Cited by: 5; Year: 2022
Human-instruction-free LLM self-alignment with limited samples
H Guo, Y Yao, W Shen, J Wei, X Zhang, Z Wang, Y Liu
arXiv preprint arXiv:2401.06785, 2024
Cited by: 4; Year: 2024
Behavior Contrastive Learning for Unsupervised Skill Discovery
R Yang, C Bai, H Guo, S Li, B Zhao, Z Wang, P Liu, X Li
International Conference on Machine Learning, 2023
Cited by: 2; Year: 2023
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
W Shen, X Zhang, Y Yao, R Zheng, H Guo, Y Liu
arXiv preprint arXiv:2403.07708, 2024
Cited by: 1; Year: 2024
Can Large Language Models Play Games? A Case Study of A Self-Play Approach
H Guo, Z Liu, Y Zhang, Z Wang
arXiv preprint arXiv:2403.05632, 2024
Cited by: 1; Year: 2024
Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
J Wei, Y Yao, JF Ton, H Guo, A Estornell, Y Liu
arXiv preprint arXiv:2402.10412, 2024
Cited by: 1; Year: 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
X Yu, C Bai, H Guo, C Wang, Z Wang
arXiv preprint arXiv:2404.06188, 2024
Year: 2024
Lightweight Uncertainty for Offline Reinforcement Learning via Bayesian Posterior
X Yu, C Bai, H Guo, L Wang, C Wang, Z Wang, Z Wang
Year: 2022