Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Y Zhai, H Bai, Z Lin, J Pan, S Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ... Preprint, 2024 | 5 | 2024 |
CharmBana: Progressive Responses with Real-Time Internet Search for Knowledge-Powered Conversations RG Reddy, S Suresh, H Bai, W Yao, MS Sidhu, K Aggarwal, P Sonawane, ... WSDM'24, 2024 | 3* | 2024 |
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? Y Yu, S Buchanan, D Pai, T Chu, Z Wu, S Tong, H Bai, Y Zhai, ... JMLR (Treatise), 2023 | 2 | 2023 |
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning H Bai, Y Zhou, M Cemri, J Pan, A Suhr, S Levine, A Kumar Preprint, 2024 | 1 | 2024 |
Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations RG Reddy, H Bai, W Yao, SCE Suresh, H Ji, CX Zhai EMNLP'23, 2023 | | 2023 |