Gaze target estimation inspired by interactive attention Z Hu, K Zhao, B Zhou, H Guo, S Wu, Y Yang, J Liu IEEE Transactions on Circuits and Systems for Video Technology 32 (12), 8524 …, 2022 | 13 | 2022 |
Towards general computer control: A multimodal agent for Red Dead Redemption II as a case study W Tan, Z Ding, W Zhang, B Li, B Zhou, J Yue, H Xia, J Jiang, L Zheng, ... arXiv preprint arXiv:2403.03186, 2024 | 5 | 2024 |
UniCode: Learning a Unified Codebook for Multimodal Large Language Models S Zheng, B Zhou, Y Feng, Y Wang, Z Lu arXiv preprint arXiv:2403.09072, 2024 | 1 | 2024 |
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer B Zhou, K Li, J Jiang, Z Lu Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
GFIE: A dataset and baseline for gaze-following from 2D to 3D in indoor environments Z Hu, Y Yang, X Zhai, D Yang, B Zhou, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 1 | 2023 |