Video as conditional graph hierarchy for multi-granular question answering J Xiao, A Yao, Z Liu, Y Li, W Ji, TS Chua Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 2804-2812, 2022 | 56 | 2022 |
Invariant grounding for video question answering Y Li, X Wang, J Xiao, W Ji, TS Chua Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 52 | 2022 |
Interventional video relation detection Y Li, X Yang, X Shang, TS Chua Proceedings of the 29th ACM International Conference on Multimedia, 4091-4099, 2021 | 47 | 2021 |
Video question answering: Datasets, algorithms and challenges Y Zhong, J Xiao, W Ji, Y Li, W Deng, TS Chua arXiv preprint arXiv:2203.01225, 2022 | 32 | 2022 |
Video visual relation detection via iterative inference X Shang, Y Li, J Xiao, W Ji, TS Chua Proceedings of the 29th ACM international conference on Multimedia, 3654-3663, 2021 | 24 | 2021 |
Equivariant and invariant grounding for video question answering Y Li, X Wang, J Xiao, TS Chua Proceedings of the 30th ACM International Conference on Multimedia, 4714-4722, 2022 | 13 | 2022 |
Vidvrd 2021: The third grand challenge on video relation detection W Ji, Y Li, M Wei, X Shang, J Xiao, T Ren, TS Chua Proceedings of the 29th ACM International Conference on Multimedia, 4779-4783, 2021 | 8 | 2021 |
Transformer-empowered invariant grounding for video question answering Y Li, X Wang, J Xiao, W Ji, TS Chua IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 2 | 2023 |
Redundancy-aware Transformer for Video Question Answering Y Li, X Yang, A Zhang, C Feng, X Wang, TS Chua arXiv preprint arXiv:2308.03267, 2023 | 1 | 2023 |
Discovering Spatio-Temporal Rationales for Video Question Answering Y Li, J Xiao, C Feng, X Wang, TS Chua Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 1 | 2023 |