Tree-structured policy based progressive reinforcement learning for temporally language grounding in video J Wu, G Li, S Liu, L Lin Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12386 …, 2020 | 89 | 2020 |
Fine-grained image captioning with global-local discriminative objective J Wu, T Chen, H Wu, Z Yang, G Luo, L Lin IEEE Transactions on Multimedia 23, 2413-2427, 2020 | 45 | 2020 |
Activation modulation and recalibration scheme for weakly supervised semantic segmentation J Qin, J Wu, X Xiao, L Li, X Wang Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2117-2125, 2022 | 42 | 2022 |
Weakly-supervised spatio-temporal anomaly detection in surveillance video J Wu, W Zhang, G Li, W Wu, X Tan, Y Li, E Ding, L Lin arXiv preprint arXiv:2108.03825, 2021 | 36 | 2021 |
Multi-granularity tracking with modularlized components for unsupervised vehicles anomaly detection Y Li, J Wu, X Bai, X Yang, X Tan, G Li, S Wen, H Zhang, E Ding Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 36 | 2020 |
Reinforcement learning for weakly supervised temporal grounding of natural language in untrimmed videos J Wu, G Li, X Han, L Lin Proceedings of the 28th ACM International Conference on Multimedia, 1283-1291, 2020 | 33 | 2020 |
Sepvit: Separable vision transformer W Li, X Wang, X Xia, J Wu, X Xiao, M Zheng, S Wen arXiv preprint arXiv:2203.15380, 2022 | 32 | 2022 |
Online multi-granularity distillation for gan compression Y Ren, J Wu, X Xiao, J Yang Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 28 | 2021 |
Cascade recurrent neural network for image caption generation J Wu, H Hu Electronics Letters 53 (25), 1642-1643, 2017 | 28 | 2017 |
Scalablevit: Rethinking the context-oriented generalization of vision transformer R Yang, H Ma, J Wu, Y Tang, X Xiao, M Zheng, X Li European Conference on Computer Vision, 480-496, 2022 | 26 | 2022 |
TRT-ViT: TensorRT-oriented vision transformer X Xia, J Li, J Wu, X Wang, X Xiao, M Zheng, R Wang arXiv preprint arXiv:2205.09579, 2022 | 18 | 2022 |
Revisiting discriminator in GAN compression: A generator-discriminator cooperative compression scheme S Li, J Wu, X Xiao, F Chao, X Mao, R Ji Advances in Neural Information Processing Systems 34, 28560-28572, 2021 | 17 | 2021 |
Image captioning via semantic guidance attention and consensus selection strategy J Wu, H Hu, Y Wu ACM Transactions on Multimedia Computing, Communications, and Applications …, 2018 | 13 | 2018 |
Box-level tube tracking and refinement for vehicles anomaly detection J Wu, X Wang, X Xiao, Y Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 10 | 2021 |
Pseudo-3D attention transfer network with content-aware strategy for image captioning J Wu, H Hu, L Yang ACM Transactions on Multimedia Computing, Communications, and Applications …, 2019 | 9 | 2019 |
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation J Qin, J Wu, P Yan, M Li, R Yuxi, X Xiao, Y Wang, R Wang, S Wen, X Pan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 6 | 2023 |
Global-local feature attention network with reranking strategy for image caption generation J Wu, S Xie, X Shi, Y Chen Computer Vision: Second CCF Chinese Conference, CCCV 2017, Tianjin, China …, 2017 | 6 | 2017 |
Concrete image captioning by integrating content sensitive and global discriminative objective J Wu, T Chen, H Wu, Z Yang, Q Wang, L Lin 2019 IEEE International Conference on Multimedia and Expo (ICME), 1306-1311, 2019 | 5 | 2019 |
Multi-granularity distillation scheme towards lightweight semi-supervised semantic segmentation J Qin, J Wu, M Li, X Xiao, M Zheng, X Wang European Conference on Computer Vision, 481-498, 2022 | 4 | 2022 |
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models W Chen, J Wu, P Xie, H Wu, J Li, X Xia, X Xiao, L Lin arXiv preprint arXiv:2305.13840, 2023 | 2 | 2023 |