Query-adaptive video summarization via quality-aware relevance estimation AB Vasudevan, M Gygli, A Volokitin, L Van Gool Proceedings of the 25th ACM international conference on Multimedia, 582-590, 2017 | 107 | 2017 |
Object referring in videos with language and human gaze AB Vasudevan, D Dai, L Van Gool Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 78 | 2018 |
Talk2nav: Long-range vision-and-language navigation with dual attention and spatial memory AB Vasudevan, D Dai, L Van Gool International Journal of Computer Vision 129, 246-266, 2021 | 61 | 2021 |
Semantic object prediction and spatial sound super-resolution with binaural sounds AB Vasudevan, D Dai, L Van Gool European conference on computer vision, 638-655, 2020 | 50 | 2020 |
Object referring in visual scene with spoken language AB Vasudevan, D Dai, L Van Gool 2018 IEEE winter conference on applications of computer vision (WACV), 1861-1870, 2018 | 22 | 2018 |
Dynamic scene classification using spatial and temporal cues A Vasudevan, S Muralidharan, S Chintapalli, S Raman Proceedings of the IEEE International Conference on Computer Vision …, 2013 | 22 | 2013 |
Binaural soundnet: predicting semantics, depth and motion with binaural sounds D Dai, AB Vasudevan, J Matas, L Van Gool IEEE transactions on pattern analysis and machine intelligence 45 (1), 123-136, 2022 | 11 | 2022 |
Sound and visual representation learning with multiple pretraining tasks AB Vasudevan, D Dai, L Van Gool Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 8 | 2022 |
ETH-CVL@ MediaEval 2016: Textual-Visual Embeddings and Video2GIF for Video Interestingness. AB Vasudevan, M Gygli, A Volokitin, L Van Gool MediaEval, 2016 | 5 | 2016 |
The un-kidnappable robot: Acoustic localization of sneaking people M Yang, P Grady, S Brahmbhatt, AB Vasudevan, CC Kemp, J Hays 2024 IEEE International Conference on Robotics and Automation (ICRA), 985-992, 2024 | 1 | 2024 |
Deep Visual Semantic Embedding for Video Thumbnail Selection AB Vasudevan Master’s thesis, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 2016 | 1 | 2016 |
A novel approach to the extraction of multiple salient objects in an image S Muralidharan, AB Vasudevan, CS Pratheek, S Raman 2015 IEEE International Conference on Signal Processing, Informatics …, 2015 | 1 | 2015 |
Motion characterization of a dynamic scene AB Vasudevan, S Muralidharan, SP Chintapalli, S Raman 2014 International Conference on Computer Vision Theory and Applications …, 2014 | 1 | 2014 |
LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with Class Taxonomies J Shi, G Gare, J Tian, S Chai, Z Lin, A Vasudevan, D Feng, F Ferroni, ... arXiv preprint arXiv:2407.16067, 2024 | | 2024 |
Planning with Adaptive World Models for Autonomous Driving AB Vasudevan, N Peri, J Schneider, D Ramanan arXiv preprint arXiv:2406.10714, 2024 | | 2024 |
A method for training a neural network to describe an environment on the basis of an audio signal, and the corresponding neural network W Abbeloos, AB VASUDEVAN, DAI Dengxin, L Van Gool US Patent App. 17/792,073, 2023 | | 2023 |
Sound and Visual Representation Learning with Multiple Pretraining Tasks A Balajee Vasudevan, D Dai, L Van Gool arXiv e-prints, arXiv: 2201.01046, 2022 | | 2022 |
Multimodal Semantic Understanding and Navigation in Outdoor Scenes AB Vasudevan ETH Zurich, 2021 | | 2021 |
Planning with an Ensemble of World Models AB Vasudevan, N Peri, D Ramanan | | |
Supplementary Material: Sound and Visual Representation Learning with Multiple Pretraining Tasks AB Vasudevan, D Dai, L Van Gool | | |