Data Augmentation Using GANs for Speech Emotion Recognition. A Chatziagapi, G Paraskevopoulos, D Sgouropoulos, G Pantazopoulos, ... Interspeech, 171-175, 2019 | 149 | 2019 |
Combine to describe: Evaluating compositional generalization in image captioning G Pantazopoulos, A Suglia, A Eshghi Proceedings of the 60th Annual Meeting of the Association for Computational ¡K, 2022 | 12 | 2022 |
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion G Pantazopoulos, M Nikandrou, A Parekh, B Hemanthage, A Eshghi, ... arXiv preprint arXiv:2311.04067, 2023 | 5 | 2023 |
ViCA: Combining visual, social, and task-oriented conversational AI in a healthcare setting G Pantazopoulos, J Bruyere, M Nikandrou, T Boissier, S Hemanthage, ... Proceedings of the 2021 International Conference on Multimodal Interaction ¡K, 2021 | 5 | 2021 |
Using Oliver API for emotion-aware movie content characterization T Giannakopoulos, S Dimopoulos, G Pantazopoulos, A Chatziagapi, ... 2019 International Conference on Content-Based Multimedia Indexing (CBMI), 1-4, 2019 | 3 | 2019 |
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers G Pantazopoulos, A Suglia, O Lemon, A Eshghi arXiv preprint arXiv:2404.13594, 2024 | 2 | 2024 |
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks G Pantazopoulos, A Parekh, M Nikandrou, A Suglia arXiv preprint arXiv:2405.04403, 2024 | 1 | 2024 |
EMMA: A Foundation Model for Embodied, Interactive, Multimodal Task Completion in 3D Environments A Parekh, M Nikandrou, G Pantazopoulos, B Hemanthage, A Eshghi, ... | 1 | 2023 |
Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments A Suglia, B Hemanthage, M Nikandrou, G Pantazopoulos, A Parekh, ... Proceedings of the 23rd Annual Meeting of the Special Interest Group on ¡K, 2022 | 1 | 2022 |
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling G Pantazopoulos, M Nikandrou, A Suglia, O Lemon, A Eshghi arXiv preprint arXiv:2409.05395, 2024 | | 2024 |
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation M Nikandrou, G Pantazopoulos, I Konstas, A Suglia arXiv preprint arXiv:2406.19297, 2024 | | 2024 |