Retrieval-augmented transformer for image captioning S Sarto, M Cornia, L Baraldi, R Cucchiara International Conference on Content-based Multimedia Indexing, 1-7, 2022 | 33 | 2022 |
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation S Sarto, M Barraco, M Cornia, L Baraldi, R Cucchiara IEEE/CVF Conference on Computer Vision and Pattern Recognition (Highlight Paper), 2023 | 23 | 2023 |
The (R)Evolution of Multimodal Large Language Models: A Survey D Caffagni, F Cocchi, L Barsellotti, N Moratelli, S Sarto, L Baraldi, ... arXiv preprint arXiv:2402.12451, 2024 | 4 | 2024 |
Multi-class explainable unlearning for image classification via weight filtering S Poppi, S Sarto, M Cornia, L Baraldi, R Cucchiara arXiv preprint arXiv:2304.02049, 2023 | 3 | 2023 |
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning M Barraco, S Sarto, M Cornia, L Baraldi, R Cucchiara Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 3 | 2023 |
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs D Caffagni, F Cocchi, N Moratelli, S Sarto, M Cornia, L Baraldi, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 | 1 | 2024 |
Video Surveillance and Privacy: A Solvable Paradox? R Cucchiara, L Baraldi, M Cornia, S Sarto Computer 57 (3), 91-100, 2024 | | 2024 |
Towards Retrieval-Augmented Architectures for Image Captioning S Sarto, M Cornia, L Baraldi, A Nicolosi, R Cucchiara ACM Transactions on Multimedia Computing, Communications and Applications, 2024 | | 2024 |
Transformer combined with retrieval techniques for image caption generation (original title: Transformer combinato con tecniche di retrieval per generazione di didascalie di immagini) S Sarto | | 2022 |