Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan, Samuele Papa, Karl Henrik Johansson, Stefan Bauer, Andrea Dittadi
arXiv preprint arXiv:2407.15589, 2024