arXiv:2407.15589
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models

22 July 2024
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
    OCL

Papers citing "Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models"

50 / 87 papers shown
Title
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
984
2
0
09 Apr 2025
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Rabiul Awal
Maximilian Seitzer
E. Gavves
Aishwarya Agrawal
OCL, VLM
170
3
0
27 Mar 2025
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
Angel Villar-Corrales
Sven Behnke
200
4
0
11 Feb 2025
Zero-Shot Object-Centric Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Anirudh Goyal
Mike Mozer
Yoshua Bengio
Georg Martius
Maximilian Seitzer
VLM, OCL
72
7
0
17 Aug 2024
Learning to Compose: Improving Object Centric Learning by Injecting Compositionality
Whie Jung
Jaehoon Yoo
Sungjin Ahn
Seunghoon Hong
OCL, CoGe
64
5
0
01 May 2024
Explicitly Disentangled Representations in Object-Centric Learning
Riccardo Majellaro
Jonathan Collu
Aske Plaat
Thomas M. Moerland
CoGe, OOD, OCL
141
1
0
18 Jan 2024
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Bowen Wen
Wei Yang
Jan Kautz
Stanley T. Birchfield
71
209
0
13 Dec 2023
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single Images
Yafei Yang
Bo Yang
OCL
59
4
0
08 Dec 2023
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Yeongbin Kim
Gautam Singh
Junyeong Park
Çağlar Gülçehre
Sungjin Ahn
OCL, VLM
76
4
0
15 Nov 2023
Object-centric architectures enable efficient causal representation learning
Amin Mansouri
Jason S. Hartford
Yan Zhang
Yoshua Bengio
CML, OCL, OOD
77
18
0
29 Oct 2023
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Zhengyuan Yang
Linjie Li
Kevin Qinghong Lin
Jianfeng Wang
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LM&MA
65
643
0
29 Sep 2023
Vision Transformers Need Registers
Timothée Darcet
Maxime Oquab
Julien Mairal
Piotr Bojanowski
ViT
164
350
0
28 Sep 2023
Grounded Object Centric Learning
Avinash Kori
Francesco Locatello
Fabio De Sousa Ribeiro
Francesca Toni
Ben Glocker
OCL
68
11
0
18 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH, ALM
335
12,044
0
18 Jul 2023
DORSal: Diffusion for Object-centric Representations of Scenes et al
Allan Jabri
Sjoerd van Steenkiste
Emiel Hoogeboom
Mehdi S. M. Sajjadi
Thomas Kipf
60
16
0
13 Jun 2023
Systematic Visual Reasoning through Object-Centric Relational Abstraction
Taylor Webb
S. S. Mondal
Jonathan D. Cohen
OCL
81
26
0
04 Jun 2023
Rotating Features for Object Discovery
Sindy Löwe
Phillip Lippe
Francesco Locatello
Max Welling
OCL
93
27
0
01 Jun 2023
Provably Learning Object-Centric Representations
Jack Brady
Roland S. Zimmermann
Yash Sharma
Bernhard Schölkopf
Julius von Kügelgen
Wieland Brendel
OCL
64
36
0
23 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM, OCL
89
47
0
18 May 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM, CLIP, SSL
362
3,479
0
14 Apr 2023
STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training
Ziyan Huang
Hao Wang
Zhongying Deng
Jin Ye
Yanzhou Su
...
Junjun He
Yun Gu
Lixu Gu
Shaoting Zhang
Yu Qiao
62
81
0
13 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM, VLM
348
7,365
0
05 Apr 2023
Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning
Jinwoo Kim
Janghyuk Choi
Ho-Jin Choi
Seon Joo Kim
OCL, VLM
76
14
0
31 Mar 2023
Object-Centric Slot Diffusion
Jindong Jiang
Fei Deng
Gautam Singh
S. Ahn
DiffM, BDL, OCL
108
61
0
20 Mar 2023
PaLM-E: An Embodied Multimodal Language Model
Danny Driess
F. Xia
Mehdi S. M. Sajjadi
Corey Lynch
Aakanksha Chowdhery
...
Marc Toussaint
Klaus Greff
Andy Zeng
Igor Mordatch
Peter R. Florence
LM&Ro
114
1,663
0
06 Mar 2023
Learning to reason over visual objects
S. S. Mondal
Taylor Webb
Jonathan D. Cohen
OCL
96
29
0
03 Mar 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
90
37
0
09 Feb 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
Jaesik Yoon
Yi-Fu Wu
Heechul Bae
Sungjin Ahn
OCL
72
43
0
09 Feb 2023
Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning
Yuejiang Liu
Alexandre Alahi
Chris Russell
Max Horn
Dominik Zietlow
Bernhard Schölkopf
Francesco Locatello
CML
103
23
0
12 Jan 2023
Improving Object-centric Learning with Query Optimization
Baoxiong Jia
Yu Liu
Siyuan Huang
OCL
86
52
0
17 Oct 2022
SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models
Ziyi Wu
Nikita Dvornik
Klaus Greff
Thomas Kipf
Animesh Garg
OCL, BDL
127
95
0
12 Oct 2022
Bridging the Gap to Real-World Object-Centric Learning
Maximilian Seitzer
Max Horn
Andrii Zadaianchuk
Dominik Zietlow
Tianjun Xiao
...
Tong He
Zheng Zhang
Bernhard Schölkopf
Thomas Brox
Francesco Locatello
OCL
110
152
0
29 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
157
615
0
07 Sep 2022
SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos
Gamaleldin F. Elsayed
Aravindh Mahendran
Sjoerd van Steenkiste
Klaus Greff
Michael C. Mozer
Thomas Kipf
VOS, OCL
132
142
0
15 Jun 2022
Object Scene Representation Transformer
Mehdi S. M. Sajjadi
Daniel Duckworth
Aravindh Mahendran
Sjoerd van Steenkiste
Filip Pavetić
Mario Lučić
Leonidas Guibas
Klaus Greff
Thomas Kipf
ViT, OCL
79
94
0
14 Jun 2022
Unsupervised Image Representation Learning with Deep Latent Particles
Tal Daniel
Aviv Tamar
OCL, SSL
53
12
0
31 May 2022
Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
Gautam Singh
Yi-Fu Wu
Sungjin Ahn
OCL
112
120
0
27 May 2022
Inductive Biases for Object-Centric Representations in the Presence of Complex Textures
Samuele Papa
Ole Winther
Andrea Dittadi
OCL
105
14
0
18 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM, LRM
515
6,279
0
05 Apr 2022
Delving Deeper into Cross-lingual Visual Question Answering
Chen Cecilia Liu
Jonas Pfeiffer
Anna Korhonen
Ivan Vulić
Iryna Gurevych
80
8
0
15 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
466
15,734
0
20 Dec 2021
Conditional Object-Centric Learning from Video
Thomas Kipf
Gamaleldin F. Elsayed
Aravindh Mahendran
Austin Stone
S. Sabour
G. Heigold
Rico Jonschkowski
Alexey Dosovitskiy
Klaus Greff
OCL
105
218
0
24 Nov 2021
ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Laurynas Karazija
Iro Laina
Christian Rupprecht
3DV, VOS
127
90
0
19 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT, TPM
467
7,814
0
11 Nov 2021
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
Mingyu Ding
Zhenfang Chen
Tao Du
Ping Luo
J. Tenenbaum
Chuang Gan
VGen, PINN, OCL
90
75
0
28 Oct 2021
Illiterate DALL-E Learns to Compose
Gautam Singh
Fei Deng
Sungjin Ahn
CoGe, OCL
114
139
0
17 Oct 2021
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Yumao Lu
Zicheng Liu
Lijuan Wang
251
421
0
10 Sep 2021
Generative Video Transformer: Can Objects be the Words?
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
ViT
87
34
0
20 Jul 2021
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
59
28
0
19 Jul 2021
Generalization and Robustness Implications in Object-Centric Learning
Andrea Dittadi
Samuele Papa
Michele De Vita
Bernhard Schölkopf
Ole Winther
Francesco Locatello
OCL, OOD
75
76
0
01 Jul 2021