Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.05166
Cited By
v1
v2
v3 (latest)
Object-Centric Representation Learning for Video Question Answering
12 April 2021
Long Hoang Dang
T. Le
Vuong Le
T. Tran
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Object-Centric Representation Learning for Video Question Answering"
23 / 23 papers shown
Title
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
306
266
0
09 Dec 2020
Dynamic Language Binding in Relational Visual Reasoning
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
NAI
51
19
0
30 Apr 2020
Hierarchical Conditional Relation Networks for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
83
260
0
25 Feb 2020
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
136
475
0
03 Oct 2019
Neural Reasoning, Fast and Slow, for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
36
14
0
10 Jul 2019
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
Chenyou Fan
Xiaofan Zhang
Shu Zhang
Wensheng Wang
Chi Zhang
Heng-Chiao Huang
60
279
0
08 Apr 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks
Peng Wang
Qi Wu
Jiewei Cao
Chunhua Shen
Lianli Gao
Anton Van Den Hengel
ObjD
93
255
0
12 Dec 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
126
241
0
29 Mar 2018
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
196
577
0
08 Mar 2018
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
65
33
0
29 Jan 2018
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
102
1,663
0
28 Nov 2017
Video Question Answering via Attribute-Augmented Attention Network Learning
Yunan Ye
Zhou Zhao
Yimeng Li
Long Chen
Jun Xiao
Yueting Zhuang
56
109
0
20 Jul 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks
Kyung-Min Kim
Min-Oh Heo
Seongho Choi
Byoung-Tak Zhang
88
175
0
04 Jul 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
79
325
0
04 May 2017
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Ronghang Hu
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Kate Saenko
KELM
GNN
ReLM
LRM
131
579
0
18 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
87
561
0
14 Apr 2017
Simple Online and Realtime Tracking with a Deep Association Metric
N. Wojke
Alex Bewley
Dietrich Paulus
VOT
400
3,546
0
21 Mar 2017
Towards Context-aware Interaction Recognition
Bohan Zhuang
Lingqiao Liu
Chunhua Shen
Ian Reid
HAI
60
143
0
18 Mar 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
319
2,391
0
20 Dec 2016
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf
Max Welling
GNN
SSL
679
29,183
0
09 Sep 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,510
0
10 Dec 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
533
62,409
0
04 Jun 2015
From Machine Learning to Machine Reasoning
Léon Bottou
LRM
ReLM
NAI
157
285
0
09 Feb 2011
1