1804.00105
Visual Question Reasoning on General Dependency Tree
31 March 2018
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
Papers citing "Visual Question Reasoning on General Dependency Tree"
17 / 17 papers shown
A Concept-Centric Approach to Multi-Modality Learning
Yuchong Geng
Ao Tang
162
0
0
18 Dec 2024
Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering
Zhaohe Liao
Jiangtong Li
Li Niu
Liqing Zhang
CoGe
84
6
0
03 Jul 2024
Measuring Compositional Consistency for Video Question Answering
Mona Gandhi
Mustafa Omer Gul
Eva Prakash
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
87
16
0
14 Apr 2022
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
80
53
0
24 Mar 2021
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
47
3
0
23 Mar 2020
Multilayer Dense Connections for Hierarchical Concept Classification
T. Parag
Hongcheng Wang
31
1
0
19 Mar 2020
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
74
43
0
11 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
182
475
0
03 Oct 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
57
13
0
23 Sep 2019
Dynamic Graph Attention for Referring Expression Comprehension
Sibei Yang
Guanbin Li
Yizhou Yu
OCL
86
222
0
18 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
141
136
0
22 Jul 2019
Relationship-Embedded Representation Learning for Grounding Referring Expressions
Sibei Yang
Guanbin Li
Yizhou Yu
ObjD
95
55
0
11 Jun 2019
Deep Tree Learning for Zero-shot Face Anti-Spoofing
Yaojie Liu
J. Stehouwer
Amin Jourabloo
Xiaoming Liu
CVBM
93
240
0
05 Apr 2019
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
Chi Zhang
Feng Gao
Baoxiong Jia
Yixin Zhu
Song-Chun Zhu
AIMat
82
312
0
07 Mar 2019
Learning to Assemble Neural Module Tree Networks for Visual Grounding
Daqing Liu
Hanwang Zhang
Feng Wu
Zhengjun Zha
100
274
0
08 Dec 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
121
614
0
04 Oct 2018
Adaptive Temporal Encoding Network for Video Instance-level Human Parsing
Qixian Zhou
Xiaodan Liang
Ke Gong
Liang Lin
VOS
3DH
91
56
0
02 Aug 2018