ResearchTrend.AI
Visual Question Reasoning on General Dependency Tree

31 March 2018
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
    CoGe

Papers citing "Visual Question Reasoning on General Dependency Tree"

17 / 17 papers shown
A Concept-Centric Approach to Multi-Modality Learning
Yuchong Geng
Ao Tang
162
0
0
18 Dec 2024
Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering
Zhaohe Liao
Jiangtong Li
Li Niu
Liqing Zhang
CoGe
84
6
0
03 Jul 2024
Measuring Compositional Consistency for Video Question Answering
Mona Gandhi
Mustafa Omer Gul
Eva Prakash
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
87
16
0
14 Apr 2022
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
80
53
0
24 Mar 2021
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
47
3
0
23 Mar 2020
Multilayer Dense Connections for Hierarchical Concept Classification
T. Parag
Hongcheng Wang
31
1
0
19 Mar 2020
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
74
43
0
11 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
182
475
0
03 Oct 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
57
13
0
23 Sep 2019
Dynamic Graph Attention for Referring Expression Comprehension
Sibei Yang
Guanbin Li
Yizhou Yu
OCL
86
222
0
18 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
141
136
0
22 Jul 2019
Relationship-Embedded Representation Learning for Grounding Referring Expressions
Sibei Yang
Guanbin Li
Yizhou Yu
ObjD
95
55
0
11 Jun 2019
Deep Tree Learning for Zero-shot Face Anti-Spoofing
Yaojie Liu
J. Stehouwer
Amin Jourabloo
Xiaoming Liu
CVBM
93
240
0
05 Apr 2019
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
Chi Zhang
Feng Gao
Baoxiong Jia
Yixin Zhu
Song-Chun Zhu
AIMat
82
312
0
07 Mar 2019
Learning to Assemble Neural Module Tree Networks for Visual Grounding
Daqing Liu
Hanwang Zhang
Feng Wu
Zhengjun Zha
100
274
0
08 Dec 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
121
614
0
04 Oct 2018
Adaptive Temporal Encoding Network for Video Instance-level Human Parsing
Qixian Zhou
Xiaodan Liang
Ke Gong
Liang Lin
VOS 3DH
91
56
0
02 Aug 2018