ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00468
  4. Cited By
VQA: Visual Question Answering
v1v2v3v4v5v6v7 (latest)

VQA: Visual Question Answering

3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
    CoGe
ArXiv (abs)PDFHTML

Papers citing "VQA: Visual Question Answering"

50 / 2,957 papers shown
Title
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
87
71
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
128
117
0
11 Apr 2019
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic
  Representations
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations
Hao Wu
Jiayuan Mao
Yufeng Zhang
Yuning Jiang
Lei Li
Weiwei Sun
Wei-Ying Ma
33
8
0
11 Apr 2019
Detecting Cybersecurity Events from Noisy Short Text
Detecting Cybersecurity Events from Noisy Short Text
Semih Yagcioglu
M. S. Seyfioglu
Begum Citamak
Batuhan Bardak
Seren Guldamlasioglu
Azmi Yuksel
E. I. Tatli
36
20
0
10 Apr 2019
Heterogeneous Memory Enhanced Multimodal Attention Model for Video
  Question Answering
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
Chenyou Fan
Xiaofan Zhang
Shu Zhang
Wensheng Wang
Fangqiu Yi
Heng-Chiao Huang
75
279
0
08 Apr 2019
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Yuehua Wu
Lu Jiang
Yi Yang
LM&Ro
86
30
0
08 Apr 2019
Modularized Textual Grounding for Counterfactual Resilience
Modularized Textual Grounding for Counterfactual Resilience
Zhiyuan Fang
Shu Kong
Charless C. Fowlkes
Yezhou Yang
79
32
0
07 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for
  Video-and-Language Research
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
125
558
0
06 Apr 2019
Can You Explain That? Lucid Explanations Help Human-AI Collaborative
  Image Retrieval
Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval
Arijit Ray
Yi Yao
Rakesh Kumar
Ajay Divakaran
Giedrius Burachas
40
5
0
05 Apr 2019
What Object Should I Use? - Task Driven Object Detection
What Object Should I Use? - Task Driven Object Detection
Johann Sawatzky
Yaser Souri
C. Grund
Juergen Gall
ObjD
77
27
0
05 Apr 2019
Actively Seeking and Learning from Live Data
Actively Seeking and Learning from Live Data
Damien Teney
Anton Van Den Hengel
OOD
77
21
0
05 Apr 2019
VQD: Visual Query Detection in Natural Scenes
VQD: Visual Query Detection in Natural Scenes
Manoj Acharya
Karan Jariwala
Christopher Kanan
ObjD
60
18
0
04 Apr 2019
MMED: A Multi-domain and Multi-modality Event Dataset
MMED: A Multi-domain and Multi-modality Event Dataset
Zhenguo Yang
Zehang Lin
Min Cheng
Qing Li
Wenyin Liu
121
9
0
04 Apr 2019
Multi-Modal Generative Adversarial Network for Short Product Title
  Generation in Mobile E-Commerce
Multi-Modal Generative Adversarial Network for Short Product Title Generation in Mobile E-Commerce
Jianguo Zhang
Pengcheng Zou
Zhao Li
Yao Wan
Xiuming Pan
Yu Gong
Philip S. Yu
82
28
0
03 Apr 2019
Habitat: A Platform for Embodied AI Research
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
145
1,424
0
02 Apr 2019
Recent Advances in Natural Language Inference: A Survey of Benchmarks,
  Resources, and Approaches
Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches
Shane Storks
Qiaozi Gao
J. Chai
100
132
0
02 Apr 2019
Constructing Hierarchical Q&A Datasets for Video Story Understanding
Constructing Hierarchical Q&A Datasets for Video Story Understanding
Y. Heo
Kyoung-Woon On
Seong-Ho Choi
Jaeseo Lim
Jinah Kim
Jeh-Kwang Ryu
Byung-Chull Bae
Byoung-Tak Zhang
53
5
0
01 Apr 2019
Relation-Aware Graph Attention Network for Visual Question Answering
Relation-Aware Graph Attention Network for Visual Question Answering
Linjie Li
Zhe Gan
Yu Cheng
Jingjing Liu
GNN
196
347
0
29 Mar 2019
Information Maximizing Visual Question Generation
Information Maximizing Visual Question Generation
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
131
95
0
27 Mar 2019
Combination of Multiple Global Descriptors for Image Retrieval
Combination of Multiple Global Descriptors for Image Retrieval
HeeJae Jun
ByungSoo Ko
Youngjoon Kim
Insik Kim
Jongtack Kim
108
61
0
26 Mar 2019
A Survey of Code-switched Speech and Language Processing
A Survey of Code-switched Speech and Language Processing
Sunayana Sitaram
Khyathi Chandu
Sai Krishna Rallabandi
A. Black
77
135
0
25 Mar 2019
The Probabilistic Object Detection Challenge
The Probabilistic Object Detection Challenge
John Skinner
David Hall
Haoyang Zhang
Feras Dayoub
Niko Sünderhauf
AAML
36
9
0
19 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
81
52
0
18 Mar 2019
Visual Query Answering by Entity-Attribute Graph Matching and Reasoning
Visual Query Answering by Entity-Attribute Graph Matching and Reasoning
Peixi Xiong
Huayi Zhan
Xin Eric Wang
Baivab Sinha
Ying Nian Wu
41
16
0
16 Mar 2019
Visual Semantic Information Pursuit: A Survey
Visual Semantic Information Pursuit: A Survey
Daqi Liu
M. Bober
J. Kittler
75
32
0
13 Mar 2019
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual
  Dialog
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
100
87
0
07 Mar 2019
Learning to Speak and Act in a Fantasy Text Adventure Game
Learning to Speak and Act in a Fantasy Text Adventure Game
Jack Urbanek
Angela Fan
Siddharth Karamcheti
Saachi Jain
Samuel Humeau
Emily Dinan
Tim Rocktaschel
Douwe Kiela
Arthur Szlam
Jason Weston
LLMAG
93
207
0
07 Mar 2019
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
Chi Zhang
Feng Gao
Baoxiong Jia
Yixin Zhu
Song-Chun Zhu
AIMat
74
312
0
07 Mar 2019
Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language
  Navigation
Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation
Liyiming Ke
Xiujun Li
Yonatan Bisk
Ari Holtzman
Zhe Gan
Jingjing Liu
Jianfeng Gao
Yejin Choi
S. Srinivasa
100
169
0
06 Mar 2019
Answer Them All! Toward Universal Visual Question Answering Models
Answer Them All! Toward Universal Visual Question Answering Models
Robik Shrestha
Kushal Kafle
Christopher Kanan
88
83
0
01 Mar 2019
From Visual to Acoustic Question Answering
From Visual to Acoustic Question Answering
Jerome Abdelnour
G. Salvi
Jean Rouat
70
3
0
28 Feb 2019
A Framework for Decoding Event-Related Potentials from Text
A Framework for Decoding Event-Related Potentials from Text
Shaorong Yan
A. White
26
0
0
27 Feb 2019
Differentiable Scene Graphs
Differentiable Scene Graphs
Moshiko Raboh
Roei Herzig
Gal Chechik
Jonathan Berant
Amir Globerson
OCL
99
34
0
26 Feb 2019
Generative Visual Dialogue System via Adaptive Reasoning and Weighted
  Likelihood Estimation
Generative Visual Dialogue System via Adaptive Reasoning and Weighted Likelihood Estimation
Heming Zhang
Shalini Ghosh
Larry Heck
Stephen Walsh
Junting Zhang
Jie Zhang
C.-C. Jay Kuo
133
7
0
26 Feb 2019
Image-Question-Answer Synergistic Network for Visual Dialog
Image-Question-Answer Synergistic Network for Visual Dialog
Dalu Guo
Chang Xu
Dacheng Tao
63
74
0
26 Feb 2019
GQA: A New Dataset for Real-World Visual Reasoning and Compositional
  Question Answering
GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering
Drew A. Hudson
Christopher D. Manning
CoGeNAI
87
138
0
25 Feb 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
88
277
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
56
73
0
25 Feb 2019
Making History Matter: History-Advantage Sequence Training for Visual
  Dialog
Making History Matter: History-Advantage Sequence Training for Visual Dialog
Tianhao Yang
Zhengjun Zha
Hanwang Zhang
OffRL
79
8
0
25 Feb 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question
  Answering
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
Ramakrishna Vedantam
Karan Desai
Stefan Lee
Marcus Rohrbach
Dhruv Batra
Devi Parikh
NAIBDL
97
87
0
21 Feb 2019
Learning to Generalize from Sparse and Underspecified Rewards
Learning to Generalize from Sparse and Underspecified Rewards
Rishabh Agarwal
Chen Liang
Dale Schuurmans
Mohammad Norouzi
OffRL
134
96
0
19 Feb 2019
Generating Natural Language Explanations for Visual Question Answering
  using Scene Graphs and Visual Attention
Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention
Shalini Ghosh
Giedrius Burachas
Arijit Ray
Avi Ziskind
66
65
0
15 Feb 2019
Can We Automate Diagrammatic Reasoning?
Can We Automate Diagrammatic Reasoning?
Sk. Arif Ahmed
D. P. Dogra
S. Kar
P. Roy
D. Prasad
25
4
0
13 Feb 2019
Taking a HINT: Leveraging Explanations to Make Vision and Language
  Models More Grounded
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAttVLM
76
255
0
11 Feb 2019
EvalAI: Towards Better Evaluation Systems for AI Agents
EvalAI: Towards Better Evaluation Systems for AI Agents
Deshraj Yadav
Rishabh Jain
Harsh Agrawal
Prithvijit Chattopadhyay
Taranjeet Singh
Akash Jain
Shivkaran Singh
Stefan Lee
Dhruv Batra
ELM
70
57
0
10 Feb 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Zhe Gan
Yu Cheng
Ahmed El Kholy
Linjie Li
Jingjing Liu
Jianfeng Gao
111
105
0
01 Feb 2019
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and
  Visual Relationship Detection
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
H. Ben-younes
Rémi Cadène
Nicolas Thome
Matthieu Cord
64
218
0
31 Jan 2019
Effect of Various Regularizers on Model Complexities of Neural Networks
  in Presence of Input Noise
Effect of Various Regularizers on Model Complexities of Neural Networks in Presence of Input Noise
Mayank Sharma
Aayush Yadav
Sumit Soman
Jayadeva Jayadeva
25
1
0
31 Jan 2019
Higher-order Count Sketch: Dimensionality Reduction That Retains
  Efficient Tensor Operations
Higher-order Count Sketch: Dimensionality Reduction That Retains Efficient Tensor Operations
Yang Shi
Anima Anandkumar
110
13
0
31 Jan 2019
Audio-Visual Scene-Aware Dialog
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
61
195
0
25 Jan 2019
Previous
123...484950...585960
Next