ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00468
  4. Cited By
VQA: Visual Question Answering
v1v2v3v4v5v6v7 (latest)

VQA: Visual Question Answering

3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
    CoGe
ArXiv (abs)PDFHTML

Papers citing "VQA: Visual Question Answering"

50 / 2,957 papers shown
Title
Reciprocal Attention Fusion for Visual Question Answering
Reciprocal Attention Fusion for Visual Question Answering
M. Farazi
Salman H Khan
72
14
0
11 May 2018
LearningWord Embeddings for Low-resource Languages by PU Learning
LearningWord Embeddings for Low-resource Languages by PU Learning
Chao Jiang
Hsiang-Fu Yu
Cho-Jui Hsieh
Kai-Wei Chang
46
21
0
09 May 2018
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented
  Visual Dialog
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
Jiaping Zhang
Tiancheng Zhao
Zhou Yu
57
40
0
08 May 2018
Hypothesis Only Baselines in Natural Language Inference
Hypothesis Only Baselines in Natural Language Inference
Adam Poliak
Jason Naradowsky
Aparajita Haldar
Rachel Rudinger
Benjamin Van Durme
254
580
0
02 May 2018
Dialog-based Interactive Image Retrieval
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
144
209
0
01 May 2018
Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval
  using Text and Sketch
Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch
S. Dey
Anjan Dutta
Suman K. Ghosh
Ernest Valveny
Josep Lladós
Umapada Pal
116
24
0
28 Apr 2018
Reward Learning from Narrated Demonstrations
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&RoSSL
88
29
0
27 Apr 2018
Customized Image Narrative Generation via Interactive Visual Question
  Generation and Answering
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
69
7
0
27 Apr 2018
Interactive Language Acquisition with One-shot Visual Concept Learning
  through a Conversational Game
Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game
Haichao Zhang
Haonan Yu
Wenyuan Xu
LLMAG
79
8
0
26 Apr 2018
Large Scale Scene Text Verification with Guided Attention
Large Scale Scene Text Verification with Guided Attention
Dafang He
Yeqing Li
Alexander N. Gorban
Derrall Heath
Julian Ibarz
Qian Yu
Daniel Kifer
C. Lee Giles
3DV
36
0
0
23 Apr 2018
Multi-task Learning for Universal Sentence Embeddings: A Thorough
  Evaluation using Transfer and Auxiliary Tasks
Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks
Wasi Uddin Ahmad
Xueying Bai
Zhechao Huang
Chao Jiang
Nanyun Peng
Kai-Wei Chang
SSL
65
6
0
21 Apr 2018
Pathologies of Neural Models Make Interpretations Difficult
Pathologies of Neural Models Make Interpretations Difficult
Shi Feng
Eric Wallace
Alvin Grissom II
Mohit Iyyer
Pedro Rodriguez
Jordan L. Boyd-Graber
AAMLFAtt
106
322
0
20 Apr 2018
Object Ordering with Bidirectional Matchings for Visual Reasoning
Object Ordering with Bidirectional Matchings for Visual Reasoning
Hao Tan
Joey Tianyi Zhou
BDLCoGe
54
16
0
18 Apr 2018
NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets
  with Deep Attentive RNNs and Transfer Learning
NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning
Christos Baziotis
Athanasiou Nikolaos
Alexandra Chronopoulou
Athanasia Kolovou
Georgios Paraskevopoulos
Nikolaos Ellinas
Shrikanth Narayanan
Alexandros Potamianos
85
127
0
18 Apr 2018
Deep Multimodal Subspace Clustering Networks
Deep Multimodal Subspace Clustering Networks
Mahdi Abavisani
Vishal M. Patel
84
166
0
17 Apr 2018
Comparatives, Quantifiers, Proportions: A Multi-Task Model for the
  Learning of Quantities from Vision
Comparatives, Quantifiers, Proportions: A Multi-Task Model for the Learning of Quantities from Vision
Sandro Pezzelle
Ionut-Teodor Sorodoc
Raffaella Bernardi
40
9
0
13 Apr 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of
  Untranscribed Speech
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
87
58
0
09 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
145
1,040
0
08 Apr 2018
Compositional Obverter Communication Learning From Raw Visual Input
Compositional Obverter Communication Learning From Raw Visual Input
Edward Choi
Angeliki Lazaridou
Nando de Freitas
148
76
0
06 Apr 2018
Question Type Guided Attention in Visual Question Answering
Question Type Guided Attention in Visual Question Answering
Yang Shi
Tommaso Furlanello
Sheng Zha
Anima Anandkumar
72
46
0
06 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory
  Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
123
201
0
04 Apr 2018
Improved Fusion of Visual and Language Representations by Dense
  Symmetric Co-Attention for Visual Question Answering
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
84
282
0
03 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
107
221
0
01 Apr 2018
Differential Attention for Visual Question Answering
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
84
75
0
01 Apr 2018
Visual Question Reasoning on General Dependency Tree
Visual Question Reasoning on General Dependency Tree
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
83
37
0
31 Mar 2018
Guide Me: Interacting with Deep Networks
Guide Me: Interacting with Deep Networks
Christian Rupprecht
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
HAI
73
38
0
30 Mar 2018
Two can play this Game: Visual Dialog with Discriminative Question
  Generation and Answering
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
Alex Schwing
MLLM
76
82
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
151
241
0
29 Mar 2018
Neural Baby Talk
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
242
436
0
27 Mar 2018
Generalized Hadamard-Product Fusion Operators for Visual Question
  Answering
Generalized Hadamard-Product Fusion Operators for Visual Question Answering
Brendan Duke
Graham W. Taylor
40
8
0
26 Mar 2018
Scene Graph Parsing as Dependency Parsing
Scene Graph Parsing as Dependency Parsing
Yu-Siang Wang
Chenxi Liu
Fangyin Wei
Alan Yuille
GNN3DV
52
53
0
25 Mar 2018
Explicit Reasoning over End-to-End Neural Architectures for Visual
  Question Answering
Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering
Somak Aditya
Yezhou Yang
Chitta Baral
LRMNAIReLM
68
53
0
23 Mar 2018
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement
  Learning for Planned-Ahead Vision-and-Language Navigation
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
Xin Eric Wang
Wenhan Xiong
Hongmin Wang
William Yang Wang
85
202
0
21 Mar 2018
Attention on Attention: Architectures for Visual Question Answering
  (VQA)
Attention on Attention: Architectures for Visual Question Answering (VQA)
Jasdeep Singh
Vincent Ying
Alex Nutkiewicz
60
26
0
21 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual
  Questions
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Shafiq Joty
Jianfei Cai
Jiebo Luo
107
109
0
20 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis
  Tool
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
71
29
0
16 Mar 2018
A Dataset and Architecture for Visual Reasoning with a Working Memory
A Dataset and Architecture for Visual Reasoning with a Working Memory
G. R. Yang
Igor Ganichev
Xiao-Jing Wang
Jonathon Shlens
David Sussillo
71
55
0
16 Mar 2018
Compositional Attention Networks for Machine Reasoning
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDLOODLRM
205
578
0
08 Mar 2018
Annotation Artifacts in Natural Language Inference Data
Annotation Artifacts in Natural Language Inference Data
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
225
1,181
0
06 Mar 2018
Totally Looks Like - How Humans Compare, Compared to Machines
Totally Looks Like - How Humans Compare, Compared to Machines
Amir Rosenfeld
M. Solbach
John K. Tsotsos
3DH
94
29
0
05 Mar 2018
Deep-neural-network based sinogram synthesis for sparse-view CT image
  reconstruction
Deep-neural-network based sinogram synthesis for sparse-view CT image reconstruction
Hoyeon Lee
Jongha Lee
Hyeongseok Kim
B. Cho
Seungryong Cho
74
228
0
02 Mar 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
171
864
0
22 Feb 2018
Multimodal Named Entity Recognition for Short Social Media Posts
Multimodal Named Entity Recognition for Short Social Media Posts
Seungwhan Moon
Leonardo Neves
Vitor R. Carvalho
84
155
0
22 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
73
20
0
19 Feb 2018
Learning to Count Objects in Natural Images for Visual Question
  Answering
Learning to Count Objects in Natural Images for Visual Question Answering
Yan Zhang
Jonathon S. Hare
Adam Prugel-Bennett
OOD
90
208
0
15 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
87
425
0
15 Feb 2018
Challenging Images For Minds and Machines
Challenging Images For Minds and Machines
Amir Rosenfeld
John K. Tsotsos
VLM
31
1
0
13 Feb 2018
Answerer in Questioner's Mind: Information Theoretic Approach to
  Goal-Oriented Visual Dialog
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog
Sang-Woo Lee
Y. Heo
Byoung-Tak Zhang
75
31
0
12 Feb 2018
FlipDial: A Generative Model for Two-Way Visual Dialogue
FlipDial: A Generative Model for Two-Way Visual Dialogue
Daniela Massiceti
N. Siddharth
P. Dokania
Philip Torr
MLLM
92
41
0
11 Feb 2018
Recent Advances in Neural Program Synthesis
Recent Advances in Neural Program Synthesis
Neel Kant
NAI
106
37
0
07 Feb 2018
Previous
123...525354...585960
Next