ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00468
  4. Cited By
VQA: Visual Question Answering

VQA: Visual Question Answering

3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
    CoGe
ArXivPDFHTML

Papers citing "VQA: Visual Question Answering"

50 / 2,890 papers shown
Title
Visual Fashion-Product Search at SK Planet
Visual Fashion-Product Search at SK Planet
Taewan Kim
Seyeong Kim
Sangil Na
Hayoon Kim
Moonki Kim
Beyeongki Jeon
11
6
0
26 Sep 2016
Image-embodied Knowledge Representation Learning
Image-embodied Knowledge Representation Learning
Ruobing Xie
Zhiyuan Liu
Huanbo Luan
Maosong Sun
122
211
0
22 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question
  Answering (FSVQA)
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
52
14
0
21 Sep 2016
Graph-Structured Representations for Visual Question Answering
Graph-Structured Representations for Visual Question Answering
Damien Teney
Lingqiao Liu
Anton Van Den Hengel
GNN
NAI
40
416
0
19 Sep 2016
The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to
  Foster Reproducible Research
The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research
Jurgen Leitner
Adam W. Tow
Jake E. Dean
Niko Sünderhauf
Joseph W. Durham
...
James Sergeant
Liao Wu
Fangyi Zhang
B. Upcroft
Peter Corke
18
78
0
17 Sep 2016
Towards Transparent AI Systems: Interpreting Visual Question Answering
  Models
Towards Transparent AI Systems: Interpreting Visual Question Answering Models
Yash Goyal
Akrit Mohapatra
Devi Parikh
Dhruv Batra
25
74
0
31 Aug 2016
Measuring Machine Intelligence Through Visual Question Answering
Measuring Machine Intelligence Through Visual Question Answering
C. L. Zitnick
Aishwarya Agrawal
Stanislaw Antol
Margaret Mitchell
Dhruv Batra
Devi Parikh
27
37
0
31 Aug 2016
Visual Question: Predicting If a Crowd Will Agree on the Answer
Visual Question: Predicting If a Crowd Will Agree on the Answer
Danna Gurari
Kristen Grauman
HAI
29
2
0
29 Aug 2016
Machine Comprehension Using Match-LSTM and Answer Pointer
Machine Comprehension Using Match-LSTM and Answer Pointer
Shuohang Wang
Jing Jiang
15
594
0
29 Aug 2016
Convolutional Network for Attribute-driven and Identity-preserving Human
  Face Generation
Convolutional Network for Attribute-driven and Identity-preserving Human Face Generation
Mu Li
W. Zuo
David C. Zhang
CVBM
9
51
0
23 Aug 2016
Solving Visual Madlibs with Multiple Cues
Solving Visual Madlibs with Multiple Cues
Tatiana Tommasi
Arun Mallya
Bryan A. Plummer
Svetlana Lazebnik
Alexander C. Berg
Tamara L. Berg
37
18
0
11 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for
  the Visual Madlibs Task
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
27
5
0
09 Aug 2016
Cognitive Science in the era of Artificial Intelligence: A roadmap for
  reverse-engineering the infant language-learner
Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner
Emmanuel Dupoux
29
152
0
29 Jul 2016
Visual Question Answering: A Survey of Methods and Datasets
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
35
413
0
20 Jul 2016
Annotation Methodologies for Vision and Language Dataset Creation
Annotation Methodologies for Vision and Language Dataset Creation
Gitit Kehat
James Pustejovsky
16
2
0
10 Jul 2016
Intra-layer Nonuniform Quantization for Deep Convolutional Neural
  Network
Intra-layer Nonuniform Quantization for Deep Convolutional Neural Network
Fangxuan Sun
Jun Lin
Zhongfeng Wang
MQ
15
3
0
10 Jul 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Çağlar Gülçehre
A. Chandar
Kyunghyun Cho
Yoshua Bengio
20
64
0
30 Jun 2016
Sort Story: Sorting Jumbled Images and Captions into Stories
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal
Arjun Chandrasekaran
Dhruv Batra
Devi Parikh
Joey Tianyi Zhou
22
60
0
23 Jun 2016
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation
  Tasks
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks
Jindrich Libovický
Jindřich Helcl
Marek Tlustý
Pavel Pecina
Ondrej Bojar
11
67
0
23 Jun 2016
VideoMCC: a New Benchmark for Video Comprehension
VideoMCC: a New Benchmark for Video Comprehension
Du Tran
Maksim Bolonkin
Manohar Paluri
Lorenzo Torresani
29
1
0
23 Jun 2016
Analyzing the Behavior of Visual Question Answering Models
Analyzing the Behavior of Visual Question Answering Models
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
38
309
0
23 Jun 2016
Semantic Parsing to Probabilistic Programs for Situated Question
  Answering
Semantic Parsing to Probabilistic Programs for Situated Question Answering
Jayant Krishnamurthy
Oyvind Tafjord
Aniruddha Kembhavi
34
24
0
22 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise
  Questions
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Joey Tianyi Zhou
Dhruv Batra
Devi Parikh
27
56
0
21 Jun 2016
DualNet: Domain-Invariant Network for Visual Question Answering
DualNet: Domain-Invariant Network for Visual Question Answering
Kuniaki Saito
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
37
58
0
20 Jun 2016
FVQA: Fact-based Visual Question Answering
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
39
454
0
17 Jun 2016
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Hyeonwoo Noh
Bohyung Han
32
71
0
12 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep
  Networks Look at the Same Regions?
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
32
465
0
11 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
167
1,465
0
06 Jun 2016
Multimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QA
Jin-Hwa Kim
Sang-Woo Lee
Donghyun Kwak
Min-Oh Heo
Jeonghee Kim
Jung-Woo Ha
Byoung-Tak Zhang
12
300
0
05 Jun 2016
Adversarial Feature Learning
Adversarial Feature Learning
Jiasen Lu
Philipp Krahenbuhl
Trevor Darrell
GAN
59
1,825
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
30
61
0
30 May 2016
HARRISON: A Benchmark on HAshtag Recommendation for Real-world Images in
  Social Networks
HARRISON: A Benchmark on HAshtag Recommendation for Real-world Images in Social Networks
Minseok Park
Hanxiang Li
Junmo Kim
3DV
VLM
16
26
0
17 May 2016
Review of state-of-the-arts in artificial intelligence with application
  to AI safety problem
Review of state-of-the-arts in artificial intelligence with application to AI safety problem
V. Shakirov
20
10
0
11 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
24
101
0
09 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
22
83
0
04 May 2016
Learning Models for Actions and Person-Object Interactions with Transfer
  to Question Answering
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering
Arun Mallya
Svetlana Lazebnik
39
119
0
16 Apr 2016
Visual Storytelling
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
22
464
0
13 Apr 2016
Counting Everyday Objects in Everyday Scenes
Counting Everyday Objects in Everyday Scenes
Prithvijit Chattopadhyay
Ramakrishna Vedantam
Ramprasaath R. Selvaraju
Dhruv Batra
Devi Parikh
22
156
0
12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual
  Recognition
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
22
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation &
  Prepositional Attachment Resolution in Captioned Scenes
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Gordon A. Christie
A. Laddha
Aishwarya Agrawal
Stanislaw Antol
Yash Goyal
K. Kochersberger
Dhruv Batra
28
30
0
07 Apr 2016
A Focused Dynamic Attention Model for Visual Question Answering
A Focused Dynamic Attention Model for Visual Question Answering
Ilija Ilievski
Shuicheng Yan
Jiashi Feng
28
122
0
06 Apr 2016
Deep Image Retrieval: Learning global representations for image search
Deep Image Retrieval: Learning global representations for image search
Albert Gordo
Jon Almazán
Jérôme Revaud
Diane Larlus
23
802
0
05 Apr 2016
Automatic Annotation of Structured Facts in Images
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
21
9
0
02 Apr 2016
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal
  Embeddings
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings
Spandana Gella
Mirella Lapata
Frank Keller
CoGe
27
52
0
30 Mar 2016
A Diagram Is Worth A Dozen Images
A Diagram Is Worth A Dozen Images
Aniruddha Kembhavi
M. Salvato
Eric Kolve
Minjoon Seo
Hannaneh Hajishirzi
Ali Farhadi
3DV
12
431
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
29
105
0
23 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and
  External Knowledge
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
27
360
0
09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
34
753
0
04 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense
  Image Annotations
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
99
5,663
0
23 Feb 2016
Previous
123...565758
Next