ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00468
  4. Cited By
VQA: Visual Question Answering
v1v2v3v4v5v6v7 (latest)

VQA: Visual Question Answering

3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
    CoGe
ArXiv (abs)PDFHTML

Papers citing "VQA: Visual Question Answering"

50 / 2,957 papers shown
Title
Small Sample Learning in Big Data Era
Small Sample Learning in Big Data Era
Jun Shu
Zongben Xu
Deyu Meng
108
72
0
14 Aug 2018
Large Graph Exploration via Subgraph Discovery and Decomposition
Large Graph Exploration via Subgraph Discovery and Decomposition
J. Abello
Fred Hohman
Varun Bezzam
Duen Horng Chau
18
2
0
13 Aug 2018
Multimodal Differential Network for Visual Question Generation
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
68
40
0
12 Aug 2018
Community Regularization of Visually-Grounded Dialog
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia Sycara
68
10
0
10 Aug 2018
Question-Guided Hybrid Convolution for Visual Question Answering
Question-Guided Hybrid Convolution for Visual Question Answering
Peng Gao
Pan Lu
Hongsheng Li
Shuang Li
Yikang Li
Guosheng Lin
Xiaogang Wang
152
69
0
08 Aug 2018
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Youngjae Yu
Jongseok Kim
Gunhee Kim
108
347
0
07 Aug 2018
Improving Deep Visual Representation for Person Re-identification by
  Global and Local Image-language Association
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association
Dapeng Chen
Hongsheng Li
Xihui Liu
Yantao Shen
Zejian Yuan
Xiaogang Wang
84
134
0
05 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
95
26
0
03 Aug 2018
Neural Arithmetic Logic Units
Neural Arithmetic Logic Units
Andrew Trask
Felix Hill
Scott E. Reed
Jack W. Rae
Chris Dyer
Phil Blunsom
NAI
96
206
0
01 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
92
96
0
01 Aug 2018
Graph R-CNN for Scene Graph Generation
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
145
845
0
01 Aug 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
104
203
0
26 Jul 2018
Coreset-Based Neural Network Compression
Coreset-Based Neural Network Compression
Abhimanyu Dubey
Moitreya Chatterjee
Narendra Ahuja
61
81
0
25 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRMOCL
108
199
0
23 Jul 2018
Question Relevance in Visual Question Answering
Question Relevance in Visual Question Answering
Prakruthi Prabhakar
Nitish Kulkarni
Linghao Zhang
38
6
0
23 Jul 2018
Revisiting Cross Modal Retrieval
Revisiting Cross Modal Retrieval
Shah Nawaz
Muhammad Kamran Janjua
Alessandro Calefati
I. Gallo
33
6
0
19 Jul 2018
Convolutional Neural Networks for Aerial Multi-Label Pedestrian
  Detection
Convolutional Neural Networks for Aerial Multi-Label Pedestrian Detection
Amir Soleimani
Nasser M. Nasrabadi
ObjD
42
17
0
16 Jul 2018
Object Relation Detection Based on One-shot Learning
Object Relation Detection Based on One-shot Learning
Li Zhou
Jian-jun Zhao
Jianshu Li
Li-xin Yuan
Jiashi Feng
ObjD
56
23
0
16 Jul 2018
Image Classification for Arabic: Assessing the Accuracy of Direct
  English to Arabic Translations
Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations
Abdulkareem Alsudais
VLM
50
4
0
13 Jul 2018
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Deepthi Karkada
R. Manuvinakurike
Kallirroi Georgila
40
0
0
11 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
100
124
0
09 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language
  queries
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
98
176
0
06 Jul 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal
  Attention-Based Video Features
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
Gordon Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
81
125
0
21 Jun 2018
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Jason Jo
Vikas Verma
Yoshua Bengio
OOD
49
8
0
18 Jun 2018
The Neural Painter: Multi-Turn Image Generation
The Neural Painter: Multi-Turn Image Generation
Ryan Y. Benmalek
Claire Cardie
Serge J. Belongie
Xiaodong He
Jianfeng Gao
MLLM
55
7
0
16 Jun 2018
Grounded Textual Entailment
Grounded Textual Entailment
H. Vu
Claudio Greco
A. Erofeeva
Somayeh Jafaritazehjan
Guido M. Linders
Marc Tanti
A. Testoni
Raffaella Bernardi
Albert Gatt
78
29
0
14 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
88
62
0
13 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
83
49
0
10 Jun 2018
Learning Answer Embeddings for Visual Question Answering
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
65
33
0
10 Jun 2018
CS-VQA: Visual Question Answering with Compressively Sensed Images
CS-VQA: Visual Question Answering with Compressively Sensed Images
Li-Chi Huang
K. Kulkarni
Anik Jha
Suhas Lohit
Suren Jayasuriya
Pavan Turaga
CoGe
34
8
0
08 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li Li
Alexander G. Hauptmann
68
112
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question
  Answering
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
25
0
0
03 Jun 2018
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7
Huda AlAmri
Vincent Cartillier
Raphael Gontijo-Lopes
Abhishek Das
Jue Wang
...
Dhruv Batra
Devi Parikh
A. Cherian
Tim K. Marks
Chiori Hori
60
33
0
01 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
124
93
0
01 Jun 2018
Explaining Explanations: An Overview of Interpretability of Machine
  Learning
Explaining Explanations: An Overview of Interpretability of Machine Learning
Leilani H. Gilpin
David Bau
Ben Z. Yuan
Ayesha Bajwa
Michael A. Specter
Lalana Kagal
XAI
129
1,873
0
31 May 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Volkan Cirik
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
81
63
0
30 May 2018
Multi-turn Dialogue Response Generation in an Adversarial Learning
  Framework
Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
O. Olabiyi
A. Salimov
Anish Khazane
Erik T. Mueller
GAN
80
32
0
30 May 2018
GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story
  Generation
GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story Generation
Taehyeong Kim
Min-Oh Heo
Seonil Son
Kyoung-Wha Park
Byoung-Tak Zhang
64
78
0
28 May 2018
Interactive Text2Pickup Network for Natural Language based Human-Robot
  Collaboration
Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration
Hyemin Ahn
Sungjoon Choi
Nuri Kim
Geonho Cha
Songhwai Oh
56
7
0
28 May 2018
Think Visually: Question Answering through Virtual Imagery
Think Visually: Question Answering through Virtual Imagery
Ankit Goyal
Jian Wang
Jia Deng
49
2
0
25 May 2018
Hyperbolic Attention Networks
Hyperbolic Attention Networks
Çağlar Gülçehre
Misha Denil
Mateusz Malinowski
Ali Razavi
Razvan Pascanu
...
Peter W. Battaglia
V. Bapst
David Raposo
Adam Santoro
Nando de Freitas
187
224
0
24 May 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual
  Question Answering
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
61
79
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat3DVOffRL
92
211
0
24 May 2018
Joint Image Captioning and Question Answering
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
57
13
0
22 May 2018
Guided Feature Transformation (GFT): A Neural Language Grounding Module
  for Embodied Agents
Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents
Haonan Yu
Xiaochen Lian
Haichao Zhang
Wenyuan Xu
LM&Ro
58
21
0
22 May 2018
Bilinear Attention Networks
Bilinear Attention Networks
Jin-Hwa Kim
Jaehyun Jun
Byoung-Tak Zhang
AIMat
139
880
0
21 May 2018
Defoiling Foiled Image Captions
Defoiling Foiled Image Captions
Pranava Madhyastha
Josiah Wang
Lucia Specia
65
9
0
16 May 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
45
19
0
15 May 2018
Did the Model Understand the Question?
Did the Model Understand the Question?
Pramod Kaushik Mudrakarta
Ankur Taly
Mukund Sundararajan
Kedar Dhamdhere
ELMOODFAtt
85
200
0
14 May 2018
Domain Adapted Word Embeddings for Improved Sentiment Classification
Domain Adapted Word Embeddings for Improved Sentiment Classification
P. Sarma
Yingyu Liang
W. Sethares
66
67
0
11 May 2018
Previous
123...515253...585960
Next