v1v2v3v4v5v6v7 (latest)

VQA: Visual Question Answering

3 May 2015

Devi Parikh

Papers citing "VQA: Visual Question Answering"

50 / 2,957 papers shown

Title
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Mateusz Malinowski Marcus Rohrbach Mario Fritz 106 101 0 09 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking Xiaoyu Lin Devi Parikh CoGe 99 84 0 04 May 2016
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering Arun Mallya Svetlana Lazebnik 101 119 0 16 Apr 2016
Visual Storytelling Ting-Hao 'Kenneth' Huang Huang Francis Ferraro N. Mostafazadeh Ishan Misra ... C. L. Zitnick Devi Parikh Lucy Vanderwende Michel Galley Margaret Mitchell VGen 90 480 0 13 Apr 2016
Counting Everyday Objects in Everyday Scenes Prithvijit Chattopadhyay Ramakrishna Vedantam Ramprasaath R. Selvaraju Dhruv Batra Devi Parikh 111 157 0 12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition Marcus Rohrbach VLM 40 3 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 83 274 0 10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes Gordon A. Christie A. Laddha Aishwarya Agrawal Stanislaw Antol Yash Goyal K. Kochersberger Dhruv Batra 81 30 0 07 Apr 2016
A Focused Dynamic Attention Model for Visual Question Answering Ilija Ilievski Shuicheng Yan Jiashi Feng 77 122 0 06 Apr 2016
Deep Image Retrieval: Learning global representations for image search Albert Gordo Jon Almazán Jérôme Revaud Diane Larlus 84 807 0 05 Apr 2016
Automatic Annotation of Structured Facts in Images Mohamed Elhoseiny Scott D. Cohen W. Chang Brian L. Price Ahmed Elgammal 61 9 0 02 Apr 2016
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings Spandana Gella Mirella Lapata Frank Keller CoGe 82 53 0 30 Mar 2016
A Diagram Is Worth A Dozen Images Aniruddha Kembhavi M. Salvato Eric Kolve Minjoon Seo Hannaneh Hajishirzi Ali Farhadi 3DV 103 505 0 24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing Arnau Ramisa F. Yan Francesc Moreno-Noguer K. Mikolajczyk 72 106 0 23 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 89 362 0 09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering Caiming Xiong Stephen Merity R. Socher 88 756 0 04 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata ... Yannis Kalantidis Li Li David A. Shamma Michael S. Bernstein Fei-Fei Li 273 5,779 0 23 Feb 2016
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision Suraj Srinivas Ravi Kiran Sarvadevabhatla Konda Reddy Mopuri N. Prabhu S. Kruthiventi R. Venkatesh Babu OOD 69 216 0 25 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures Raffaella Bernardi Ruken Cakici Desmond Elliott Aykut Erdem Erkut Erdem Nazli Ikizler-Cinbis Frank Keller A. Muscat Barbara Plank EGVM VLM 92 365 0 15 Jan 2016
Learning to Compose Neural Networks for Question Answering Jacob Andreas Marcus Rohrbach Trevor Darrell Dan Klein NAI KELM BDL CoGe 130 568 0 07 Jan 2016
We Are Humor Beings: Understanding and Predicting Visual Humor Arjun Chandrasekaran Ashwin K. Vijayakumar Stanislaw Antol Joey Tianyi Zhou Dhruv Batra C. L. Zitnick Devi Parikh 113 57 0 14 Dec 2015
Neural Self Talk: Image Understanding via Continuous Questioning and Answering Yezhou Yang Yi Li Cornelia Fermuller Yiannis Aloimonos 38 24 0 10 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering Makarand Tapaswi Yukun Zhu Rainer Stiefelhagen Antonio Torralba R. Urtasun Sanja Fidler 122 752 0 09 Dec 2015
Simple Baseline for Visual Question Answering Bolei Zhou Yuandong Tian Sainbayar Sukhbaatar Arthur Szlam Rob Fergus FAtt 108 324 0 07 Dec 2015
A Restricted Visual Turing Test for Deep Scene and Event Understanding Qi Tianfu Wu M. Lee Song-Chun Zhu 62 12 0 06 Dec 2015
Natural Language Understanding with Distributed Representation Kyunghyun Cho GNN BDL 86 55 0 24 Nov 2015
Where To Look: Focus Regions for Visual Question Answering Kevin J. Shih Saurabh Singh Derek Hoiem 98 462 0 23 Nov 2015
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes Satwik Kottur Ramakrishna Vedantam José M. F. Moura Devi Parikh VLM 114 85 0 22 Nov 2015
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources Qi Wu Peng Wang Chunhua Shen A. Dick Anton Van Den Hengel 82 372 0 22 Nov 2015
Learning Deep Structure-Preserving Image-Text Embeddings Liwei Wang Yin Li Svetlana Lazebnik 115 784 0 19 Nov 2015
Reducing Overfitting in Deep Networks by Decorrelating Representations Michael Cogswell Faruk Ahmed Ross B. Girshick C. L. Zitnick Dhruv Batra 120 416 0 19 Nov 2015
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering Kan Chen Jiang Wang Liang-Chieh Chen Haoyuan Gao Wenyuan Xu Ram Nevatia 86 288 0 18 Nov 2015
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction Hyeonwoo Noh Paul Hongsuck Seo Bohyung Han OOD 78 327 0 18 Nov 2015
Compositional Memory for Visual Question Answering Aiwen Jiang Fang Wang Fatih Porikli Yi Li CoGe 59 42 0 18 Nov 2015
Learning Articulated Motion Models from Visual and Lingual Signals Zhengyang Wu Joey Tianyi Zhou Matthew R. Walter 31 0 0 17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Huijuan Xu Kate Saenko 122 763 0 17 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions Peng Zhang Yash Goyal D. Summers-Stay Dhruv Batra Devi Parikh CoGe 107 352 0 16 Nov 2015
Sherlock: Scalable Fact Learning in Images Mohamed Elhoseiny Scott D. Cohen W. Chang Brian L. Price Ahmed Elgammal 59 26 0 16 Nov 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 86 45 0 15 Nov 2015
Visual7W: Grounded Question Answering in Images Yuke Zhu Oliver Groth Michael S. Bernstein Li Fei-Fei 157 890 0 11 Nov 2015
Neural Module Networks Jacob Andreas Marcus Rohrbach Trevor Darrell Dan Klein CoGe 177 1,079 0 09 Nov 2015
Explicit Knowledge-based Reasoning for Visual Question Answering Peng Wang Qi Wu Chunhua Shen Anton Van Den Hengel A. Dick 91 261 0 09 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions Junhua Mao Jonathan Huang Alexander Toshev Oana-Maria Camburu Alan Yuille Kevin Patrick Murphy ObjD 144 1,362 0 07 Nov 2015
Stacked Attention Networks for Image Question Answering Zichao Yang Xiaodong He Jianfeng Gao Li Deng Alex Smola BDL 166 1,889 0 07 Nov 2015
Learning Visual Features from Large Weakly Supervised Data Armand Joulin Laurens van der Maaten Allan Jabri Nicolas Vasilache SSL 131 410 0 06 Nov 2015
VISALOGY: Answering Visual Analogy Questions Fereshteh Sadeghi C. L. Zitnick Ali Farhadi 83 46 0 30 Oct 2015
Using Thought-Provoking Children's Questions to Drive Artificial Intelligence Research E. Mueller H. Minsky LRM 18 1 0 27 Aug 2015
A Survey of Current Datasets for Vision and Language Research Francis Ferraro N. Mostafazadeh Ting-Hao 'Kenneth' Huang Huang Lucy Vanderwende Jacob Devlin Michel Galley Margaret Mitchell VLM 92 75 0 23 Jun 2015
Describing Common Human Visual Actions in Images M. R. Ronchi Pietro Perona 88 64 0 07 Jun 2015
What value do explicit high level concepts have in vision to language problems? Qi Wu Chunhua Shen Lingqiao Liu A. Dick Anton Van Den Hengel 89 444 0 03 Jun 2015