Learning to Answer Questions From Image Using Convolutional Neural Network

1 June 2015

Papers citing "Learning to Answer Questions From Image Using Convolutional Neural Network"

44 / 44 papers shown

Title
GC-KBVQA: A New Four-Stage Framework for Enhancing Knowledge Based Visual Question Answering Performance Mohammad Mahdi Moradi Sudhir Mudur 15 0 0 25 May 2025
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities Md Farhan Ishmam Md Sakib Hossain Shovon M. F. Mridha Nilanjan Dey 85 39 0 01 Nov 2023
Noisy Correspondence Learning with Meta Similarity Correction Haocheng Han Kaiyao Miao Qinghua Zheng Minnan Luo 47 28 0 13 Apr 2023
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval Haoran Wang Dongliang He Wenhao Wu Boyang Xia Min Yang Fu Li YunLong Yu Zhong Ji Errui Ding Jingdong Wang 38 23 0 21 Aug 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering Jingkuan Song Pengpeng Zeng Lianli Gao Heng Tao Shen 32 62 0 04 Jun 2022
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering Mingxiao Li Marie-Francine Moens 30 13 0 06 Mar 2022
CQ-VQA: Visual Question Answering on Categorized Questions Aakansha Mishra A. Anand Prithwijit Guha 52 6 0 17 Feb 2020
Robust Explanations for Visual Question Answering Badri N. Patro Shivansh Pate Vinay P. Namboodiri OOD AAML 25 20 0 23 Jan 2020
Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing Vedika Agarwal Rakshith Shetty Mario Fritz CML AAML 32 156 0 16 Dec 2019
Assessing the Robustness of Visual Question Answering Models Jia-Hong Huang Modar Alfadly Guohao Li Marcel Worring AAML OOD 28 23 0 30 Nov 2019
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines Jingxiang Lin Unnat Jain Alex Schwing LRM ReLM 51 9 0 31 Oct 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video Zhenfang Chen Lin Ma Wenhan Luo Kwan-Yee K. Wong 37 101 0 06 Jun 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning Wei Zhang Bairui Wang Lin Ma Wei Liu 25 67 0 03 Jun 2019
A Trainable Multiplication Layer for Auto-correlation and Co-occurrence Extraction Hideaki Hayashi S. Uchida 14 3 0 30 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism Xuelong Li Aihong Yuan Xiaoqiang Lu 21 37 0 29 May 2019
Memory-Attended Recurrent Network for Video Captioning Wenjie Pei Jiyuan Zhang Xiangrong Wang Lei Ke Xiaoyong Shen Yu-Wing Tai 27 199 0 10 May 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog Idan Schwartz Alex Schwing Tamir Hazan 32 69 0 11 Apr 2019
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering Pan Lu Lei Ji Wei Zhang Nan Duan M. Zhou Jianyong Wang CoGe 32 79 0 24 May 2018
Reconstruction Network for Video Captioning Bairui Wang Lin Ma Wei Zhang Wen Liu 43 316 0 30 Mar 2018
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering Unnat Jain Svetlana Lazebnik Alex Schwing MLLM 37 81 0 29 Mar 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts Raymond A. Yeh Minh Do Alex Schwing 22 40 0 29 Mar 2018
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks Nikolaos Passalis Anastasios Tefas 28 70 0 25 Jul 2017
Survey of Visual Question Answering: Datasets and Techniques A. Gupta 21 38 0 10 May 2017
The Forgettable-Watcher Model for Video Question Answering Hongyang Xue Zhou Zhao Deng Cai 21 9 0 03 May 2017
VQABQ: Visual Question Answering by Basic Questions Jia-Hong Huang Modar Alfadly Guohao Li 27 24 0 19 Mar 2017
Task-driven Visual Saliency and Attention-based Visual Question Answering Yuetan Lin Zhangyang Pang Donghui Wang Yueting Zhuang 35 26 0 22 Feb 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning Justin Johnson B. Hariharan Laurens van der Maaten Li Fei-Fei C. L. Zitnick Ross B. Girshick CoGe 191 2,338 0 20 Dec 2016
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions Peng Wang Qi Wu Chunhua Shen Anton Van Den Hengel OOD 39 86 0 16 Dec 2016
Leveraging Video Descriptions to Learn Video Question Answering Kuo-Hao Zeng Tseng-Hung Chen Ching-Yao Chuang Yuan-Hong Liao Juan Carlos Niebles Min Sun 44 176 0 12 Nov 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task Ashkan Mokarian Mateusz Malinowski Mario Fritz 27 5 0 09 Aug 2016
DualNet: Domain-Invariant Network for Visual Question Answering Kuniaki Saito Andrew Shin Yoshitaka Ushiku Tatsuya Harada 37 58 0 20 Jun 2016
Adversarial Feature Learning Jiasen Lu Philipp Krahenbuhl Trevor Darrell GAN 59 1,601 0 31 May 2016
A Focused Dynamic Attention Model for Visual Question Answering Ilija Ilievski Shuicheng Yan Jiashi Feng 28 122 0 06 Apr 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 27 360 0 09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering Caiming Xiong Stephen Merity R. Socher 34 753 0 04 Mar 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures Raffaella Bernardi Ruken Cakici Desmond Elliott Aykut Erdem Erkut Erdem Nazli Ikizler-Cinbis Frank Keller A. Muscat Barbara Plank EGVM VLM 27 363 0 15 Jan 2016
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources Qi Wu Peng Wang Chunhua Shen A. Dick Anton Van Den Hengel 31 370 0 22 Nov 2015
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction Hyeonwoo Noh Paul Hongsuck Seo Bohyung Han OOD 33 327 0 18 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Huijuan Xu Kate Saenko 36 761 0 17 Nov 2015
Visual7W: Grounded Question Answering in Images Yuke Zhu Oliver Groth Michael S. Bernstein Li Fei-Fei 44 877 0 11 Nov 2015
Neural Module Networks Jacob Andreas Marcus Rohrbach Trevor Darrell Dan Klein CoGe 66 1,064 0 09 Nov 2015
What value do explicit high level concepts have in vision to language problems? Qi Wu Chunhua Shen Lingqiao Liu A. Dick Anton Van Den Hengel 33 443 0 03 Jun 2015
Exploring Models and Data for Image Question Answering Mengye Ren Ryan Kiros R. Zemel 44 712 0 08 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images Mateusz Malinowski Marcus Rohrbach Mario Fritz 41 596 0 05 May 2015