Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature

18 May 2023

Ana Claudia Akemi Matsuki de Faria

Felype de Castro Bastos

Jose Victor Nogueira Alves da Silva

Vitor Lopes Fabris

Valeska Uchôa

Décio Gonccalves de Aguiar Neto

C. F. G. Santos

ArXiv PDF HTML

Papers citing "Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature"

22 / 22 papers shown

Title
GC-KBVQA: A New Four-Stage Framework for Enhancing Knowledge Based Visual Question Answering Performance Mohammad Mahdi Moradi Sudhir Mudur 15 0 0 25 May 2025
CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World Zoya Volovikova G. Gorbov Petr Kuderov Aleksandr I. Panov A. Skrynnik 39 0 0 17 May 2025
Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru Dunant Cusipuma David Ortega Victor Flores-Benites Arturo Deza OOD 105 0 0 10 Mar 2025
RoboDesign1M: A Large-scale Dataset for Robot Design Understanding T. H. Le T. H. Nguyen Quang-Dieu Tran Quang Minh Nguyen Baoru Huang Hoan Nguyen M. Vu Tung D. Ta A. Nguyen 3DV 88 0 0 09 Mar 2025
SparrowVQE: Visual Question Explanation for Course Content Understanding Jialu Li Manish Kumar Thota Ruslan Gokhman Radek Holik Youshan Zhang 55 1 0 12 Nov 2024
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables Suyash Vardhan Mathur J. Bafna Kunal Kartik Harshita Khandelwal Manish Shrivastava Vivek Gupta Joey Tianyi Zhou Dan Roth LMTD 45 1 0 25 Aug 2024
Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video Chunggi Lee Tica Lin Hanspeter Pfister Chen Zhu-Tian 53 1 0 09 Aug 2024
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks Juhwan Choi Junehyoung Kwon Jungmin Yun Seunguk Yu Youngbin Kim 53 1 0 29 Jul 2024
ECOR: Explainable CLIP for Object Recognition Ali Rasekh Sepehr Kazemi Ranjbar Milad Heidari Wolfgang Nejdl VLM 72 4 0 19 Apr 2024
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models Songtao Jiang Tuo Zheng Yan Zhang Yeying Jin Li Yuan Zuozhu Liu MoE 71 15 0 16 Apr 2024
Navigating the Landscape of Hint Generation Research: From the Past to the Future Anubhav Jangra Jamshid Mozafari Adam Jatowt Smaranda Muresan 45 2 0 06 Apr 2024
Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models Songtao Jiang Yan Zhang Chenyi Zhou Yeying Jin Yang Feng Jian Wu Zuozhu Liu LRM VLM 64 4 0 06 Apr 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming Pengyuan Zhou Lin Wang Zhi Liu Yanbin Hao Pan Hui Sasu Tarkoma J. Kangasharju VGen 54 27 0 30 Jan 2024
Multimodality of AI for Education: Towards Artificial General Intelligence Gyeong-Geon Lee Lehong Shi Ehsan Latif Yizhu Gao Arne Bewersdorff ... Zheng Liu Hui Wang Gengchen Mai Tiaming Liu Xiaoming Zhai 53 40 0 10 Dec 2023
Multiscale Superpixel Structured Difference Graph Convolutional Network for VL Representation Siyu Zhang Ye-Ting Chen Fang Wang Yaoru Sun Jun Yang Lizhi Bai SSL 39 0 0 20 Oct 2023
Robust Visual Question Answering: Datasets, Methods, and Future Challenges Jie Ma Pinghui Wang Dechen Kong Zewei Wang Jun Liu Hongbin Pei Junzhou Zhao OOD 49 18 0 21 Jul 2023
MUST-VQA: MUltilingual Scene-text VQA Emanuele Vivoli Ali Furkan Biten Andrés Mafla Dimosthenis Karatzas Lluís Gómez 60 6 0 14 Sep 2022
Coarse-to-Fine Reasoning for Visual Question Answering Binh X. Nguyen Tuong Khanh Long Do Huy Tran Erman Tjiputra Quang-Dieu Tran A. Nguyen NAI 83 36 0 06 Oct 2021
Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering Rajat Koner Hang Li Marcel Hildebrandt Deepan Das Volker Tresp Stephan Günnemann 43 31 0 13 Jul 2021
A survey on VQA_Datasets and Approaches Yeyun Zou Qiyu Xie 50 18 0 02 May 2021
Densely Connected Convolutional Networks Gao Huang Zhuang Liu Laurens van der Maaten Kilian Q. Weinberger PINN 3DV 368 36,493 0 25 Aug 2016
ImageNet Large Scale Visual Recognition Challenge Olga Russakovsky Jia Deng Hao Su J. Krause S. Satheesh ... A. Karpathy A. Khosla Michael S. Bernstein Alexander C. Berg Li Fei-Fei VLM ObjD 379 39,309 0 01 Sep 2014