Improving Selective Visual Question Answering by Learning from Your
Peers

Improving Selective Visual Question Answering by Learning from Your Peers

14 June 2023

Corentin Dancette

Spencer Whitehead

Rishabh Maheshwary

Ramakrishna Vedantam

Marcus Rohrbach

ArXiv (abs)PDF HTML

Papers citing "Improving Selective Visual Question Answering by Learning from Your Peers"

12 / 12 papers shown

Title
Variational Visual Question Answering Tobias Jan Wieczorek Nathalie Daun Mohammad Emtiyaz Khan Marcus Rohrbach OOD 94 0 0 14 May 2025
Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data Spencer Whitehead Jacob Phillips Sean Hendryx 66 0 0 30 Aug 2024
Selectively Answering Visual Questions Julian Martin Eisenschlos Hernán Maina Guido Ivetta Luciana Benotti 88 0 0 03 Jun 2024
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions Junzhang Liu Zhecan Wang Hammad A. Ayyubi Haoxuan You Chris Thomas Rui Sun Shih-Fu Chang Kai-Wei Chang 166 0 0 18 May 2024
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering Zaid Khan Yun Fu AAML 83 10 0 16 Apr 2024
Selective Temporal Knowledge Graph Reasoning Zhongni Hou Xiaolong Jin Zixuan Li Long Bai Jiafeng Guo Xueqi Cheng 87 0 0 02 Apr 2024
Improved Baselines for Data-efficient Perceptual Augmentation of LLMs Théophane Vallaeys Mustafa Shukor Matthieu Cord Jakob Verbeek 103 13 0 20 Mar 2024
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning Tejas Srinivasan Jack Hessel Tanmay Gupta Bill Yuchen Lin Yejin Choi Jesse Thomason Khyathi Chandu 71 9 0 23 Feb 2024
UNK-VQA: A Dataset and a Probe into the Abstention Ability of Multi-modal Large Models Yanyang Guo Fangkai Jiao Zhiqi Shen Liqiang Nie Mohan S. Kankanhalli MLLM 87 7 0 17 Oct 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning Mustafa Shukor Alexandre Ramé Corentin Dancette Matthieu Cord LRM MLLM 113 22 0 01 Oct 2023
An Outlook into the Future of Egocentric Vision Chiara Plizzari Gabriele Goletto Antonino Furnari Siddhant Bansal Francesco Ragusa G. Farinella Dima Damen Tatiana Tommasi EgoV 120 47 0 14 Aug 2023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks Mustafa Shukor Corentin Dancette Alexandre Ramé Matthieu Cord MoMe MLLM 126 46 0 30 Jul 2023