Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing

10 October 2022

Papers citing "Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing"

3 / 3 papers shown

Title
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation Christel Chappuis Eliot Walt Vincent Mendez Sylvain Lobry B. L. Saux D. Tuia 23 3 0 28 Nov 2023
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language Shantipriya Parida Idris Abdulmumin Shamsuddeen Hassan Muhammad Aneesh Bose Guneet Singh Kohli I. Ahmad Ketan Kotwal S. Sarkar Ondrej Bojar Habeebah Adamu Kakudi 24 4 0 28 May 2023
How to find a good image-text embedding for remote sensing visual question answering? Christel Chappuis Sylvain Lobry B. Kellenberger Bertrand Le Saux D. Tuia 37 20 0 24 Sep 2021