arXiv:2210.04510
Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
10 October 2022
Tim Siebert
Kai Norman Clasen
Mahdyar Ravanbakhsh
Begüm Demir
Papers citing "Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing"
3 / 3 papers shown
Title
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation
Christel Chappuis
Eliot Walt
Vincent Mendez
Sylvain Lobry
B. Le Saux
D. Tuia
23
3
0
28 Nov 2023
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language
Shantipriya Parida
Idris Abdulmumin
Shamsuddeen Hassan Muhammad
Aneesh Bose
Guneet Singh Kohli
I. Ahmad
Ketan Kotwal
S. Sarkar
Ondrej Bojar
Habeebah Adamu Kakudi
24
4
0
28 May 2023
How to find a good image-text embedding for remote sensing visual question answering?
Christel Chappuis
Sylvain Lobry
B. Kellenberger
Bertrand Le Saux
D. Tuia
37
20
0
24 Sep 2021