2109.13139
Cited By
Multimodal Integration of Human-Like Attention in Visual Question Answering
27 September 2021
Ekta Sood
Fabian Kögel
Philippe Muller
Dominike Thomas
Mihai Bâce
Andreas Bulling
Papers citing
"Multimodal Integration of Human-Like Attention in Visual Question Answering"
14 / 14 papers shown
Title
Bridging Ears and Eyes: Analyzing Audio and Visual Large Language Models to Humans in Visible Sound Recognition and Reducing Their Sensory Gap via Cross-Modal Distillation
Xilin Jiang
Junkai Wu
Vishal B. Choudhari
N. Mesgarani
VLM
30
0
0
11 May 2025
Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions
Angela Lopez-Cardona
Sebastian Idesis
Ioannis Arapakis
26
0
0
09 Apr 2025
GazeLLM: Multimodal LLMs incorporating Human Visual Attention
Jun Rekimoto
38
0
0
31 Mar 2025
Where do Large Vision-Language Models Look at when Answering Questions?
X. Xing
Chia-Wen Kuo
Li Fuxin
Yulei Niu
Fan Chen
Ming Li
Ying Wu
Longyin Wen
Sijie Zhu
LRM
60
0
0
18 Mar 2025
Learning User Embeddings from Human Gaze for Personalised Saliency Prediction
Florian Strohm
Mihai Bâce
Andreas Bulling
26
1
0
20 Mar 2024
Trends, Applications, and Challenges in Human Attention Modelling
Giuseppe Cartella
Marcella Cornia
Vittorio Cuculo
Alessandro D’Amelio
Dario Zanca
Giuseppe Boccignone
Rita Cucchiara
40
6
0
28 Feb 2024
Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding
Shuwen Deng
Paul Prasse
D. R. Reich
Tobias Scheffer
Lena A. Jäger
38
5
0
23 Oct 2023
Deep Metric Loss for Multimodal Learning
Sehwan Moon
Hyun-Yong Lee
16
0
0
21 Aug 2023
Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading
Shuwen Deng
D. R. Reich
Paul Prasse
Patrick Haller
Tobias Scheffer
Lena A. Jäger
21
17
0
21 Apr 2023
That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data
Denis Jered McInerney
Geoffrey S. Young
Jan Willem van de Meent
Byron C. Wallace
13
0
0
12 Oct 2022
Detection of ADHD based on Eye Movements during Natural Viewing
Shuwen Deng
Paul Prasse
D. R. Reich
S. Dziemian
Maja Stegenwallner-Schütz
Daniel G. Krakowczyk
Silvia Makowski
N. Langer
Tobias Scheffer
Lena A. Jäger
28
9
0
04 Jul 2022
Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?
Stephanie Brandl
Oliver Eberle
Jonas Pilot
Anders Søgaard
67
33
0
25 Apr 2022
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
Ekta Sood
Fabian Kögel
Florian Strohm
Prajit Dhar
Andreas Bulling
40
19
0
27 Sep 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015