Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.13139
Cited By
Multimodal Integration of Human-Like Attention in Visual Question Answering
27 September 2021
Ekta Sood
Fabian Kögel
Philippe Muller
Dominike Thomas
Mihai Bâce
Andreas Bulling
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multimodal Integration of Human-Like Attention in Visual Question Answering"
14 / 14 papers shown
Title
Bridging Ears and Eyes: Analyzing Audio and Visual Large Language Models to Humans in Visible Sound Recognition and Reducing Their Sensory Gap via Cross-Modal Distillation
Xilin Jiang
Junkai Wu
Vishal B. Choudhari
N. Mesgarani
VLM
30
0
0
11 May 2025
Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions
Angela Lopez-Cardona
Sebastian Idesis
Ioannis Arapakis
28
0
0
09 Apr 2025
GazeLLM: Multimodal LLMs incorporating Human Visual Attention
Jun Rekimoto
40
0
0
31 Mar 2025
Where do Large Vision-Language Models Look at when Answering Questions?
X. Xing
Chia-Wen Kuo
Li Fuxin
Yulei Niu
Fan Chen
Ming Li
Ying Wu
Longyin Wen
Sijie Zhu
LRM
62
0
0
18 Mar 2025
Learning User Embeddings from Human Gaze for Personalised Saliency Prediction
Florian Strohm
Mihai Bâce
Andreas Bulling
31
1
0
20 Mar 2024
Trends, Applications, and Challenges in Human Attention Modelling
Giuseppe Cartella
Marcella Cornia
Vittorio Cuculo
Alessandro D’Amelio
Dario Zanca
Giuseppe Boccignone
Rita Cucchiara
40
6
0
28 Feb 2024
Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding
Shuwen Deng
Paul Prasse
D. R. Reich
Tobias Scheffer
Lena A. Jäger
40
5
0
23 Oct 2023
Deep Metric Loss for Multimodal Learning
Sehwan Moon
Hyun-Yong Lee
16
0
0
21 Aug 2023
Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading
Shuwen Deng
D. R. Reich
Paul Prasse
Patrick Haller
Tobias Scheffer
Lena A. Jäger
23
17
0
21 Apr 2023
That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data
Denis Jered McInerney
Geoffrey S. Young
Jan-Willem van de Meent
Byron C. Wallace
18
0
0
12 Oct 2022
Detection of ADHD based on Eye Movements during Natural Viewing
Shuwen Deng
Paul Prasse
D. R. Reich
S. Dziemian
Maja Stegenwallner-Schütz
Daniel G. Krakowczyk
Silvia Makowski
N. Langer
Tobias Scheffer
Lena A. Jäger
30
9
0
04 Jul 2022
Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?
Stephanie Brandl
Oliver Eberle
Jonas Pilot
Anders Søgaard
67
33
0
25 Apr 2022
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
Ekta Sood
Fabian Kögel
Florian Strohm
Prajit Dhar
Andreas Bulling
40
19
0
27 Sep 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
1