Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.02788
Cited By
GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths
5 August 2024
Xianyu Chen
Ming Jiang
Qi Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths"
6 / 6 papers shown
Title
SIGN: A Statistically-Informed Gaze Network for Gaze Time Prediction
Jianping Ye
Michel Wedel
ViT
SLR
49
0
0
29 Jan 2025
SummAct: Uncovering User Intentions Through Interactive Behaviour Summarisation
Guanhua Zhang
Mohamed Ahmed
Zhiming Hu
Andreas Bulling
AI4TS
23
1
0
10 Oct 2024
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
272
4,244
0
30 Jan 2023
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
392
4,137
0
28 Jan 2022
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
42
30
0
28 Sep 2021
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
38
22
0
24 Oct 2020
1