Quantifying Attention Flow in Transformers

2 May 2020
Samira Abnar
Willem H. Zuidema

Papers citing "Quantifying Attention Flow in Transformers"

50 / 403 papers shown
On the Interplay of Human-AI Alignment, Fairness, and Performance Trade-offs in Medical Imaging
Haozhe Luo
Ziyu Zhou
Zixin Shu
Aurélie Pahud de Mortanges
Robert Berke
Mauricio Reyes
26
0
0
15 May 2025
DocVXQA: Context-Aware Visual Explanations for Document Question Answering
Mohamed Ali Souibgui
Changkyu Choi
Andrey Barsky
Kangsoo Jung
Ernest Valveny
Dimosthenis Karatzas
28
0
0
12 May 2025
What's Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models
Jan Kapar
Niklas Koenen
Martin Jullum
66
0
0
29 Apr 2025
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
33
0
0
29 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
32
0
0
28 Apr 2025
Exploring How LLMs Capture and Represent Domain-Specific Knowledge
Mirian Hipolito Garcia
Camille Couturier
Daniel Madrigal Diaz
Ankur Mallick
Anastasios Kyrillidis
Robert Sim
Victor Rühle
Saravan Rajmohan
30
0
0
23 Apr 2025
Learning to Attribute with Attention
Benjamin Cohen-Wang
Yung-Sung Chuang
Aleksander Madry
30
0
0
18 Apr 2025
Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification
Zhenyu Yang
Haiming Zhu
Rihui Zhang
Haipeng Zhang
Jianliang Wang
Chunhao Wang
Minbin Chen
F. Yin
MedIm
38
0
0
15 Apr 2025
STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data
Maximilian Forstenhäusler
Daniel Külzer
Christos Anagnostopoulos
S. P. Parambath
Natascha Weber
AI4TS
MedIm
40
0
0
14 Apr 2025
Are We Merely Justifying Results ex Post Facto? Quantifying Explanatory Inversion in Post-Hoc Model Explanations
Zhen Tan
Song Wang
Yifan Li
Yu Kong
Jundong Li
Tianlong Chen
Huan Liu
FAtt
45
0
0
11 Apr 2025
A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification
K. Djoumessi
Samuel Ofosu Mensah
Philipp Berens
ViT
MedIm
37
0
0
11 Apr 2025
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
Wenfeng Feng
Guoying Sun
31
0
0
09 Apr 2025
GraphPINE: Graph Importance Propagation for Interpretable Drug Response Prediction
Yoshitaka Inoue
Tianfan Fu
Augustin Luna
26
0
0
07 Apr 2025
Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents
Zichuan Li
Jian Cui
Xiaojing Liao
Luyi Xing
LLMAG
47
0
0
04 Apr 2025
Noiser: Bounded Input Perturbations for Attributing Large Language Models
Mohammad Reza Ghasemi Madani
Aryo Pradipta Gema
Gabriele Sarti
Yu Zhao
Pasquale Minervini
Andrea Passerini
AAML
35
0
0
03 Apr 2025
Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models
Guy Kaplan
Michael Toker
Yuval Reif
Yonatan Belinkov
Roy Schwartz
DiffM
48
0
0
01 Apr 2025
Hierarchical Attention Network for Interpretable ECG-based Heart Disease Classification
Mario Padilla Rodriguez
Mohamed Nafea
28
0
0
25 Mar 2025
EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision
Xiaofeng Mao
YueFeng Chen
Rong Zhang
Hui Xue
Zhao Li
Hang Su
AAML
VLM
43
0
0
21 Mar 2025
Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents
Juhee Kim
Woohyuk Choi
Byoungyoung Lee
LLMAG
87
1
0
17 Mar 2025
UniNet: A Unified Multi-granular Traffic Modeling Framework for Network Security
Binghui Wu
D. Divakaran
M. Gurusamy
57
0
0
06 Mar 2025
Visual Attention Exploration in Vision-Based Mamba Models
Junpeng Wang
Chin-Chia Michael Yeh
Uday Singh Saini
Mahashweta Das
Mamba
59
0
0
28 Feb 2025
Attend or Perish: Benchmarking Attention in Algorithmic Reasoning
Michal Spiegel
Michal Štefánik
Marek Kadlcík
Josef Kuchař
37
0
0
28 Feb 2025
Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions
Palawat Busaranuvong
Emmanuel O. Agu
Reza Saadati Fard
Deepak Kumar
Shefalika Gautam
B. Tulu
Diane Strong
MedIm
65
0
0
27 Feb 2025
Interpreting CLIP with Hierarchical Sparse Autoencoders
Vladimir Zaigrajew
Hubert Baniecki
P. Biecek
56
0
0
27 Feb 2025
Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
Itay Benou
Tammy Riklin-Raviv
67
0
0
27 Feb 2025
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
Chenyang Zhao
Kun Wang
J. H. Hsiao
Antoni B. Chan
CLIP
71
0
0
26 Feb 2025
Disentangling Visual Transformers: Patch-level Interpretability for Image Classification
Guillaume Jeanneret
Loïc Simon
F. Jurie
ViT
49
0
0
24 Feb 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
46
0
0
21 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Behrooz Azarkhalili
Maxwell Libbrecht
39
0
0
14 Feb 2025
Protego: Detecting Adversarial Examples for Vision Transformers via Intrinsic Capabilities
Jialin Wu
Kaikai Pan
Yanjiao Chen
Jiangyi Deng
Shengyuan Pang
Wenyuan Xu
ViT
AAML
43
0
0
13 Jan 2025
Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers
Tobias Leemann
Alina Fastowski
Felix Pfeiffer
Gjergji Kasneci
62
4
0
10 Jan 2025
xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology
Julius Hense
M. J. Idaji
Oliver Eberle
Thomas Schnake
Jonas Dippel
Laure Ciernik
Oliver Buchstab
Andreas Mock
Frederick Klauschen
Klaus-Robert Müller
51
3
0
08 Jan 2025
Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and Metrics
Lukas Klein
Carsten T. Lüth
U. Schlegel
Till J. Bungert
Mennatallah El-Assady
Paul F. Jäger
XAI
ELM
42
2
0
03 Jan 2025
Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers
Bohang Sun
Pietro Liò
ViT
AAML
40
1
0
02 Jan 2025
A Room to Roam: Reset Prediction Based on Physical Object Placement for Redirected Walking
Sulim Chun
Ho Jung Lee
In-Kwon Lee
35
0
0
23 Dec 2024
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Aodi Li
Liansheng Zhuang
Xiao Long
Minghong Yao
Shafei Wang
195
0
0
18 Dec 2024
Analyzing the Attention Heads for Pronoun Disambiguation in Context-aware Machine Translation Models
Paweł Mąka
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
86
0
0
15 Dec 2024
Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation
Davor Vukadin
Petar Afrić
Marin Šilić
Goran Delač
FAtt
95
2
0
12 Dec 2024
Token Cropr: Faster ViTs for Quite a Few Tasks
Benjamin Bergner
C. Lippert
Aravindh Mahendran
ViT
VLM
74
0
0
01 Dec 2024
Evidential Federated Learning for Skin Lesion Image Classification
Rutger Hendrix
Federica Proietto Salanitri
C. Spampinato
S. Palazzo
Ulas Bagci
VLM
FedML
31
0
0
15 Nov 2024
Visual Fourier Prompt Tuning
Runjia Zeng
Cheng Han
Qifan Wang
Chunshu Wu
Tong Geng
Lifu Huang
Ying Nian Wu
Dongfang Liu
VPVLM
VLM
58
6
0
02 Nov 2024
Cross-Fundus Transformer for Multi-modal Diabetic Retinopathy Grading with Cataract
Fan Xiao
Junlin Hou
Ruiwei Zhao
Rui Feng
Haidong Zou
Lina Lu
Yongjun Xu
Juzhao Zhang
MedIm
38
1
0
01 Nov 2024
Beyond Accuracy: Ensuring Correct Predictions With Correct Rationales
Tang Li
Mengmeng Ma
Xi Peng
45
2
0
31 Oct 2024
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
John Wu
David Wu
Jimeng Sun
52
1
0
31 Oct 2024
Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers
Shaobo Wang
Hongxuan Tang
Mingyang Wang
Hao Zhang
Xuyang Liu
Weiya Li
Xuming Hu
Linfeng Zhang
19
0
0
29 Oct 2024
Explainability in AI Based Applications: A Framework for Comparing Different Techniques
Arne Grobrugge
Nidhi Mishra
Johannes Jakubik
G. Satzger
99
1
0
28 Oct 2024
Interpretable Image Classification with Adaptive Prototype-based Vision Transformers
Chiyu Ma
J. Donnelly
Wenjun Liu
Soroush Vosoughi
Cynthia Rudin
Chaofan Chen
ViT
47
8
0
28 Oct 2024
CrystalX: Ultra-Precision Crystal Structure Resolution and Error Correction Using Deep Learning
Kaipeng Zheng
Weiran Huang
Wanli Ouyang
Han-Sen Zhong
Yuan Li
36
0
0
17 Oct 2024
Sparse Prototype Network for Explainable Pedestrian Behavior Prediction
Yan Feng
Alexander Carballo
K. Takeda
ViT
42
0
0
16 Oct 2024
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
28
0
0
15 Oct 2024