2010.00667
Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers
1 October 2020
Hanjie Chen
Yangfeng Ji
AAML
VLM
Papers citing
"Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"
36 / 36 papers shown
Fact in Fragments: Deconstructing Complex Claims via LLM-based Atomic Fact Extraction and Verification
Liwen Zheng
Chaozhuo Li
Zheng Liu
Feiran Huang
Haoran Jia
Zaisheng Ye
Xi Zhang
HILM
23
0
0
09 Jun 2025
Learning Distribution-Wise Control in Representation Space for Language Models
Chunyuan Deng
Ruidi Chang
Hanjie Chen
22
0
0
07 Jun 2025
Normalized AOPC: Fixing Misleading Faithfulness Metrics for Feature Attribution Explainability
Joakim Edin
Andreas Geert Motzfeldt
Casper L. Christensen
Tuukka Ruotsalo
Lars Maaløe
Maria Maistro
132
4
0
15 Aug 2024
XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution
Yurui Chang
Bochuan Cao
Yujia Wang
Jinghui Chen
Lu Lin
LRM
85
2
0
30 May 2024
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales
Lucas Resck
Marcos M. Raimundo
Jorge Poco
129
3
0
03 Apr 2024
Using Interpretation Methods for Model Enhancement
Zhuo Chen
Chengyue Jiang
Kewei Tu
83
2
0
02 Apr 2024
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery
Linan Yue
Qi Liu
Yichao Du
Li Wang
Weibo Gao
Yanqing An
82
5
0
12 Mar 2024
Learning to Maximize Mutual Information for Chain-of-Thought Distillation
Xin Chen
Hanxian Huang
Yanjun Gao
Yi Wang
Jishen Zhao
Ke Ding
100
15
0
05 Mar 2024
Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis
Zhenxiao Cheng
Jie Zhou
Wen Wu
Qin Chen
Liang He
70
0
0
28 Feb 2024
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem
Qian Chen
Tao Zhang
Dongyang Li
Xiaofeng He
95
0
0
13 Dec 2023
Unsupervised Text Style Transfer with Deep Generative Models
Zhongtao Jiang
Yuanzhe Zhang
Yiming Ju
Kang Liu
71
0
0
31 Aug 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
80
4
0
23 May 2023
Consistent Multi-Granular Rationale Extraction for Explainable Multi-hop Fact Verification
Jiasheng Si
Yingjie Zhu
Deyu Zhou
AAML
131
4
0
16 May 2023
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification
Xiang Hu
Xinyu Kong
Kewei Tu
MILM
BDL
65
5
0
06 Mar 2023
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations
Zhenxiao Cheng
Jie Zhou
Wen Wu
Qin Chen
Liang He
79
3
0
21 Feb 2023
Improving Interpretability via Explicit Word Interaction Graph Layer
Arshdeep Sekhon
Hanjie Chen
A. Shrivastava
Zhe Wang
Yangfeng Ji
Yanjun Qi
AI4CE
MILM
70
6
0
03 Feb 2023
Identifying the Source of Vulnerability in Explanation Discrepancy: A Case Study in Neural Text Classification
Ruixuan Tang
Hanjie Chen
Yangfeng Ji
AAML
FAtt
73
3
0
10 Dec 2022
Unsupervised Text Deidentification
John X. Morris
Justin T. Chiu
Ramin Zabih
Alexander M. Rush
70
7
0
20 Oct 2022
An Information Minimization Based Contrastive Learning Model for Unsupervised Sentence Embeddings Learning
Shaobin Chen
Jie Zhou
Yuling Sun
Liang He
SSL
78
7
0
22 Sep 2022
Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Bhushan Kotnis
Kiril Gashteovski
J. Gastinger
G. Serra
Francesco Alesiani
T. Sztyler
Ammar Shaker
Na Gong
Carolin (Haas) Lawrence
Zhao Xu
78
9
0
10 Jul 2022
Pathologies of Pre-trained Language Models in Few-shot Fine-tuning
Hanjie Chen
Guoqing Zheng
Ahmed Hassan Awadallah
Yangfeng Ji
AI4MH
86
3
0
17 Apr 2022
Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking
Tianyi Luo
Rui Meng
Xinze Wang
Yongxu Liu
54
4
0
28 Mar 2022
Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation
Hanjie Chen
Yangfeng Ji
OOD
AAML
VLM
101
21
0
23 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAML
ELM
76
20
0
21 Mar 2022
Hierarchical Interpretation of Neural Text Classification
Hanqi Yan
Lin Gui
Yulan He
107
14
0
20 Feb 2022
Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation
Raymond Li
Wen Xiao
Linzi Xing
Lanjun Wang
Gabriel Murray
Giuseppe Carenini
ViT
67
8
0
10 Dec 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
99
47
0
20 Oct 2021
Logic Traps in Evaluating Attribution Scores
Yiming Ju
Yuanzhe Zhang
Zhao Yang
Zhongtao Jiang
Kang Liu
Jun Zhao
XAI
FAtt
119
19
0
12 Sep 2021
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo
Yanjun Qi
AAML
206
127
0
01 Sep 2021
Local Explanation of Dialogue Response Generation
Yi-Lin Tuan
Connor Pryor
Wenhu Chen
Lise Getoor
Wenjie Wang
76
12
0
11 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations
Peter Hase
Harry Xie
Joey Tianyi Zhou
OODD
LRM
FAtt
123
91
0
01 Jun 2021
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification
G. Chrysostomou
Nikolaos Aletras
85
38
0
06 May 2021
Flexible Instance-Specific Rationalization of NLP Models
G. Chrysostomou
Nikolaos Aletras
82
14
0
16 Apr 2021
Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks
Hanjie Chen
Song Feng
Jatin Ganhotra
H. Wan
Chulaka Gunasekara
Sachindra Joshi
Yangfeng Ji
73
18
0
09 Apr 2021
Explaining the Road Not Taken
Hua Shen
Ting-Hao 'Kenneth' Huang
FAtt
XAI
64
9
0
27 Mar 2021
A Unified Approach to Interpreting and Boosting Adversarial Transferability
Xin Eric Wang
Jie Ren
Shuyu Lin
Xiangming Zhu
Yisen Wang
Quanshi Zhang
AAML
143
96
0
08 Oct 2020