ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.00667
  4. Cited By
Learning Variational Word Masks to Improve the Interpretability of
  Neural Text Classifiers
v1v2v3 (latest)

Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers

1 October 2020
Hanjie Chen
Yangfeng Ji
    AAMLVLM
ArXiv (abs)PDFHTML

Papers citing "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"

36 / 36 papers shown
Title
Fact in Fragments: Deconstructing Complex Claims via LLM-based Atomic Fact Extraction and Verification
Fact in Fragments: Deconstructing Complex Claims via LLM-based Atomic Fact Extraction and Verification
Liwen Zheng
Chaozhuo Li
Zheng Liu
Feiran Huang
Haoran Jia
Zaisheng Ye
Xi Zhang
HILM
23
0
0
09 Jun 2025
Learning Distribution-Wise Control in Representation Space for Language Models
Learning Distribution-Wise Control in Representation Space for Language Models
Chunyuan Deng
Ruidi Chang
Hanjie Chen
22
0
0
07 Jun 2025
Normalized AOPC: Fixing Misleading Faithfulness Metrics for Feature Attribution Explainability
Normalized AOPC: Fixing Misleading Faithfulness Metrics for Feature Attribution Explainability
Joakim Edin
Andreas Geert Motzfeldt
Casper L. Christensen
Tuukka Ruotsalo
Lars Maaløe
Maria Maistro
132
4
0
15 Aug 2024
XPrompt:Explaining Large Language Model's Generation via Joint Prompt
  Attribution
XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution
Yurui Chang
Bochuan Cao
Yujia Wang
Jinghui Chen
Lu Lin
LRM
85
2
0
30 May 2024
Exploring the Trade-off Between Model Performance and Explanation
  Plausibility of Text Classifiers Using Human Rationales
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales
Lucas Resck
Marcos M. Raimundo
Jorge Poco
129
3
0
03 Apr 2024
Using Interpretation Methods for Model Enhancement
Using Interpretation Methods for Model Enhancement
Zhuo Chen
Chengyue Jiang
Kewei Tu
83
2
0
02 Apr 2024
Towards Faithful Explanations: Boosting Rationalization with Shortcuts
  Discovery
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery
Linan Yue
Qi Liu
Yichao Du
Li Wang
Weibo Gao
Yanqing An
82
5
0
12 Mar 2024
Learning to Maximize Mutual Information for Chain-of-Thought
  Distillation
Learning to Maximize Mutual Information for Chain-of-Thought Distillation
Xin Chen
Hanxian Huang
Yanjun Gao
Yi Wang
Jishen Zhao
Ke Ding
100
15
0
05 Mar 2024
Learning Intrinsic Dimension via Information Bottleneck for Explainable
  Aspect-based Sentiment Analysis
Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis
Zhenxiao Cheng
Jie Zhou
Wen Wu
Qin Chen
Liang He
70
0
0
28 Feb 2024
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal
  Feature Removal Problem
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem
Qian Chen
Tao Zhang
Dongyang Li
Xiaofeng He
95
0
0
13 Dec 2023
Unsupervised Text Style Transfer with Deep Generative Models
Unsupervised Text Style Transfer with Deep Generative Models
Zhongtao Jiang
Yuanzhe Zhang
Yiming Ju
Kang Liu
71
0
0
31 Aug 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale
  Supervision
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
80
4
0
23 May 2023
Consistent Multi-Granular Rationale Extraction for Explainable Multi-hop
  Fact Verification
Consistent Multi-Granular Rationale Extraction for Explainable Multi-hop Fact Verification
Jiasheng Si
Yingjie Zhu
Deyu Zhou
AAML
131
4
0
16 May 2023
A Multi-Grained Self-Interpretable Symbolic-Neural Model For
  Single/Multi-Labeled Text Classification
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification
Xiang Hu
Xinyu Kong
Kewei Tu
MILMBDL
65
5
0
06 Mar 2023
Tell Model Where to Attend: Improving Interpretability of Aspect-Based
  Sentiment Classification via Small Explanation Annotations
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations
Zhenxiao Cheng
Jie Zhou
Wen Wu
Qin Chen
Liang He
79
3
0
21 Feb 2023
Improving Interpretability via Explicit Word Interaction Graph Layer
Improving Interpretability via Explicit Word Interaction Graph Layer
Arshdeep Sekhon
Hanjie Chen
A. Shrivastava
Zhe Wang
Yangfeng Ji
Yanjun Qi
AI4CEMILM
70
6
0
03 Feb 2023
Identifying the Source of Vulnerability in Explanation Discrepancy: A
  Case Study in Neural Text Classification
Identifying the Source of Vulnerability in Explanation Discrepancy: A Case Study in Neural Text Classification
Ruixuan Tang
Hanjie Chen
Yangfeng Ji
AAMLFAtt
73
3
0
10 Dec 2022
Unsupervised Text Deidentification
Unsupervised Text Deidentification
John X. Morris
Justin T. Chiu
Ramin Zabih
Alexander M. Rush
70
7
0
20 Oct 2022
An Information Minimization Based Contrastive Learning Model for
  Unsupervised Sentence Embeddings Learning
An Information Minimization Based Contrastive Learning Model for Unsupervised Sentence Embeddings Learning
Shaobin Chen
Jie Zhou
Yuling Sun
Liang He
SSL
78
7
0
22 Sep 2022
Human-Centric Research for NLP: Towards a Definition and Guiding
  Questions
Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Bhushan Kotnis
Kiril Gashteovski
J. Gastinger
G. Serra
Francesco Alesiani
T. Sztyler
Ammar Shaker
Na Gong
Carolin (Haas) Lawrence
Zhao Xu
78
9
0
10 Jul 2022
Pathologies of Pre-trained Language Models in Few-shot Fine-tuning
Pathologies of Pre-trained Language Models in Few-shot Fine-tuning
Hanjie Chen
Guoqing Zheng
Ahmed Hassan Awadallah
Yangfeng Ji
AI4MH
86
3
0
17 Apr 2022
Interpretable Research Replication Prediction via Variational Contextual
  Consistency Sentence Masking
Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking
Tianyi Luo
Rui Meng
Xinze Wang
Yongxu Liu
54
4
0
28 Mar 2022
Adversarial Training for Improving Model Robustness? Look at Both
  Prediction and Interpretation
Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation
Hanjie Chen
Yangfeng Ji
OODAAMLVLM
101
21
0
23 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAMLELM
76
20
0
21 Mar 2022
Hierarchical Interpretation of Neural Text Classification
Hierarchical Interpretation of Neural Text Classification
Hanqi Yan
Lin Gui
Yulan He
107
14
0
20 Feb 2022
Human Guided Exploitation of Interpretable Attention Patterns in
  Summarization and Topic Segmentation
Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation
Raymond Li
Wen Xiao
Linzi Xing
Lanjun Wang
Gabriel Murray
Giuseppe Carenini
ViT
67
8
0
10 Dec 2021
Interpreting Deep Learning Models in Natural Language Processing: A
  Review
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
99
47
0
20 Oct 2021
Logic Traps in Evaluating Attribution Scores
Logic Traps in Evaluating Attribution Scores
Yiming Ju
Yuanzhe Zhang
Zhao Yang
Zhongtao Jiang
Kang Liu
Jun Zhao
XAIFAtt
119
19
0
12 Sep 2021
Towards Improving Adversarial Training of NLP Models
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo
Yanjun Qi
AAML
206
127
0
01 Sep 2021
Local Explanation of Dialogue Response Generation
Local Explanation of Dialogue Response Generation
Yi-Lin Tuan
Connor Pryor
Wenhu Chen
Lise Getoor
Wenjie Wang
76
12
0
11 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for
  Feature Importance Explanations
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations
Peter Hase
Harry Xie
Joey Tianyi Zhou
OODDLRMFAtt
123
91
0
01 Jun 2021
Improving the Faithfulness of Attention-based Explanations with
  Task-specific Information for Text Classification
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification
G. Chrysostomou
Nikolaos Aletras
85
38
0
06 May 2021
Flexible Instance-Specific Rationalization of NLP Models
Flexible Instance-Specific Rationalization of NLP Models
G. Chrysostomou
Nikolaos Aletras
82
14
0
16 Apr 2021
Explaining Neural Network Predictions on Sentence Pairs via Learning
  Word-Group Masks
Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks
Hanjie Chen
Song Feng
Jatin Ganhotra
H. Wan
Chulaka Gunasekara
Sachindra Joshi
Yangfeng Ji
73
18
0
09 Apr 2021
Explaining the Road Not Taken
Explaining the Road Not Taken
Hua Shen
Ting-Hao 'Kenneth' Huang
FAttXAI
64
9
0
27 Mar 2021
A Unified Approach to Interpreting and Boosting Adversarial
  Transferability
A Unified Approach to Interpreting and Boosting Adversarial Transferability
Xin Eric Wang
Jie Ren
Shuyu Lin
Xiangming Zhu
Yisen Wang
Quanshi Zhang
AAML
143
96
0
08 Oct 2020
1