Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers

1 October 2020

Papers citing "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"

36 / 36 papers shown

Title
Fact in Fragments: Deconstructing Complex Claims via LLM-based Atomic Fact Extraction and Verification Liwen Zheng Chaozhuo Li Zheng Liu Feiran Huang Haoran Jia Zaisheng Ye Xi Zhang HILM 23 0 0 09 Jun 2025
Learning Distribution-Wise Control in Representation Space for Language Models Chunyuan Deng Ruidi Chang Hanjie Chen 22 0 0 07 Jun 2025
Normalized AOPC: Fixing Misleading Faithfulness Metrics for Feature Attribution Explainability Joakim Edin Andreas Geert Motzfeldt Casper L. Christensen Tuukka Ruotsalo Lars Maaløe Maria Maistro 132 4 0 15 Aug 2024
XPrompt: Explaining Large Language Model's Generation via Joint Prompt Attribution Yurui Chang Bochuan Cao Yujia Wang Jinghui Chen Lu Lin LRM 85 2 0 30 May 2024
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas Resck Marcos M. Raimundo Jorge Poco 129 3 0 03 Apr 2024
Using Interpretation Methods for Model Enhancement Zhuo Chen Chengyue Jiang Kewei Tu 83 2 0 02 Apr 2024
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery Linan Yue Qi Liu Yichao Du Li Wang Weibo Gao Yanqing An 82 5 0 12 Mar 2024
Learning to Maximize Mutual Information for Chain-of-Thought Distillation Xin Chen Hanxian Huang Yanjun Gao Yi Wang Jishen Zhao Ke Ding 100 15 0 05 Mar 2024
Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis Zhenxiao Cheng Jie Zhou Wen Wu Qin Chen Liang He 70 0 0 28 Feb 2024
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem Qian Chen Tao Zhang Dongyang Li Xiaofeng He 95 0 0 13 Dec 2023
Unsupervised Text Style Transfer with Deep Generative Models Zhongtao Jiang Yuanzhe Zhang Yiming Ju Kang Liu 71 0 0 31 Aug 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision Wenting Zhao Justin T. Chiu Claire Cardie Alexander M. Rush LRM 80 4 0 23 May 2023
Consistent Multi-Granular Rationale Extraction for Explainable Multi-hop Fact Verification Jiasheng Si Yingjie Zhu Deyu Zhou AAML 131 4 0 16 May 2023
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification Xiang Hu Xinyu Kong Kewei Tu MILM BDL 65 5 0 06 Mar 2023
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations Zhenxiao Cheng Jie Zhou Wen Wu Qin Chen Liang He 79 3 0 21 Feb 2023
Improving Interpretability via Explicit Word Interaction Graph Layer Arshdeep Sekhon Hanjie Chen A. Shrivastava Zhe Wang Yangfeng Ji Yanjun Qi AI4CE MILM 70 6 0 03 Feb 2023
Identifying the Source of Vulnerability in Explanation Discrepancy: A Case Study in Neural Text Classification Ruixuan Tang Hanjie Chen Yangfeng Ji AAML FAtt 73 3 0 10 Dec 2022
Unsupervised Text Deidentification John X. Morris Justin T. Chiu Ramin Zabih Alexander M. Rush 70 7 0 20 Oct 2022
An Information Minimization Based Contrastive Learning Model for Unsupervised Sentence Embeddings Learning Shaobin Chen Jie Zhou Yuling Sun Liang He SSL 78 7 0 22 Sep 2022
Human-Centric Research for NLP: Towards a Definition and Guiding Questions Bhushan Kotnis Kiril Gashteovski J. Gastinger G. Serra Francesco Alesiani T. Sztyler Ammar Shaker Na Gong Carolin (Haas) Lawrence Zhao Xu 78 9 0 10 Jul 2022
Pathologies of Pre-trained Language Models in Few-shot Fine-tuning Hanjie Chen Guoqing Zheng Ahmed Hassan Awadallah Yangfeng Ji AI4MH 86 3 0 17 Apr 2022
Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking Tianyi Luo Rui Meng Xinze Wang Yongxu Liu 54 4 0 28 Mar 2022
Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation Hanjie Chen Yangfeng Ji OOD AAML VLM 101 21 0 23 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation Christoph Leiter Piyawat Lertvittayakumjorn M. Fomicheva Wei Zhao Yang Gao Steffen Eger AAML ELM 76 20 0 21 Mar 2022
Hierarchical Interpretation of Neural Text Classification Hanqi Yan Lin Gui Yulan He 107 14 0 20 Feb 2022
Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation Raymond Li Wen Xiao Linzi Xing Lanjun Wang Gabriel Murray Giuseppe Carenini ViT 67 8 0 10 Dec 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review Xiaofei Sun Diyi Yang Xiaoya Li Tianwei Zhang Yuxian Meng Han Qiu Guoyin Wang Eduard H. Hovy Jiwei Li 99 47 0 20 Oct 2021
Logic Traps in Evaluating Attribution Scores Yiming Ju Yuanzhe Zhang Zhao Yang Zhongtao Jiang Kang Liu Jun Zhao XAI FAtt 119 19 0 12 Sep 2021
Towards Improving Adversarial Training of NLP Models Jin Yong Yoo Yanjun Qi AAML 206 127 0 01 Sep 2021
Local Explanation of Dialogue Response Generation Yi-Lin Tuan Connor Pryor Wenhu Chen Lise Getoor Wenjie Wang 76 12 0 11 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations Peter Hase Harry Xie Joey Tianyi Zhou OODD LRM FAtt 123 91 0 01 Jun 2021
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification G. Chrysostomou Nikolaos Aletras 85 38 0 06 May 2021
Flexible Instance-Specific Rationalization of NLP Models G. Chrysostomou Nikolaos Aletras 82 14 0 16 Apr 2021
Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks Hanjie Chen Song Feng Jatin Ganhotra H. Wan Chulaka Gunasekara Sachindra Joshi Yangfeng Ji 73 18 0 09 Apr 2021
Explaining the Road Not Taken Hua Shen Ting-Hao 'Kenneth' Huang FAtt XAI 64 9 0 27 Mar 2021
A Unified Approach to Interpreting and Boosting Adversarial Transferability Xin Eric Wang Jie Ren Shuyu Lin Xiangming Zhu Yisen Wang Quanshi Zhang AAML 143 96 0 08 Oct 2020