CausaLM: Causal Model Explanation Through Counterfactual Language Models

27 May 2020

Papers citing "CausaLM: Causal Model Explanation Through Counterfactual Language Models"

41 / 41 papers shown

Title
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Nitay Calderon Roi Reichart 40 10 0 27 Jul 2024
Representing Rule-based Chatbots with Transformers Dan Friedman Abhishek Panigrahi Danqi Chen 66 1 0 15 Jul 2024
Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models Christopher Burger Yifan Hu Thai Le KELM 41 0 0 22 Jun 2024
Relative Counterfactual Contrastive Learning for Mitigating Pretrained Stance Bias in Stance Detection Jiarui Zhang Shaojuan Wu Xiaowang Zhang Zhiyong Feng 36 0 0 16 May 2024
Interpretation of Intracardiac Electrograms Through Textual Representations William Jongwon Han Diana Gomez Avi Alok Chaojing Duan Michael A. Rosenberg Douglas Weber Emerson Liu Ding Zhao 26 1 0 02 Feb 2024
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia Giovanni Monea Maxime Peyrard Martin Josifoski Vishrav Chaudhary Jason Eisner Emre Kiciman Hamid Palangi Barun Patra Robert West KELM 51 12 0 04 Dec 2023
Data Augmentations for Improved (Large) Language Model Generalization Amir Feder Yoav Wald Claudia Shi S. Saria David M. Blei OOD CML 32 7 0 19 Oct 2023
Accurate Use of Label Dependency in Multi-Label Text Classification Through the Lens of Causality Caoyun Fan Wenqing Chen Jidong Tian Yitian Li Hao He Yaohui Jin 49 6 0 11 Oct 2023
A Geometric Notion of Causal Probing Clément Guerner Anej Svete Tianyu Liu Alex Warstadt Ryan Cotterell LLMSV 38 12 0 27 Jul 2023
Entity-Based Evaluation of Political Bias in Automatic Summarization Karen Zhou Chenhao Tan 35 1 0 03 May 2023
NxPlain: Web-based Tool for Discovery of Latent Concepts Fahim Dalvi Nadir Durrani Hassan Sajjad Tamim Jaban Musab Husaini Ummar Abbas 15 1 0 06 Mar 2023
A Picture May Be Worth a Thousand Lives: An Interpretable Artificial Intelligence Strategy for Predictions of Suicide Risk from Social Media Images Yael Badian Yaakov Ophir Refael Tikochinski Nitay Calderon A. Klomek Roi Reichart 24 4 0 19 Feb 2023
On the Transformation of Latent Space in Fine-Tuned NLP Models Nadir Durrani Hassan Sajjad Fahim Dalvi Firoj Alam 32 18 0 23 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings Filip Klubicka John D. Kelleher 30 4 0 21 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review Dieuwke Hupkes Mario Giulianelli Verna Dankers Mikel Artetxe Yanai Elazar ... Leila Khalatbari Maria Ryskina Rita Frieske Ryan Cotterell Zhijing Jin 119 93 0 06 Oct 2022
Causal Proxy Models for Concept-Based Model Explanations Zhengxuan Wu Karel DÓosterlinck Atticus Geiger Amir Zur Christopher Potts MILM 80 35 0 28 Sep 2022
Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models Esma Balkir S. Kiritchenko I. Nejadgholi Kathleen C. Fraser 21 36 0 08 Jun 2022
Post-hoc Concept Bottleneck Models Mert Yuksekgonul Maggie Wang James Zou 145 185 0 31 May 2022
Interpretation of Black Box NLP Models: A Survey Shivani Choudhary N. Chatterjee S. K. Saha FAtt 34 10 0 31 Mar 2022
How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis Shaobo Li Xiaoguang Li Lifeng Shang Zhenhua Dong Chengjie Sun Bingquan Liu Zhenzhou Ji Xin Jiang Qun Liu KELM 31 53 0 31 Mar 2022
FaiRR: Faithful and Robust Deductive Reasoning over Natural Language Soumya Sanyal Harman Singh Xiang Ren ReLM LRM 29 44 0 19 Mar 2022
Locating and Editing Factual Associations in GPT Kevin Meng David Bau A. Andonian Yonatan Belinkov KELM 56 1,192 0 10 Feb 2022
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation Yi-Fan Zhang Hanlin Zhang Zachary Chase Lipton Li Erran Li Eric P. Xing OODD 24 29 0 02 Feb 2022
Diagnosing AI Explanation Methods with Folk Concepts of Behavior Alon Jacovi Jasmijn Bastings Sebastian Gehrmann Yoav Goldberg Katja Filippova 36 15 0 27 Jan 2022
A Causal Lens for Controllable Text Generation Zhiting Hu Erran L. Li 45 59 0 22 Jan 2022
Sparse Interventions in Language Models with Differentiable Masking Nicola De Cao Leon Schmid Dieuwke Hupkes Ivan Titov 40 27 0 13 Dec 2021
Inducing Causal Structure for Interpretable Neural Networks Atticus Geiger Zhengxuan Wu Hanson Lu J. Rozner Elisa Kreiss Thomas F. Icard Noah D. Goodman Christopher Potts CML OOD 35 70 0 01 Dec 2021
On the Pitfalls of Analyzing Individual Neurons in Language Models Omer Antverg Yonatan Belinkov MILM 24 49 0 14 Oct 2021
Putting Words in BERT's Mouth: Navigating Contextualized Vector Spaces with Pseudowords Taelin Karidi Yichu Zhou Nathan Schneider Omri Abend Vivek Srikumar 86 13 0 23 Sep 2021
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond Amir Feder Katherine A. Keith Emaad A. Manzoor Reid Pryzant Dhanya Sridhar ... Roi Reichart Margaret E. Roberts Brandon M Stewart Victor Veitch Diyi Yang CML 41 234 0 02 Sep 2021
Neuron-level Interpretation of Deep NLP Models: A Survey Hassan Sajjad Nadir Durrani Fahim Dalvi MILM AI4CE 35 80 0 30 Aug 2021
Counterfactual Explainable Recommendation Juntao Tan Shuyuan Xu Yingqiang Ge Yunqi Li Xu Chen Yongfeng Zhang CML 22 141 0 24 Aug 2021
Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions Daniel Rosenberg Itai Gat Amir Feder Roi Reichart AAML 36 16 0 08 Jun 2021
Causal Abstractions of Neural Networks Atticus Geiger Hanson Lu Thomas F. Icard Christopher Potts NAI CML 17 218 0 06 Jun 2021
Contrastive Explanations for Model Interpretability Alon Jacovi Swabha Swayamdipta Shauli Ravfogel Yanai Elazar Yejin Choi Yoav Goldberg 44 95 0 02 Mar 2021
Probing Classifiers: Promises, Shortcomings, and Advances Yonatan Belinkov 226 405 0 24 Feb 2021
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 201 882 0 03 May 2018
How to Make Causal Inferences Using Texts Naoki Egami Christian Fong Justin Grimmer Margaret E. Roberts Brandon M Stewart CML 28 137 0 06 Feb 2018
Towards A Rigorous Science of Interpretable Machine Learning Finale Doshi-Velez Been Kim XAI FaML 257 3,684 0 28 Feb 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation Yonghui Wu M. Schuster Z. Chen Quoc V. Le Mohammad Norouzi ... Alex Rudnick Oriol Vinyals G. Corrado Macduff Hughes J. Dean AIMat 716 6,746 0 26 Sep 2016
Learning Representations for Counterfactual Inference Fredrik D. Johansson Uri Shalit David Sontag CML OOD BDL 232 719 0 12 May 2016