CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model
Behavior

CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior

27 May 2022

Eldar David Abraham

Karel DÓosterlinck

Christopher Potts

Papers citing "CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior"

12 / 12 papers shown

Title
MIB: A Mechanistic Interpretability Benchmark Aaron Mueller Atticus Geiger Sarah Wiegreffe Dana Arad Iván Arcuschin ... Alessandro Stolfo Martin Tutek Amir Zur David Bau Yonatan Belinkov 51 1 0 17 Apr 2025
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Nitay Calderon Roi Reichart 40 10 0 27 Jul 2024
Latent Concept-based Explanation of NLP Models Xuemin Yu Fahim Dalvi Nadir Durrani Marzia Nouri Hassan Sajjad LRM FAtt 29 1 0 18 Apr 2024
Interpreting Pretrained Language Models via Concept Bottlenecks Zhen Tan Lu Cheng Song Wang Yuan Bo Wenlin Yao Huan Liu LRM 32 20 0 08 Nov 2023
Data Augmentations for Improved (Large) Language Model Generalization Amir Feder Yoav Wald Claudia Shi S. Saria David M. Blei OOD CML 32 7 0 19 Oct 2023
A Geometric Notion of Causal Probing Clément Guerner Anej Svete Tianyu Liu Alex Warstadt Ryan Cotterell LLMSV 41 12 0 27 Jul 2023
Causal Proxy Models for Concept-Based Model Explanations Zhengxuan Wu Karel DÓosterlinck Atticus Geiger Amir Zur Christopher Potts MILM 80 35 0 28 Sep 2022
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation Yi-Fan Zhang Hanlin Zhang Zachary Chase Lipton Li Erran Li Eric P. Xing OODD 24 29 0 02 Feb 2022
On Completeness-aware Concept-Based Explanations in Deep Neural Networks Chih-Kuan Yeh Been Kim Sercan Ö. Arik Chun-Liang Li Tomas Pfister Pradeep Ravikumar FAtt 122 297 0 17 Oct 2019
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 201 882 0 03 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 297 6,984 0 20 Apr 2018
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 218 7,925 0 17 Aug 2015