v1v2 (latest)

Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

1 January 2021

Tongshuang Wu

Marco Tulio Ribeiro

Jeffrey Heer

Daniel S. Weld

ArXiv (abs)PDF HTML

Papers citing "Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models"

32 / 182 papers shown

Title
Counterfactual Explanations for Natural Language Interfaces George Tolkachev Stephen Mell Steve Zdancewic Osbert Bastani LRM AAML 45 4 0 27 Apr 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching Patrick Fernandes Marcos Vinícius Treviso Danish Pruthi André F. T. Martins Graham Neubig FAtt 98 22 0 22 Apr 2022
Measuring Compositional Consistency for Video Question Answering Mona Gandhi Mustafa Omer Gul Eva Prakash Madeleine Grunde-McLaughlin Ranjay Krishna Maneesh Agrawala CoGe 92 16 0 14 Apr 2022
Interpretation of Black Box NLP Models: A Survey Shivani Choudhary N. Chatterjee S. K. Saha FAtt 86 11 0 31 Mar 2022
Text Transformations in Contrastive Self-Supervised Learning: A Review Amrita Bhattacharjee Mansooreh Karami Huan Liu SSL 108 23 0 22 Mar 2022
CARETS: A Consistency And Robustness Evaluative Test Suite for VQA Carlos E. Jimenez Olga Russakovsky Karthik Narasimhan CoGe 84 14 0 15 Mar 2022
Counterfactually Evaluating Explanations in Recommender Systems Yuanshun Yao Chong Wang Hang Li OffRL LRM 82 7 0 02 Mar 2022
Automatically Generating Counterfactuals for Relation Classification Mi Zhang T. Qian Tingyu Zhang CML 57 0 0 22 Feb 2022
Prediction Sensitivity: Continual Audit of Counterfactual Fairness in Deployed Classifiers Krystal Maughan Ivoline C. Ngong Joseph P. Near 44 2 0 09 Feb 2022
Red Teaming Language Models with Language Models Ethan Perez Saffron Huang Francis Song Trevor Cai Roman Ring John Aslanides Amelia Glaese Nat McAleese G. Irving AAML 240 672 0 07 Feb 2022
Analogies and Feature Attributions for Model Agnostic Explanation of Similarity Learners Karthikeyan N. Ramamurthy Amit Dhurandhar Dennis L. Wei Zaid Bin Tariq FAtt 81 3 0 02 Feb 2022
ROCK: Causal Inference Principles for Reasoning about Commonsense Causality Jiayao Zhang Hongming Zhang Weijie J. Su Dan Roth CML LRM 247 25 0 31 Jan 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants Max Bartolo Tristan Thrush Sebastian Riedel Pontus Stenetorp Robin Jia Douwe Kiela 100 34 0 16 Dec 2021
Measure and Improve Robustness in NLP Models: A Survey Xuezhi Wang Haohan Wang Diyi Yang 305 139 0 15 Dec 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation Kaustubh D. Dhole Varun Gangal Sebastian Gehrmann Aadesh Gupta Zhenhao Li ... Tianbao Xie Usama Yaseen Michael A. Yee Jing Zhang Yue Zhang 244 88 0 06 Dec 2021
How Emotionally Stable is ALBERT? Testing Robustness with Stochastic Weight Averaging on a Sentiment Analysis Task Urja Khurana Eric T. Nalisnick Antske Fokkens MoMe 72 6 0 18 Nov 2021
SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets Ann Yuan Daphne Ippolito Vitaly Nikolaev Chris Callison-Burch Andy Coenen Sebastian Gehrmann SyDa 192 23 0 11 Nov 2021
Counterfactual Explanations for Models of Code Jürgen Cito Işıl Dillig V. Murali S. Chandra AAML LRM 73 52 0 10 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey Bonan Min Hayley L Ross Elior Sulem Amir Pouran Ben Veyseh Thien Huu Nguyen Oscar Sainz Eneko Agirre Ilana Heinz Dan Roth LM&MA VLM AI4CE 197 1,103 0 01 Nov 2021
Retrieval-guided Counterfactual Generation for QA Bhargavi Paranjape Matthew Lamm Ian Tenney 94 31 0 14 Oct 2021
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts Tongshuang Wu Michael Terry Carrie J. Cai LLMAG AI4CE LRM 127 472 0 04 Oct 2021
Enhancing Model Robustness and Fairness with Causality: A Regularization Approach Zhao Wang Kai Shu A. Culotta OOD 120 14 0 03 Oct 2021
Let the CAT out of the bag: Contrastive Attributed explanations for Text Saneem A. Chemmengath A. Azad Ronny Luss Amit Dhurandhar FAtt 103 10 0 16 Sep 2021
Post-hoc Interpretability for Neural NLP: A Survey Andreas Madsen Siva Reddy A. Chandar XAI 131 234 0 10 Aug 2021
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition Mor Geva Tomer Wolfson Jonathan Berant ReLM LRM 65 21 0 29 Jul 2021
Tailor: Generating and Perturbing Text with Semantic Controls Alexis Ross Tongshuang Wu Hao Peng Matthew E. Peters Matt Gardner 202 79 0 15 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data Nitish Joshi He He OODD 88 47 0 01 Jul 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests Victor Veitch Alexander DÁmour Steve Yadlowsky Jacob Eisenstein OOD 91 94 0 31 May 2021
Local Interpretations for Explainable Natural Language Processing: A Survey Siwen Luo Hamish Ivison S. Han Josiah Poon MILM 120 52 0 20 Mar 2021
Contrastive Explanations for Model Interpretability Alon Jacovi Swabha Swayamdipta Shauli Ravfogel Yanai Elazar Yejin Choi Yoav Goldberg 163 98 0 02 Mar 2021
Benchmarking and Survey of Explanation Methods for Black Box Models F. Bodria F. Giannotti Riccardo Guidotti Francesca Naretto D. Pedreschi S. Rinzivillo XAI 127 234 0 25 Feb 2021
Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing Sarah Wiegreffe Ana Marasović XAI 104 146 0 24 Feb 2021