Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.00288
Cited By
v1
v2 (latest)
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
1 January 2021
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models"
32 / 182 papers shown
Title
Counterfactual Explanations for Natural Language Interfaces
George Tolkachev
Stephen Mell
Steve Zdancewic
Osbert Bastani
LRM
AAML
45
4
0
27 Apr 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching
Patrick Fernandes
Marcos Vinícius Treviso
Danish Pruthi
André F. T. Martins
Graham Neubig
FAtt
98
22
0
22 Apr 2022
Measuring Compositional Consistency for Video Question Answering
Mona Gandhi
Mustafa Omer Gul
Eva Prakash
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
92
16
0
14 Apr 2022
Interpretation of Black Box NLP Models: A Survey
Shivani Choudhary
N. Chatterjee
S. K. Saha
FAtt
86
11
0
31 Mar 2022
Text Transformations in Contrastive Self-Supervised Learning: A Review
Amrita Bhattacharjee
Mansooreh Karami
Huan Liu
SSL
108
23
0
22 Mar 2022
CARETS: A Consistency And Robustness Evaluative Test Suite for VQA
Carlos E. Jimenez
Olga Russakovsky
Karthik Narasimhan
CoGe
84
14
0
15 Mar 2022
Counterfactually Evaluating Explanations in Recommender Systems
Yuanshun Yao
Chong Wang
Hang Li
OffRL
LRM
82
7
0
02 Mar 2022
Automatically Generating Counterfactuals for Relation Classification
Mi Zhang
T. Qian
Tingyu Zhang
CML
57
0
0
22 Feb 2022
Prediction Sensitivity: Continual Audit of Counterfactual Fairness in Deployed Classifiers
Krystal Maughan
Ivoline C. Ngong
Joseph P. Near
44
2
0
09 Feb 2022
Red Teaming Language Models with Language Models
Ethan Perez
Saffron Huang
Francis Song
Trevor Cai
Roman Ring
John Aslanides
Amelia Glaese
Nat McAleese
G. Irving
AAML
240
672
0
07 Feb 2022
Analogies and Feature Attributions for Model Agnostic Explanation of Similarity Learners
Karthikeyan N. Ramamurthy
Amit Dhurandhar
Dennis L. Wei
Zaid Bin Tariq
FAtt
81
3
0
02 Feb 2022
ROCK: Causal Inference Principles for Reasoning about Commonsense Causality
Jiayao Zhang
Hongming Zhang
Weijie J. Su
Dan Roth
CML
LRM
247
25
0
31 Jan 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
Max Bartolo
Tristan Thrush
Sebastian Riedel
Pontus Stenetorp
Robin Jia
Douwe Kiela
100
34
0
16 Dec 2021
Measure and Improve Robustness in NLP Models: A Survey
Xuezhi Wang
Haohan Wang
Diyi Yang
305
139
0
15 Dec 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
244
88
0
06 Dec 2021
How Emotionally Stable is ALBERT? Testing Robustness with Stochastic Weight Averaging on a Sentiment Analysis Task
Urja Khurana
Eric T. Nalisnick
Antske Fokkens
MoMe
72
6
0
18 Nov 2021
SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets
Ann Yuan
Daphne Ippolito
Vitaly Nikolaev
Chris Callison-Burch
Andy Coenen
Sebastian Gehrmann
SyDa
192
23
0
11 Nov 2021
Counterfactual Explanations for Models of Code
Jürgen Cito
Işıl Dillig
V. Murali
S. Chandra
AAML
LRM
73
52
0
10 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
197
1,103
0
01 Nov 2021
Retrieval-guided Counterfactual Generation for QA
Bhargavi Paranjape
Matthew Lamm
Ian Tenney
94
31
0
14 Oct 2021
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts
Tongshuang Wu
Michael Terry
Carrie J. Cai
LLMAG
AI4CE
LRM
127
472
0
04 Oct 2021
Enhancing Model Robustness and Fairness with Causality: A Regularization Approach
Zhao Wang
Kai Shu
A. Culotta
OOD
120
14
0
03 Oct 2021
Let the CAT out of the bag: Contrastive Attributed explanations for Text
Saneem A. Chemmengath
A. Azad
Ronny Luss
Amit Dhurandhar
FAtt
103
10
0
16 Sep 2021
Post-hoc Interpretability for Neural NLP: A Survey
Andreas Madsen
Siva Reddy
A. Chandar
XAI
131
234
0
10 Aug 2021
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition
Mor Geva
Tomer Wolfson
Jonathan Berant
ReLM
LRM
65
21
0
29 Jul 2021
Tailor: Generating and Perturbing Text with Semantic Controls
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
202
79
0
15 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data
Nitish Joshi
He He
OODD
88
47
0
01 Jul 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests
Victor Veitch
Alexander DÁmour
Steve Yadlowsky
Jacob Eisenstein
OOD
91
94
0
31 May 2021
Local Interpretations for Explainable Natural Language Processing: A Survey
Siwen Luo
Hamish Ivison
S. Han
Josiah Poon
MILM
120
52
0
20 Mar 2021
Contrastive Explanations for Model Interpretability
Alon Jacovi
Swabha Swayamdipta
Shauli Ravfogel
Yanai Elazar
Yejin Choi
Yoav Goldberg
163
98
0
02 Mar 2021
Benchmarking and Survey of Explanation Methods for Black Box Models
F. Bodria
F. Giannotti
Riccardo Guidotti
Francesca Naretto
D. Pedreschi
S. Rinzivillo
XAI
127
234
0
25 Feb 2021
Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing
Sarah Wiegreffe
Ana Marasović
XAI
104
146
0
24 Feb 2021
Previous
1
2
3
4