Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.16863
Cited By
v1
v2 (latest)
Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers
26 May 2023
Parikshit Bansal
Amit Sharma
CML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers"
27 / 27 papers shown
Title
Estimating Causal Effects of Text Interventions Leveraging LLMs
Siyi Guo
Myrl G. Marmarelis
Fred Morstatter
Kristina Lerman
CML
473
0
0
28 Oct 2024
Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens
Nitish Joshi
X. Pan
Hengxing He
CML
110
30
0
25 Oct 2022
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation
Phillip Howard
Gadi Singer
Vasudev Lal
Yejin Choi
Swabha Swayamdipta
CML
103
25
0
22 Oct 2022
Controlling Bias Exposure for Fair Interpretable Predictions
Zexue He
Yu Wang
Julian McAuley
Bodhisattwa Prasad Majumder
58
19
0
14 Oct 2022
Causal Estimation for Text Data with (Apparent) Overlap Violations
Lin Gui
Victor Veitch
OOD
77
13
0
30 Sep 2022
Shortcut Learning of Large Language Models in Natural Language Understanding
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Xia Hu
KELM
OffRL
123
89
0
25 Aug 2022
Probing Classifiers are Unreliable for Concept Removal and Detection
Abhinav Kumar
Chenhao Tan
Amit Sharma
AAML
72
25
0
08 Jul 2022
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal
Umang Gupta
Jwala Dhamala
Varun Kumar
Apurv Verma
Yada Pruksachatkun
Satyapriya Krishna
Rahul Gupta
Kai-Wei Chang
Greg Ver Steeg
Aram Galstyan
53
53
0
23 Mar 2022
Diversify and Disambiguate: Learning From Underspecified Data
Yoonho Lee
Huaxiu Yao
Chelsea Finn
263
66
0
07 Feb 2022
Linear Adversarial Concept Erasure
Shauli Ravfogel
Michael Twiton
Yoav Goldberg
Ryan Cotterell
KELM
117
63
0
28 Jan 2022
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
Maarten Sap
Swabha Swayamdipta
Laura Vianna
Xuhui Zhou
Yejin Choi
Noah A. Smith
81
283
0
15 Nov 2021
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models
Tianlu Wang
Rohit Sridhar
Diyi Yang
Xuezhi Wang
AAML
195
76
0
14 Oct 2021
RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests
Victor Chernozhukov
Whitney Newey
Victor Quintas-Martinez
Vasilis Syrgkanis
CML
55
40
0
06 Oct 2021
Combining Feature and Instance Attribution to Detect Artifacts
Pouya Pezeshkpour
Sarthak Jain
Sameer Singh
Byron C. Wallace
TDI
106
42
0
01 Jul 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
103
250
0
01 Jan 2021
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
120
688
0
06 Nov 2020
An Investigation of Why Overparameterization Exacerbates Spurious Correlations
Shiori Sagawa
Aditi Raghunathan
Pang Wei Koh
Percy Liang
191
383
0
09 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
208
1,107
0
08 May 2020
Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection
Shauli Ravfogel
Yanai Elazar
Hila Gonen
Michael Twiton
Yoav Goldberg
138
388
0
16 Apr 2020
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
Shiori Sagawa
Pang Wei Koh
Tatsunori B. Hashimoto
Percy Liang
OOD
108
1,248
0
20 Nov 2019
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
Divyansh Kaushik
Eduard H. Hovy
Zachary Chase Lipton
CML
96
570
0
26 Sep 2019
End-to-End Bias Mitigation by Modelling Biases in Corpora
Rabeeh Karimi Mahabadi
Yonatan Belinkov
James Henderson
127
181
0
13 Sep 2019
Mitigating Gender Bias in Natural Language Processing: Literature Review
Tony Sun
Andrew Gaut
Shirlyn Tang
Yuxin Huang
Mai Elsherief
Jieyu Zhao
Diba Mirza
E. Belding-Royer
Kai-Wei Chang
William Yang Wang
AI4CE
108
562
0
21 Jun 2019
Adapting Neural Networks for the Estimation of Treatment Effects
Claudia Shi
David M. Blei
Victor Veitch
CML
148
376
0
05 Jun 2019
Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification
Daniel Borkan
Lucas Dixon
Jeffrey Scott Sorensen
Nithum Thain
Lucy Vasserman
90
492
0
11 Mar 2019
Gender Bias in Neural Natural Language Processing
Kaiji Lu
Piotr (Peter) Mardziel
Fangjing Wu
Preetam Amancharla
Anupam Datta
117
356
0
31 Jul 2018
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
524
4,494
0
18 Apr 2017
1