Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers

26 May 2023

Papers citing "Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers"

27 / 27 papers shown

Title
Estimating Causal Effects of Text Interventions Leveraging LLMs Siyi Guo Myrl G. Marmarelis Fred Morstatter Kristina Lerman CML 473 0 0 28 Oct 2024
Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens Nitish Joshi X. Pan He He CML 110 30 0 25 Oct 2022
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation Phillip Howard Gadi Singer Vasudev Lal Yejin Choi Swabha Swayamdipta CML 103 25 0 22 Oct 2022
Controlling Bias Exposure for Fair Interpretable Predictions Zexue He Yu Wang Julian McAuley Bodhisattwa Prasad Majumder 58 19 0 14 Oct 2022
Causal Estimation for Text Data with (Apparent) Overlap Violations Lin Gui Victor Veitch OOD 77 13 0 30 Sep 2022
Shortcut Learning of Large Language Models in Natural Language Understanding Mengnan Du Fengxiang He Na Zou Dacheng Tao Xia Hu KELM OffRL 123 89 0 25 Aug 2022
Probing Classifiers are Unreliable for Concept Removal and Detection Abhinav Kumar Chenhao Tan Amit Sharma AAML 72 25 0 08 Jul 2022
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal Umang Gupta Jwala Dhamala Varun Kumar Apurv Verma Yada Pruksachatkun Satyapriya Krishna Rahul Gupta Kai-Wei Chang Greg Ver Steeg Aram Galstyan 53 53 0 23 Mar 2022
Diversify and Disambiguate: Learning From Underspecified Data Yoonho Lee Huaxiu Yao Chelsea Finn 263 66 0 07 Feb 2022
Linear Adversarial Concept Erasure Shauli Ravfogel Michael Twiton Yoav Goldberg Ryan Cotterell KELM 117 63 0 28 Jan 2022
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection Maarten Sap Swabha Swayamdipta Laura Vianna Xuhui Zhou Yejin Choi Noah A. Smith 81 283 0 15 Nov 2021
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models Tianlu Wang Rohit Sridhar Diyi Yang Xuezhi Wang AAML 195 76 0 14 Oct 2021
RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests Victor Chernozhukov Whitney Newey Victor Quintas-Martinez Vasilis Syrgkanis CML 55 40 0 06 Oct 2021
Combining Feature and Instance Attribution to Detect Artifacts Pouya Pezeshkpour Sarthak Jain Sameer Singh Byron C. Wallace TDI 106 42 0 01 Jul 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models Tongshuang Wu Marco Tulio Ribeiro Jeffrey Heer Daniel S. Weld 103 250 0 01 Jan 2021
Underspecification Presents Challenges for Credibility in Modern Machine Learning Alexander D'Amour Katherine A. Heller D. Moldovan Ben Adlam B. Alipanahi ... Kellie Webster Steve Yadlowsky T. Yun Xiaohua Zhai D. Sculley OffRL 120 688 0 06 Nov 2020
An Investigation of Why Overparameterization Exacerbates Spurious Correlations Shiori Sagawa Aditi Raghunathan Pang Wei Koh Percy Liang 191 383 0 09 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList Marco Tulio Ribeiro Tongshuang Wu Carlos Guestrin Sameer Singh ELM 208 1,107 0 08 May 2020
Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection Shauli Ravfogel Yanai Elazar Hila Gonen Michael Twiton Yoav Goldberg 138 388 0 16 Apr 2020
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization Shiori Sagawa Pang Wei Koh Tatsunori B. Hashimoto Percy Liang OOD 108 1,248 0 20 Nov 2019
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data Divyansh Kaushik Eduard H. Hovy Zachary Chase Lipton CML 96 570 0 26 Sep 2019
End-to-End Bias Mitigation by Modelling Biases in Corpora Rabeeh Karimi Mahabadi Yonatan Belinkov James Henderson 127 181 0 13 Sep 2019
Mitigating Gender Bias in Natural Language Processing: Literature Review Tony Sun Andrew Gaut Shirlyn Tang Yuxin Huang Mai ElSherief Jieyu Zhao Diba Mirza E. Belding-Royer Kai-Wei Chang William Yang Wang AI4CE 108 562 0 21 Jun 2019
Adapting Neural Networks for the Estimation of Treatment Effects Claudia Shi David M. Blei Victor Veitch CML 148 376 0 05 Jun 2019
Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification Daniel Borkan Lucas Dixon Jeffrey Scott Sorensen Nithum Thain Lucy Vasserman 90 492 0 11 Mar 2019
Gender Bias in Neural Natural Language Processing Kaiji Lu Piotr (Peter) Mardziel Fangjing Wu Preetam Amancharla Anupam Datta 117 356 0 31 Jul 2018
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Adina Williams Nikita Nangia Samuel R. Bowman 524 4,494 0 18 Apr 2017