Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.04571
Cited By
Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology
11 June 2019
Ran Zmigrod
Sabrina J. Mielke
Hanna M. Wallach
Ryan Cotterell
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology"
50 / 68 papers shown
Title
Man Made Language Models? Evaluating LLMs' Perpetuation of Masculine Generics Bias
Enzo Doyen
Amalia Todirascu
42
0
0
14 Feb 2025
Collapsed Language Models Promote Fairness
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
51
0
0
06 Oct 2024
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Riccardo Cantini
Giada Cosenza
A. Orsino
Domenico Talia
AAML
65
5
0
11 Jul 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
56
2
0
06 May 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
34
10
0
15 Mar 2024
Self-Supervised Position Debiasing for Large Language Models
Zhongkun Liu
Zheng Chen
Mengqi Zhang
Zhaochun Ren
Pengjie Ren
Zhumin Chen
36
1
0
02 Jan 2024
Data Augmentations for Improved (Large) Language Model Generalization
Amir Feder
Yoav Wald
Claudia Shi
Suchi Saria
David M. Blei
OOD
CML
34
7
0
19 Oct 2023
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model
Abhijith Chintam
Rahel Beloch
Willem H. Zuidema
Michael Hanna
Oskar van der Wal
28
16
0
19 Oct 2023
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
57
60
0
20 Aug 2023
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models
Somayeh Ghanbarzadeh
Yan-ping Huang
Hamid Palangi
R. C. Moreno
Hamed Khanpour
42
12
0
20 Jul 2023
Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases
Yingji Li
Mengnan Du
Xin Wang
Ying Wang
53
27
0
04 Jul 2023
An Empirical Analysis of Parameter-Efficient Methods for Debiasing Pre-Trained Language Models
Zhongbin Xie
Thomas Lukasiewicz
33
12
0
06 Jun 2023
On Counterfactual Data Augmentation Under Confounding
Abbavaram Gowtham Reddy
Saketh Bachu
Saloni Dash
Charchit Sharma
Amit Sharma
V. Balasubramanian
CML
BDL
33
0
0
29 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
42
2
0
23 May 2023
Effectiveness of Debiasing Techniques: An Indigenous Qualitative Analysis
Vithya Yogarajan
Gillian Dobbie
Henry Gouk
19
3
0
17 Apr 2023
Evaluation of Social Biases in Recent Large Pre-Trained Models
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
Alind Jain
26
0
0
13 Apr 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
29
7
0
10 Mar 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Zhongbin Xie
Vid Kocijan
Thomas Lukasiewicz
Oana-Maria Camburu
10
2
0
11 Feb 2023
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
24
9
0
15 Dec 2022
Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
37
67
0
02 Dec 2022
The Shared Task on Gender Rewriting
Bashar Alhafni
Nizar Habash
Houda Bouamor
Ossama Obeid
Sultan Alrowili
...
Mohamed Gabr
Abderrahmane Issam
Abdelrahim Qaddoumi
K. Vijay-Shanker
Mahmoud Zyate
34
1
0
22 Oct 2022
Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals
Maarten De Raedt
Fréderic Godin
Chris Develder
Thomas Demeester
13
1
0
21 Oct 2022
Choose Your Lenses: Flaws in Gender Bias Evaluation
Hadas Orgad
Yonatan Belinkov
27
35
0
20 Oct 2022
The User-Aware Arabic Gender Rewriter
Bashar Alhafni
Ossama Obeid
Nizar Habash
29
2
0
14 Oct 2022
SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Haozhe An
Zongxia Li
Jieyu Zhao
Rachel Rudinger
35
25
0
13 Oct 2022
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response
Selim Fekih
Nicolò Tamagnone
Benjamin Minixhofer
R. Shrestha
Ximena Contla
Ewan Oglethorpe
Navid Rekabsaz
21
6
0
10 Oct 2022
Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization
Zonghan Yang
Xiaoyuan Yi
Peng Li
Yang Liu
Xing Xie
38
33
0
10 Oct 2022
FAST: Improving Controllability for Text Generation with Feedback Aware Self-Training
Junyi Chai
Reid Pryzant
Victor Ye Dong
Konstantin Golobokov
Chenguang Zhu
Yi Liu
37
5
0
06 Oct 2022
Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
28
41
0
06 Oct 2022
Detecting Harmful Online Conversational Content towards LGBTQIA+ Individuals
Jamell Dacon
Harry Shomer
Shaylynn Crum-Dacon
Jiliang Tang
32
8
0
15 Jun 2022
Optimizing Relevance Maps of Vision Transformers Improves Robustness
Hila Chefer
Idan Schwartz
Lior Wolf
ViT
40
38
0
02 Jun 2022
Analyzing Gender Representation in Multilingual Models
Hila Gonen
Shauli Ravfogel
Yoav Goldberg
25
11
0
20 Apr 2022
Text Revision by On-the-Fly Representation Optimization
Jingjing Li
Zichao Li
Tao Ge
Irwin King
M. Lyu
BDL
31
17
0
15 Apr 2022
Fair and Argumentative Language Modeling for Computational Argumentation
Carolin Holtermann
Anne Lauscher
Simone Paolo Ponzetto
24
21
0
08 Apr 2022
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal
Umang Gupta
Jwala Dhamala
Varun Kumar
Apurv Verma
Yada Pruksachatkun
Satyapriya Krishna
Rahul Gupta
Kai-Wei Chang
Greg Ver Steeg
Aram Galstyan
21
49
0
23 Mar 2022
Screening Gender Transfer in Neural Machine Translation
Guillaume Wisniewski
Lichao Zhu
Nicolas Bailler
François Yvon
6
4
0
25 Feb 2022
A Causal Lens for Controllable Text Generation
Zhiting Hu
Erran L. Li
45
59
0
22 Jan 2022
A Survey on Gender Bias in Natural Language Processing
Karolina Stañczak
Isabelle Augenstein
30
111
0
28 Dec 2021
On Causally Disentangled Representations
Abbavaram Gowtham Reddy
Benin Godfrey L
V. Balasubramanian
OOD
CML
39
21
0
10 Dec 2021
Matching Learned Causal Effects of Neural Networks with Domain Priors
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
V. Balasubramanian
Amit Sharma
CML
36
13
0
24 Nov 2021
Perceptual Score: What Data Modalities Does Your Model Perceive?
Itai Gat
Idan Schwartz
Alex Schwing
44
30
0
27 Oct 2021
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models
Nicholas Meade
Elinor Poole-Dayan
Siva Reddy
22
124
0
16 Oct 2021
Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven Updates
Xiaochuang Han
Yulia Tsvetkov
TDI
31
30
0
07 Oct 2021
Data Augmentation Approaches in Natural Language Processing: A Survey
Bohan Li
Yutai Hou
Wanxiang Che
132
274
0
05 Oct 2021
One Source, Two Targets: Challenges and Rewards of Dual Decoding
Jitao Xu
François Yvon
18
6
0
21 Sep 2021
Top-N Recommendation with Counterfactual User Preference Simulation
Mengyue Yang
Quanyu Dai
Zhenhua Dong
Xu Chen
Xiuqiang He
Jun Wang
CML
BDL
50
65
0
02 Sep 2021
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
Amir Feder
Katherine A. Keith
Emaad A. Manzoor
Reid Pryzant
Dhanya Sridhar
...
Roi Reichart
Margaret E. Roberts
Brandon M Stewart
Victor Veitch
Diyi Yang
CML
41
235
0
02 Sep 2021
On Measures of Biases and Harms in NLP
Sunipa Dev
Emily Sheng
Jieyu Zhao
Aubrie Amstutz
Jiao Sun
...
M. Sanseverino
Jiin Kim
Akihiro Nishi
Nanyun Peng
Kai-Wei Chang
33
80
0
07 Aug 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data
Nitish Joshi
He He
OODD
21
46
0
01 Jul 2021
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis
Linyi Yang
Jiazheng Li
Padraig Cunningham
Yue Zhang
Barry Smyth
Ruihai Dong
19
47
0
29 Jun 2021
1
2
Next