Reducing Gender Bias in Abusive Language Detection

22 August 2018
Ji Ho Park
Jamin Shin
Pascale Fung
    FaML
arXiv: 1808.07231

Papers citing "Reducing Gender Bias in Abusive Language Detection"

50 / 70 papers shown
Title
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
48
0
0
09 Mar 2025
Longitudinal Abuse and Sentiment Analysis of Hollywood Movie Dialogues using LLMs
Rohitash Chandra
Guoxiang Ren
G. Houseman
51
0
0
20 Jan 2025
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT
Shucheng Zhu
Weikang Wang
Ying Liu
37
5
0
11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
53
2
0
06 May 2024
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Manuel Tonneau
Diyi Liu
Samuel Fraiberger
Ralph Schroeder
Scott A. Hale
Paul Röttger
37
5
0
27 Apr 2024
Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems
Aditya Narayan Sankaran
Vigneshwaran Shankaran
Sampath Lonka
Rajesh Sharma
32
0
0
18 Mar 2024
Beyond Detection: Unveiling Fairness Vulnerabilities in Abusive Language Models
Yueqing Liang
Lu Cheng
Ali Payani
Kai Shu
28
3
0
15 Nov 2023
FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics
Yupei Du
Albert Gatt
Dong Nguyen
31
1
0
10 Oct 2023
Examining Temporal Bias in Abusive Language Detection
Mali Jin
Yida Mu
Diana Maynard
Kalina Bontcheva
34
5
0
25 Sep 2023
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
52
59
0
20 Aug 2023
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
Paul Röttger
Hannah Rose Kirk
Bertie Vidgen
Giuseppe Attanasio
Federico Bianchi
Dirk Hovy
ALM
ELM
AILaw
25
125
0
02 Aug 2023
Learning to Generate Equitable Text in Dialogue from Biased Training Data
Anthony Sicilia
Malihe Alikhani
47
15
0
10 Jul 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
37
2
0
23 May 2023
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
37
10
0
22 May 2023
Bias mitigation techniques in image classification: fair machine learning in human heritage collections
Dalia Ortiz Pablo
Sushruth Badri
Erik Norén
Christoph Nötzli
33
1
0
20 Mar 2023
Data Augmentation for Neural NLP
Domagoj Pluscec
Jan Snajder
19
6
0
22 Feb 2023
Same Same, But Different: Conditional Multi-Task Learning for Demographic-Specific Toxicity Detection
Soumyajit Gupta
Sooyong Lee
Maria De-Arteaga
Matthew Lease
27
13
0
14 Feb 2023
Rating Sentiment Analysis Systems for Bias through a Causal Lens
Kausik Lakkaraju
Biplav Srivastava
Marco Valtorta
34
7
0
04 Feb 2023
A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models
Xingmeng Zhao
A. Niazi
Anthony Rios
25
2
0
24 Dec 2022
Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting
P. Sattigeri
S. Ghosh
Inkit Padhi
Pierre L. Dognin
Kush R. Varshney
FaML
25
28
0
13 Dec 2022
Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness
C. Hazirbas
Yejin Bang
Tiezheng Yu
Parisa Assar
Bilal Porgali
...
Jacqueline Pan
Emily McReynolds
Miranda Bogen
Pascale Fung
Cristian Canton Ferrer
27
8
0
10 Nov 2022
Detecting Unintended Social Bias in Toxic Language Datasets
Nihar Ranjan Sahoo
Himanshu Gupta
P. Bhattacharyya
13
18
0
21 Oct 2022
Choose Your Lenses: Flaws in Gender Bias Evaluation
Hadas Orgad
Yonatan Belinkov
27
35
0
20 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation
Silviu Pitis
Elliot Creager
Ajay Mandlekar
Animesh Garg
OffRL
48
33
0
20 Oct 2022
Controlling Bias Exposure for Fair Interpretable Predictions
Zexue He
Yu-Xiang Wang
Julian McAuley
Bodhisattwa Prasad Majumder
22
19
0
14 Oct 2022
A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter
Kyra Yee
Alice Schoenauer Sebag
Olivia Redfield
Emily Sheng
Matthias Eck
Luca Belli
28
2
0
07 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
119
93
0
06 Oct 2022
Fairness Reprogramming
Guanhua Zhang
Yihua Zhang
Yang Zhang
Wenqi Fan
Qing Li
Sijia Liu
Shiyu Chang
AAML
83
38
0
21 Sep 2022
The (de)biasing effect of GAN-based augmentation methods on skin lesion images
Agnieszka Mikołajczyk
Sylwia Majchrowska
Sandra Carrasco Limeros
MedIm
27
20
0
30 Jun 2022
"You Can't Fix What You Can't Measure": Privately Measuring Demographic Performance Disparities in Federated Learning
Marc Juárez
Aleksandra Korolova
FedML
32
9
0
24 Jun 2022
Toward Understanding Bias Correlations for Mitigation in NLP
Lu Cheng
Suyu Ge
Huan Liu
36
8
0
24 May 2022
User Guide for KOTE: Korean Online Comments Emotions Dataset
Duyoung Jeon
Junho Lee
Cheongtag Kim
33
0
0
11 May 2022
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Esma Balkir
I. Nejadgholi
Kathleen C. Fraser
S. Kiritchenko
FAtt
35
27
0
06 May 2022
Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation
Vivian Lai
Samuel Carton
Rajat Bhatnagar
Vera Liao
Yunfeng Zhang
Chenhao Tan
29
130
0
25 Apr 2022
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Xiaolei Huang
FaML
13
8
0
12 Apr 2022
Handling Bias in Toxic Speech Detection: A Survey
Tanmay Garg
Sarah Masud
Tharun Suresh
Tanmoy Chakraborty
17
91
0
26 Jan 2022
Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
...
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSL
AI4TS
232
422
0
24 Jan 2022
Causal effect of racial bias in data and machine learning algorithms on user persuasiveness & discriminatory decision making: An Empirical Study
Kinshuk Sengupta
Praveen Ranjan Srivastava
36
6
0
22 Jan 2022
A Survey on Gender Bias in Natural Language Processing
Karolina Stańczak
Isabelle Augenstein
30
109
0
28 Dec 2021
Latent Space Smoothing for Individually Fair Representations
Momchil Peychev
Anian Ruoss
Mislav Balunović
Maximilian Baader
Martin Vechev
FaML
36
19
0
26 Nov 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens
Saad Hassan
Matt Huenerfauth
Cecilia Ovesdotter Alm
46
38
0
01 Oct 2021
Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Matan Halevy
Camille Harris
A. Bruckman
Diyi Yang
A. Howard
42
35
0
27 Sep 2021
Balancing out Bias: Achieving Fairness Through Balanced Training
Xudong Han
Timothy Baldwin
Trevor Cohn
26
39
0
16 Sep 2021
Mitigating Language-Dependent Ethnic Bias in BERT
Jaimeen Ahn
Alice H. Oh
142
91
0
13 Sep 2021
SS-BERT: Mitigating Identity Terms Bias in Toxic Comment Classification by Utilising the Notion of "Subjectivity" and "Identity Terms"
Zhixue Zhao
Ziqi Zhang
F. Hopfgartner
16
5
0
06 Sep 2021
Investigating Bias In Automatic Toxic Comment Detection: An Empirical Study
Ayush Kumar
Pratik Kumar
20
0
0
14 Aug 2021
On Measures of Biases and Harms in NLP
Sunipa Dev
Emily Sheng
Jieyu Zhao
Aubrie Amstutz
Jiao Sun
...
M. Sanseverino
Jiin Kim
Akihiro Nishi
Nanyun Peng
Kai-Wei Chang
31
80
0
07 Aug 2021
Improving Counterfactual Generation for Fair Hate Speech Detection
Aida Mostafazadeh Davani
Ali Omrani
Brendan Kennedy
M. Atari
Xiang Ren
Morteza Dehghani
30
9
0
03 Aug 2021
Learning Stable Classifiers by Transferring Unstable Features
Yujia Bao
Shiyu Chang
Regina Barzilay
OOD
27
8
0
15 Jun 2021
Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective
Flavien Prost
Pranjal Awasthi
Nicholas Blumm
A. Kumthekar
Trevor Potter
Li Wei
Xuezhi Wang
Ed H. Chi
Jilin Chen
Alex Beutel
48
15
0
20 May 2021