Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.07231
Cited By
Reducing Gender Bias in Abusive Language Detection
22 August 2018
Ji Ho Park
Jamin Shin
Pascale Fung
FaML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reducing Gender Bias in Abusive Language Detection"
28 / 78 papers shown
Title
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
196
0
12 Jul 2021
Learning Stable Classifiers by Transferring Unstable Features
Yujia Bao
Shiyu Chang
Regina Barzilay
OOD
27
8
0
15 Jun 2021
Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective
Flavien Prost
Pranjal Awasthi
Nicholas Blumm
A. Kumthekar
Trevor Potter
Li Wei
Xuezhi Wang
Ed H. Chi
Jilin Chen
Alex Beutel
48
15
0
20 May 2021
Evaluating Gender Bias in Natural Language Inference
Shanya Sharma
Manan Dey
Koustuv Sinha
25
41
0
12 May 2021
Explanation-Based Human Debugging of NLP Models: A Survey
Piyawat Lertvittayakumjorn
Francesca Toni
LRM
42
79
0
30 Apr 2021
Towards generalisable hate speech detection: a review on obstacles and solutions
Wenjie Yin
A. Zubiaga
117
164
0
17 Feb 2021
HateCheck: Functional Tests for Hate Speech Detection Models
Paul Röttger
B. Vidgen
Dong Nguyen
Zeerak Talat
Helen Z. Margetts
J. Pierrehumbert
31
259
0
31 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Percy Liang
OOD
53
1,377
0
14 Dec 2020
Selective Classification Can Magnify Disparities Across Groups
Erik Jones
Shiori Sagawa
Pang Wei Koh
Ananya Kumar
Percy Liang
23
46
0
27 Oct 2020
Towards Socially Responsible AI: Cognitive Bias-Aware Multi-Objective Learning
Procheta Sen
Debasis Ganguly
27
18
0
14 May 2020
Intersectional Bias in Hate Speech and Abusive Language Datasets
Jae-Yeon Kim
Carlos Ortiz
S. Nam
Sarah Santiago
V. Datta
9
45
0
12 May 2020
Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Brendan Kennedy
Xisen Jin
Aida Mostafazadeh Davani
Morteza Dehghani
Xiang Ren
11
137
0
05 May 2020
Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem
Danielle Saunders
Bill Byrne
AI4CE
16
136
0
09 Apr 2020
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition
Xiaolei Huang
Linzi Xing
Franck Dernoncourt
Michael J. Paul
13
87
0
24 Feb 2020
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies
Marta R. Costa-jussá
P. Lin
C. España-Bonet
SyDa
23
24
0
10 Dec 2019
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation
Emily Dinan
Angela Fan
Adina Williams
Jack Urbanek
Douwe Kiela
Jason Weston
27
205
0
10 Nov 2019
Assessing Social and Intersectional Biases in Contextualized Word Representations
Y. Tan
Elisa Celis
FaML
21
223
0
04 Nov 2019
Toward Gender-Inclusive Coreference Resolution
Yang Trista Cao
Hal Daumé
31
141
0
30 Oct 2019
Content Removal as a Moderation Strategy: Compliance and Other Outcomes in the ChangeMyView Community
K. Srinivasan
Cristian Danescu-Niculescu-Mizil
Lillian Lee
Chenhao Tan
17
81
0
21 Oct 2019
A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces
Anne Lauscher
Goran Glavas
Simone Paolo Ponzetto
Ivan Vulić
27
62
0
13 Sep 2019
Multilingual and Multi-Aspect Hate Speech Analysis
N. Ousidhoum
Zizheng Lin
Hongming Zhang
Yangqiu Song
Dit-Yan Yeung
24
282
0
29 Aug 2019
Mitigating Gender Bias in Natural Language Processing: Literature Review
Tony Sun
Andrew Gaut
Shirlyn Tang
Yuxin Huang
Mai Elsherief
Jieyu Zhao
Diba Mirza
E. Belding-Royer
Kai-Wei Chang
William Yang Wang
AI4CE
29
542
0
21 Jun 2019
Incorporating Priors with Feature Attribution on Text Classification
Frederick Liu
Besim Avci
FAtt
FaML
31
120
0
19 Jun 2019
Sentiment analysis is not solved! Assessing and probing sentiment classification
Jeremy Barnes
Lilja Øvrelid
Erik Velldal
16
32
0
13 Jun 2019
Racial Bias in Hate Speech and Abusive Language Detection Datasets
Thomas Davidson
Debasmita Bhattacharya
Ingmar Weber
22
451
0
29 May 2019
Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification
Daniel Borkan
Lucas Dixon
Jeffrey Scott Sorensen
Nithum Thain
Lucy Vasserman
14
471
0
11 Mar 2019
Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques
Joel Escudé Font
Marta R. Costa-jussá
16
167
0
10 Jan 2019
Counterfactual Fairness in Text Classification through Robustness
Sahaj Garg
Vincent Perot
Nicole Limtiaco
Ankur Taly
Ed H. Chi
Alex Beutel
22
258
0
27 Sep 2018
Previous
1
2