Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.06460
Cited By
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
14 August 2020
Marzieh Mozafari
R. Farahbakhsh
Noel Crespi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model"
21 / 21 papers shown
Title
Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models
Paloma Piot
Patricia Martín-Rodilla
Javier Parapar
50
0
0
04 May 2025
Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection
Martin Wessel
Tomávs Horych
Terry Ruas
Akiko Aizawa
Bela Gipp
Timo Spinde
32
21
0
25 Apr 2023
SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddings
Jan Engler
Sandipan Sikdar
Marlene Lutz
M. Strohmaier
32
7
0
11 Jan 2023
Leveraging World Knowledge in Implicit Hate Speech Detection
Jessica Lin
21
6
0
28 Dec 2022
A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models
Xingmeng Zhao
A. Niazi
Anthony Rios
31
2
0
24 Dec 2022
Multi-VALUE: A Framework for Cross-Dialectal English NLP
Caleb Ziems
William B. Held
Jingfeng Yang
Jwala Dhamala
Rahul Gupta
Diyi Yang
46
40
0
15 Dec 2022
Detecting Unintended Social Bias in Toxic Language Datasets
Nihar Ranjan Sahoo
Himanshu Gupta
P. Bhattacharyya
18
18
0
21 Oct 2022
BERT-based Ensemble Approaches for Hate Speech Detection
Khouloud Mnassri
P. Rajapaksha
R. Farahbakhsh
Noel Crespi
17
18
0
14 Sep 2022
SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice
Mohit Singhal
Chen Ling
Pujan Paudel
Poojitha Thota
Nihal Kumarswamy
Gianluca Stringhini
Shirin Nilizadeh
75
28
0
29 Jun 2022
Toward Understanding Bias Correlations for Mitigation in NLP
Lu Cheng
Suyu Ge
Huan Liu
39
8
0
24 May 2022
Interactive Model Cards: A Human-Centered Approach to Model Documentation
Anamaria Crisan
Margaret Drouhard
Jesse Vig
Nazneen Rajani
HAI
40
87
0
05 May 2022
BERTuit: Understanding Spanish language in Twitter through a native transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
26
9
0
07 Apr 2022
Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings
S. Matthews
John Stephen Hudzina
Dawn Sepehr
AILaw
FaML
13
12
0
24 Mar 2022
Dominant Set-based Active Learning for Text Classification and its Application to Online Social Media
Toktam A. Oghaz
Ivan I. Garibay
8
0
0
28 Jan 2022
Handling Bias in Toxic Speech Detection: A Survey
Tanmay Garg
Sarah Masud
Tharun Suresh
Tanmoy Chakraborty
17
91
0
26 Jan 2022
Unraveling Social Perceptions & Behaviors towards Migrants on Twitter
A. Khatua
Wolfgang Nejdl
29
11
0
04 Dec 2021
Character-level HyperNetworks for Hate Speech Detection
Tomer Wullach
A. Adler
Einat Minkov
24
12
0
11 Nov 2021
Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model
Hind S. Alatawi
Areej M. Alhothali
K. Moria
27
86
0
02 Nov 2021
Detecting Inspiring Content on Social Media
Oana Ignat
Y-Lan Boureau
Jane A. Yu
A. Halevy
24
6
0
06 Sep 2021
Towards generalisable hate speech detection: a review on obstacles and solutions
Wenjie Yin
A. Zubiaga
117
164
0
17 Feb 2021
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
273
13,368
0
25 Aug 2014
1