Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model

14 August 2020

Papers citing "Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model"

21 / 21 papers shown

Title
Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models Paloma Piot Patricia Martín-Rodilla Javier Parapar 50 0 0 04 May 2025
Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection Martin Wessel Tomávs Horych Terry Ruas Akiko Aizawa Bela Gipp Timo Spinde 32 21 0 25 Apr 2023
SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddings Jan Engler Sandipan Sikdar Marlene Lutz M. Strohmaier 32 7 0 11 Jan 2023
Leveraging World Knowledge in Implicit Hate Speech Detection Jessica Lin 21 6 0 28 Dec 2022
A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models Xingmeng Zhao A. Niazi Anthony Rios 31 2 0 24 Dec 2022
Multi-VALUE: A Framework for Cross-Dialectal English NLP Caleb Ziems William B. Held Jingfeng Yang Jwala Dhamala Rahul Gupta Diyi Yang 46 40 0 15 Dec 2022
Detecting Unintended Social Bias in Toxic Language Datasets Nihar Ranjan Sahoo Himanshu Gupta P. Bhattacharyya 18 18 0 21 Oct 2022
BERT-based Ensemble Approaches for Hate Speech Detection Khouloud Mnassri P. Rajapaksha R. Farahbakhsh Noel Crespi 17 18 0 14 Sep 2022
SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice Mohit Singhal Chen Ling Pujan Paudel Poojitha Thota Nihal Kumarswamy Gianluca Stringhini Shirin Nilizadeh 75 28 0 29 Jun 2022
Toward Understanding Bias Correlations for Mitigation in NLP Lu Cheng Suyu Ge Huan Liu 39 8 0 24 May 2022
Interactive Model Cards: A Human-Centered Approach to Model Documentation Anamaria Crisan Margaret Drouhard Jesse Vig Nazneen Rajani HAI 40 87 0 05 May 2022
BERTuit: Understanding Spanish language in Twitter through a native transformer Javier Huertas-Tato Alejandro Martín David Camacho 26 9 0 07 Apr 2022
Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings S. Matthews John Stephen Hudzina Dawn Sepehr AILaw FaML 13 12 0 24 Mar 2022
Dominant Set-based Active Learning for Text Classification and its Application to Online Social Media Toktam A. Oghaz Ivan I. Garibay 8 0 0 28 Jan 2022
Handling Bias in Toxic Speech Detection: A Survey Tanmay Garg Sarah Masud Tharun Suresh Tanmoy Chakraborty 17 91 0 26 Jan 2022
Unraveling Social Perceptions & Behaviors towards Migrants on Twitter A. Khatua Wolfgang Nejdl 29 11 0 04 Dec 2021
Character-level HyperNetworks for Hate Speech Detection Tomer Wullach A. Adler Einat Minkov 24 12 0 11 Nov 2021
Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model Hind S. Alatawi Areej M. Alhothali K. Moria 27 86 0 02 Nov 2021
Detecting Inspiring Content on Social Media Oana Ignat Y-Lan Boureau Jane A. Yu A. Halevy 24 6 0 06 Sep 2021
Towards generalisable hate speech detection: a review on obstacles and solutions Wenjie Yin A. Zubiaga 117 164 0 17 Feb 2021
Convolutional Neural Networks for Sentence Classification Yoon Kim AILaw VLM 273 13,368 0 25 Aug 2014