Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.10289
Cited By
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
18 December 2020
Binny Mathew
Punyajoy Saha
Seid Muhie Yimam
Chris Biemann
Pawan Goyal
Animesh Mukherjee
Re-assign community
ArXiv
PDF
HTML
Papers citing
"HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection"
50 / 280 papers shown
Title
COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
Linhao Zhang
Li Jin
Guangluan Xu
Xiaoyu Li
Xian Sun
53
0
0
18 Jun 2024
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
Pia Pachinger
Janis Goldzycher
A. Planitzer
Wojciech Kusa
Allan Hanbury
Julia Neidhardt
55
2
0
12 Jun 2024
Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster
Agostina Calabrese
Leonardo Neves
Neil Shah
Maarten W. Bos
Björn Ross
Mirella Lapata
Francesco Barbieri
FAtt
42
1
0
06 Jun 2024
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech
Neemesh Yadav
Sarah Masud
Vikram Goyal
Vikram Goyal
Md. Shad Akhtar
Tanmoy Chakraborty
36
5
0
06 Jun 2024
Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario
Debajyoti Mazumder
Aakash Kumar
Jasabanta Patro
23
0
0
31 May 2024
Hate Speech Detection with Generalizable Target-aware Fairness
Tong Chen
Danny Wang
Xurong Liang
Marten Risius
Gianluca Demartini
Hongzhi Yin
35
3
0
28 May 2024
Grounding Toxicity in Real-World Events across Languages
Wondimagegnhue Tufa
Ilia Markov
Piek Vossen
26
0
0
22 May 2024
The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content
Xinyu Wang
S. Koneru
Pranav Narayanan Venkit
Brett Frischmann
Sarah Rajtmajer
29
0
0
17 May 2024
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations
Preetam Prabhu Srikar Dammu
Hayoung Jung
Anjali Singh
Monojit Choudhury
Tanushree Mitra
42
8
0
08 May 2024
SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore
Ri Chi Ng
Nirmalendu Prakash
Ming Shan Hee
K. T. W. Choo
Roy Ka-Wei Lee
43
4
0
03 May 2024
ViTHSD: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts
Cuong Nhat Vo
Khanh Bao Huynh
Son T. Luu
Trong-Hop Do
47
1
0
30 Apr 2024
The Constant in HATE: Analyzing Toxicity in Reddit across Topics and Languages
Wondimagegnhue Tufa
Ilia Markov
Piek Vossen
18
0
0
29 Apr 2024
EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
Comfort Eseohen Ilevbare
Jesujoba Oluwadara Alabi
David Ifeoluwa Adelani
Firdous Damilola Bakare
O. B. Abiola
O. Adeyemo
35
7
0
28 Apr 2024
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Manuel Tonneau
Diyi Liu
Samuel Fraiberger
Ralph Schroeder
Scott A. Hale
Paul Röttger
37
5
0
27 Apr 2024
Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse
A. Ayele
Esubalew alemneh Jalew
Adem Chanie Ali
Seid Muhie Yimam
Christian Biemann
16
2
0
18 Apr 2024
A Federated Learning Approach to Privacy Preserving Offensive Language Identification
Marcos Zampieri
Damith Premasiri
Tharindu Ranasinghe
FedML
26
2
0
17 Apr 2024
Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
Paras Sheth
Tharindu Kumarage
Raha Moraffah
Amanat Chadha
Huan Liu
34
1
0
17 Apr 2024
What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs
Anna Wegmann
T. Broek
Dong Nguyen
40
1
0
10 Apr 2024
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales
Lucas Resck
Marcos M. Raimundo
Jorge Poco
50
1
0
03 Apr 2024
Target Span Detection for Implicit Harmful Content
Nazanin Jafari
James Allan
Sheikh Muhammad Sarwar
48
1
0
28 Mar 2024
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
Manuel Tonneau
Pedro Vitor Quinta de Castro
Karim Lasri
I. Farouq
Lakshminarayanan Subramanian
Victor Orozco-Olvera
Samuel Fraiberger
44
10
0
28 Mar 2024
ToXCL: A Unified Framework for Toxic Speech Detection and Explanation
Nhat M. Hoang
Do Xuan Long
Duc Anh Do
Duc Anh Vu
Anh Tuan Luu
47
4
0
25 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs
Punyajoy Saha
Aalok Agrawal
Abhik Jana
Chris Biemann
Animesh Mukherjee
43
12
0
22 Mar 2024
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation
A. Tonja
Israel Abebe Azime
Tadesse Destaw Belay
M. Yigezu
Moges Ahmed Mehamed
...
Olga Kolesnikova
Philipp Slusallek
Dietrich Klakow
Shengwu Xiong
Seid Muhie Yimam
54
5
0
20 Mar 2024
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
Ayushi Nirmal
Amrita Bhattacharjee
Paras Sheth
Huan Liu
AAML
43
10
0
19 Mar 2024
HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
H. Nghiem
Hal Daumé
42
1
0
18 Mar 2024
OffensiveLang: A Community Based Implicit Offensive Language Dataset
Amit Das
Mostafa Rahgouy
Dongji Feng
Zheng Zhang
Tathagata Bhattacharya
...
Aman Chadha
Mary J. Sandage
Lauramarie Pope
Gerry V. Dozier
Cheryl Seals
34
1
0
04 Mar 2024
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions
Tomávs Horych
Martin Wessel
Jan Philip Wahle
Terry Ruas
Jerome Wassmuth
André Greiner-Petter
Akiko Aizawa
Bela Gipp
Timo Spinde
46
1
0
27 Feb 2024
Algorithmic Arbitrariness in Content Moderation
Juan Felipe Gomez
Caio Vieira Machado
Lucas Monteiro Paes
Flavio du Pin Calmon
36
9
0
26 Feb 2024
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks
Somnath Banerjee
Maulindu Sarkar
Punyajoy Saha
Binny Mathew
Animesh Mukherjee
TDI
34
0
0
22 Feb 2024
Eagle: Ethical Dataset Given from Real Interactions
Masahiro Kaneko
Danushka Bollegala
Timothy Baldwin
44
3
0
22 Feb 2024
Investigating the Impact of Model Instability on Explanations and Uncertainty
Sara Vera Marjanović
Isabelle Augenstein
Christina Lioma
AAML
48
0
0
20 Feb 2024
A Dataset for the Detection of Dehumanizing Language
Paul Engelmann
Peter Brunsgaard Trolle
Christian Hardmeier
17
1
0
13 Feb 2024
"Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition
H. Nghiem
Umang Gupta
Fred Morstatter
39
4
0
05 Feb 2024
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection
Sarah Masud
Mohammad Aflah Khan
Vikram Goyal
Md. Shad Akhtar
Tanmoy Chakraborty
21
0
0
03 Feb 2024
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles
Amrita Ganguly
Al Nahian Bin Emran
Sadiya Sayara Chowdhury Puspo
Md. Nishat Raihan
Dhiman Goswami
Marcos Zampieri
32
3
0
03 Feb 2024
Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models
Ming Shan Hee
Shivam Sharma
Rui Cao
Palash Nandi
Tanmoy Chakraborty
Roy Ka-Wei Lee
43
14
0
30 Jan 2024
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse
Seungyoon Lee
Dahyun Jung
Chanjun Park
Seolhwa Lee
Heu-Jeoung Lim
34
1
0
26 Jan 2024
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations
Prince Jha
Krishanu Maity
Raghav Jain
Apoorv Verma
Sriparna Saha
P. Bhattacharyya
41
7
0
18 Jan 2024
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
Aiqi Jiang
A. Zubiaga
AAML
31
3
0
17 Jan 2024
Explain Thyself Bully: Sentiment Aided Cyberbullying Detection with Explanation
Krishanu Maity
Prince Jha
Raghav Jain
S. Saha
P. Bhattacharyya
23
1
0
17 Jan 2024
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection
Paloma Piot-Perez-Abadin
Patricia Martín-Rodilla
Javier Parapar
23
2
0
12 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
63
57
0
11 Jan 2024
An Investigation of Large Language Models for Real-World Hate Speech Detection
Keyan Guo
Alexander Hu
Jaden Mu
Ziheng Shi
Ziming Zhao
Nishant Vishwamitra
Hongxin Hu
25
12
0
07 Jan 2024
Building Efficient Universal Classifiers with Natural Language Inference
Moritz Laurer
W. Atteveldt
Andreu Casas
Kasper Welbers
38
8
0
29 Dec 2023
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias
Timo Spinde
Smilla Hinterreiter
Fabian Haak
Terry Ruas
Helge Giese
Norman Meuschke
Bela Gipp
27
12
0
26 Dec 2023
Moderating New Waves of Online Hate with Chain-of-Thought Reasoning in Large Language Models
Nishant Vishwamitra
Keyan Guo
Farhan Tajwar Romit
Isabelle Ondracek
Long Cheng
Ziming Zhao
Hongxin Hu
19
12
0
22 Dec 2023
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments
Neeraj Kumar Singh
Koyel Ghosh
Joy Mahapatra
Utpal Garain
Apurbalal Senapati
22
0
0
20 Dec 2023
Multi-Label Classification of COVID-Tweets Using Large Language Models
Aniket Deroy
Subhankar Maity
29
5
0
17 Dec 2023
Abusive Span Detection for Vietnamese Narrative Texts
Nhu-Thanh Nguyen
Khoa Thi-Kim Phan
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
25
0
0
13 Dec 2023
Previous
1
2
3
4
5
6
Next