HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

18 December 2020

Papers citing "HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection"

50 / 280 papers shown

Title
COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport Linhao Zhang Li Jin Guangluan Xu Xiaoyu Li Xian Sun 53 0 0 18 Jun 2024
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection Pia Pachinger Janis Goldzycher A. Planitzer Wojciech Kusa Allan Hanbury Julia Neidhardt 55 2 0 12 Jun 2024
Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster Agostina Calabrese Leonardo Neves Neil Shah Maarten W. Bos Björn Ross Mirella Lapata Francesco Barbieri FAtt 42 1 0 06 Jun 2024
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech Neemesh Yadav Sarah Masud Vikram Goyal Vikram Goyal Md. Shad Akhtar Tanmoy Chakraborty 36 5 0 06 Jun 2024
Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario Debajyoti Mazumder Aakash Kumar Jasabanta Patro 23 0 0 31 May 2024
Hate Speech Detection with Generalizable Target-aware Fairness Tong Chen Danny Wang Xurong Liang Marten Risius Gianluca Demartini Hongzhi Yin 35 3 0 28 May 2024
Grounding Toxicity in Real-World Events across Languages Wondimagegnhue Tufa Ilia Markov Piek Vossen 26 0 0 22 May 2024
The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content Xinyu Wang S. Koneru Pranav Narayanan Venkit Brett Frischmann Sarah Rajtmajer 29 0 0 17 May 2024
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations Preetam Prabhu Srikar Dammu Hayoung Jung Anjali Singh Monojit Choudhury Tanushree Mitra 42 8 0 08 May 2024
SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore Ri Chi Ng Nirmalendu Prakash Ming Shan Hee K. T. W. Choo Roy Ka-Wei Lee 43 4 0 03 May 2024
ViTHSD: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts Cuong Nhat Vo Khanh Bao Huynh Son T. Luu Trong-Hop Do 47 1 0 30 Apr 2024
The Constant in HATE: Analyzing Toxicity in Reddit across Topics and Languages Wondimagegnhue Tufa Ilia Markov Piek Vossen 18 0 0 29 Apr 2024
EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter Comfort Eseohen Ilevbare Jesujoba Oluwadara Alabi David Ifeoluwa Adelani Firdous Damilola Bakare O. B. Abiola O. Adeyemo 35 7 0 28 Apr 2024
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets Manuel Tonneau Diyi Liu Samuel Fraiberger Ralph Schroeder Scott A. Hale Paul Röttger 37 5 0 27 Apr 2024
Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse A. Ayele Esubalew alemneh Jalew Adem Chanie Ali Seid Muhie Yimam Christian Biemann 16 2 0 18 Apr 2024
A Federated Learning Approach to Privacy Preserving Offensive Language Identification Marcos Zampieri Damith Premasiri Tharindu Ranasinghe FedML 26 2 0 17 Apr 2024
Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement Paras Sheth Tharindu Kumarage Raha Moraffah Amanat Chadha Huan Liu 34 1 0 17 Apr 2024
What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs Anna Wegmann T. Broek Dong Nguyen 40 1 0 10 Apr 2024
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas Resck Marcos M. Raimundo Jorge Poco 50 1 0 03 Apr 2024
Target Span Detection for Implicit Harmful Content Nazanin Jafari James Allan Sheikh Muhammad Sarwar 48 1 0 28 Mar 2024
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data Manuel Tonneau Pedro Vitor Quinta de Castro Karim Lasri I. Farouq Lakshminarayanan Subramanian Victor Orozco-Olvera Samuel Fraiberger 44 10 0 28 Mar 2024
ToXCL: A Unified Framework for Toxic Speech Detection and Explanation Nhat M. Hoang Do Xuan Long Duc Anh Do Duc Anh Vu Anh Tuan Luu 47 4 0 25 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs Punyajoy Saha Aalok Agrawal Abhik Jana Chris Biemann Animesh Mukherjee 43 12 0 22 Mar 2024
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation A. Tonja Israel Abebe Azime Tadesse Destaw Belay M. Yigezu Moges Ahmed Mehamed ... Olga Kolesnikova Philipp Slusallek Dietrich Klakow Shengwu Xiong Seid Muhie Yimam 54 5 0 20 Mar 2024
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales Ayushi Nirmal Amrita Bhattacharjee Paras Sheth Huan Liu AAML 43 10 0 19 Mar 2024
HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models H. Nghiem Hal Daumé 42 1 0 18 Mar 2024
OffensiveLang: A Community Based Implicit Offensive Language Dataset Amit Das Mostafa Rahgouy Dongji Feng Zheng Zhang Tathagata Bhattacharya ... Aman Chadha Mary J. Sandage Lauramarie Pope Gerry V. Dozier Cheryl Seals 34 1 0 04 Mar 2024
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions Tomávs Horych Martin Wessel Jan Philip Wahle Terry Ruas Jerome Wassmuth André Greiner-Petter Akiko Aizawa Bela Gipp Timo Spinde 46 1 0 27 Feb 2024
Algorithmic Arbitrariness in Content Moderation Juan Felipe Gomez Caio Vieira Machado Lucas Monteiro Paes Flavio du Pin Calmon 36 9 0 26 Feb 2024
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks Somnath Banerjee Maulindu Sarkar Punyajoy Saha Binny Mathew Animesh Mukherjee TDI 34 0 0 22 Feb 2024
Eagle: Ethical Dataset Given from Real Interactions Masahiro Kaneko Danushka Bollegala Timothy Baldwin 44 3 0 22 Feb 2024
Investigating the Impact of Model Instability on Explanations and Uncertainty Sara Vera Marjanović Isabelle Augenstein Christina Lioma AAML 48 0 0 20 Feb 2024
A Dataset for the Detection of Dehumanizing Language Paul Engelmann Peter Brunsgaard Trolle Christian Hardmeier 17 1 0 13 Feb 2024
"Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition H. Nghiem Umang Gupta Fred Morstatter 39 4 0 05 Feb 2024
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection Sarah Masud Mohammad Aflah Khan Vikram Goyal Md. Shad Akhtar Tanmoy Chakraborty 21 0 0 03 Feb 2024
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles Amrita Ganguly Al Nahian Bin Emran Sadiya Sayara Chowdhury Puspo Md. Nishat Raihan Dhiman Goswami Marcos Zampieri 32 3 0 03 Feb 2024
Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models Ming Shan Hee Shivam Sharma Rui Cao Palash Nandi Tanmoy Chakraborty Roy Ka-Wei Lee 43 14 0 30 Jan 2024
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse Seungyoon Lee Dahyun Jung Chanjun Park Seolhwa Lee Heu-Jeoung Lim 34 1 0 26 Jan 2024
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations Prince Jha Krishanu Maity Raghav Jain Apoorv Verma Sriparna Saha P. Bhattacharyya 41 7 0 18 Jan 2024
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges Aiqi Jiang A. Zubiaga AAML 31 3 0 17 Jan 2024
Explain Thyself Bully: Sentiment Aided Cyberbullying Detection with Explanation Krishanu Maity Prince Jha Raghav Jain S. Saha P. Bhattacharyya 23 1 0 17 Jan 2024
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection Paloma Piot-Perez-Abadin Patricia Martín-Rodilla Javier Parapar 23 2 0 12 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems Tianyu Cui Yanling Wang Chuanpu Fu Yong Xiao Sijia Li ... Junwu Xiong Xinyu Kong Zujie Wen Ke Xu Qi Li 63 57 0 11 Jan 2024
An Investigation of Large Language Models for Real-World Hate Speech Detection Keyan Guo Alexander Hu Jaden Mu Ziheng Shi Ziming Zhao Nishant Vishwamitra Hongxin Hu 25 12 0 07 Jan 2024
Building Efficient Universal Classifiers with Natural Language Inference Moritz Laurer W. Atteveldt Andreu Casas Kasper Welbers 38 8 0 29 Dec 2023
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias Timo Spinde Smilla Hinterreiter Fabian Haak Terry Ruas Helge Giese Norman Meuschke Bela Gipp 27 12 0 26 Dec 2023
Moderating New Waves of Online Hate with Chain-of-Thought Reasoning in Large Language Models Nishant Vishwamitra Keyan Guo Farhan Tajwar Romit Isabelle Ondracek Long Cheng Ziming Zhao Hongxin Hu 19 12 0 22 Dec 2023
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments Neeraj Kumar Singh Koyel Ghosh Joy Mahapatra Utpal Garain Apurbalal Senapati 22 0 0 20 Dec 2023
Multi-Label Classification of COVID-Tweets Using Large Language Models Aniket Deroy Subhankar Maity 29 5 0 17 Dec 2023
Abusive Span Detection for Vietnamese Narrative Texts Nhu-Thanh Nguyen Khoa Thi-Kim Phan Duc-Vu Nguyen Ngan Luu-Thuy Nguyen 25 0 0 13 Dec 2023