SynHate: Detecting Hate Speech in Synthetic Deepfake Audio

7 June 2025
Rishabh Ranjan
Kishan Pipariya
Mayank Vatsa
Richa Singh
Main: 4 pages
Figures: 1
Bibliography: 1 page
Tables: 5
Abstract

The rise of deepfake audio and hate speech, powered by advanced text-to-speech, threatens online safety. We present SynHate, the first multilingual dataset for detecting hate speech in synthetic audio, spanning 37 languages. SynHate uses a novel four-class scheme: Real-normal, Real-hate, Fake-normal, and Fake-hate. Built from MuTox and ADIMA datasets, it captures diverse hate speech patterns globally and in India. We evaluate five leading self-supervised models (Whisper-small/medium, XLS-R, AST, mHuBERT), finding notable performance differences by language, with Whisper-small performing best overall. Cross-dataset generalization remains a challenge. By releasing SynHate and baseline code, we aim to advance robust, culturally sensitive, and multilingual solutions against synthetic hate speech. The dataset is available at this https URL.
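As an illustration of the four-class scheme named in the abstract, the sketch below shows one way the two underlying binary attributes (real vs. synthetic audio, normal vs. hateful content) could be combined into a single label. The names `SynHateLabel` and `combine_labels` are hypothetical and are not taken from the released dataset or baseline code; only the four class names come from the abstract.

```python
from enum import Enum


class SynHateLabel(Enum):
    """The four classes listed in the abstract (enum name is hypothetical)."""
    REAL_NORMAL = "Real-normal"
    REAL_HATE = "Real-hate"
    FAKE_NORMAL = "Fake-normal"
    FAKE_HATE = "Fake-hate"


def combine_labels(is_synthetic: bool, is_hate: bool) -> SynHateLabel:
    """Map two binary attributes to one of the four classes.

    Assumption: each clip carries an authenticity flag and a hate flag;
    the actual annotation format of the dataset may differ.
    """
    if is_synthetic:
        return SynHateLabel.FAKE_HATE if is_hate else SynHateLabel.FAKE_NORMAL
    return SynHateLabel.REAL_HATE if is_hate else SynHateLabel.REAL_NORMAL


if __name__ == "__main__":
    # Enumerate all four combinations of the two binary attributes.
    for synthetic in (False, True):
        for hate in (False, True):
            print(synthetic, hate, combine_labels(synthetic, hate).value)
```

Treating the task as one four-way classification, rather than two separate binary problems, lets a single model head distinguish all four combinations at once; whether the authors' baselines are set up this way is not stated in the abstract.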

@article{ranjan2025_2506.06772,
  title={SynHate: Detecting Hate Speech in Synthetic Deepfake Audio},
  author={Rishabh Ranjan and Kishan Pipariya and Mayank Vatsa and Richa Singh},
  journal={arXiv preprint arXiv:2506.06772},
  year={2025}
}