ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.24713
17
0

Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification

30 May 2025
Badr M. Abdullah
Matthew Baas
Bernd Möbius
Dietrich Klakow
ArXiv (abs)PDFHTML
Main:4 Pages
2 Figures
Bibliography:2 Pages
3 Tables
Abstract

Arabic dialect identification (ADI) systems are essential for large-scale data collection pipelines that enable the development of inclusive speech technologies for Arabic language varieties. However, the reliability of current ADI systems is limited by poor generalization to out-of-domain speech. In this paper, we present an effective approach based on voice conversion for training ADI models that achieves state-of-the-art performance and significantly improves robustness in cross-domain scenarios. Evaluated on a newly collected real-world test set spanning four different domains, our approach yields consistent improvements of up to +34.1% in accuracy across domains. Furthermore, we present an analysis of our approach and demonstrate that voice conversion helps mitigate the speaker bias in the ADI dataset. We release our robust ADI model and cross-domain evaluation dataset to support the development of inclusive speech technologies for Arabic.

View on arXiv
@article{abdullah2025_2505.24713,
  title={ Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification },
  author={ Badr M. Abdullah and Matthew Baas and Bernd Möbius and Dietrich Klakow },
  journal={arXiv preprint arXiv:2505.24713},
  year={ 2025 }
}
Comments on this paper