ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.02172
29
0

Different Speech Translation Models Encode and Translate Speaker Gender Differently

2 June 2025
Dennis Fucci
Marco Gaido
Matteo Negri
L. Bentivogli
André F. T. Martins
Giuseppe Attanasio
ArXiv (abs)PDFHTML
Main:5 Pages
5 Figures
Bibliography:4 Pages
6 Tables
Appendix:6 Pages
Abstract

Recent studies on interpreting the hidden states of speech models have shown their ability to capture speaker-specific features, including gender. Does this finding also hold for speech translation (ST) models? If so, what are the implications for the speaker's gender assignment in translation? We address these questions from an interpretability perspective, using probing methods to assess gender encoding across diverse ST models. Results on three language directions (English-French/Italian/Spanish) indicate that while traditional encoder-decoder models capture gender information, newer architectures -- integrating a speech encoder with a machine translation system via adapters -- do not. We also demonstrate that low gender encoding capabilities result in systems' tendency toward a masculine default, a translation bias that is more pronounced in newer architectures.

View on arXiv
@article{fucci2025_2506.02172,
  title={ Different Speech Translation Models Encode and Translate Speaker Gender Differently },
  author={ Dennis Fucci and Marco Gaido and Matteo Negri and Luisa Bentivogli and Andre Martins and Giuseppe Attanasio },
  journal={arXiv preprint arXiv:2506.02172},
  year={ 2025 }
}
Comments on this paper