ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12007
  4. Cited By
Demographic Attributes Prediction from Speech Using WavLM Embeddings

Demographic Attributes Prediction from Speech Using WavLM Embeddings

17 February 2025
Yuchen Yang
Thomas Thebaud
Najim Dehak
ArXivPDFHTML

Papers citing "Demographic Attributes Prediction from Speech Using WavLM Embeddings"

9 / 9 papers shown
Title
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits
Tiantian Feng
Jihwan Lee
Anfeng Xu
Yoonjeong Lee
Thanathai Lertpetchpun
...
Thomas Thebaud
Laureano Moro-Velazquez
D. Byrd
Najim Dehak
Shrikanth Narayanan
57
0
0
20 May 2025
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech
  recognition
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition
Jin Sakuma
Tatsuya Komatsu
Robin Scheibler
30
6
0
17 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
180
1,794
0
26 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and
  Challenges
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
Henry Gouk
Chen Change Loy
Timothy M. Hospedales
SSL
OOD
AI4TS
60
275
0
18 Oct 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
192
5,734
0
20 Jun 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with
  Encoder-Decoder Based Attractors
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
109
189
0
20 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding
Tianlong Chen
Xinyu Gong
Weiwei Zha
Zhangyang Wang
48
57
0
07 May 2020
Inception-v4, Inception-ResNet and the Impact of Residual Connections on
  Learning
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
316
14,196
0
23 Feb 2016
SMOTE: Synthetic Minority Over-sampling Technique
SMOTE: Synthetic Minority Over-sampling Technique
Nitesh Chawla
Kevin W. Bowyer
Lawrence Hall
W. Kegelmeyer
AI4TS
283
25,443
0
09 Jun 2011
1