Demographic Attributes Prediction from Speech Using WavLM Embeddings

17 February 2025

Papers citing "Demographic Attributes Prediction from Speech Using WavLM Embeddings"

9 / 9 papers shown

Title
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits Tiantian Feng Jihwan Lee Anfeng Xu Yoonjeong Lee Thanathai Lertpetchpun ... Thomas Thebaud Laureano Moro-Velazquez D. Byrd Najim Dehak Shrikanth Narayanan 57 0 0 20 May 2025
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition Jin Sakuma Tatsuya Komatsu Robin Scheibler 30 6 0 17 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Micheal Zeng Xiangzhan Yu Furu Wei SSL 180 1,794 0 26 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and Challenges Linus Ericsson Henry Gouk Chen Change Loy Timothy M. Hospedales SSL OOD AI4TS 60 275 0 18 Oct 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 192 5,734 0 20 Jun 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors Shota Horiguchi Yusuke Fujita Shinji Watanabe Yawen Xue Kenji Nagamatsu 109 189 0 20 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition Shaojin Ding Tianlong Chen Xinyu Gong Weiwei Zha Zhangyang Wang 48 57 0 07 May 2020
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning Christian Szegedy Sergey Ioffe Vincent Vanhoucke Alexander A. Alemi 316 14,196 0 23 Feb 2016
SMOTE: Synthetic Minority Over-sampling Technique Nitesh Chawla Kevin W. Bowyer Lawrence Hall W. Kegelmeyer AI4TS 283 25,443 0 09 Jun 2011