x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

12 February 2020

R. Pappagari

Tianzi Wang

Jesus Villalba

Nanxin Chen

Najim Dehak

ArXiv PDF HTML

Papers citing "x-vectors meet emotions: A study on dependencies between emotion and speaker recognition"

49 / 49 papers shown

Title
Improving speaker verification robustness with synthetic emotional utterances Nikhil Kumar Koditala C. Ju Ruirui Li Minho Jin Aman Chadha A. Stolcke 57 0 0 30 Nov 2024
On-the-fly Modulation for Balanced Multimodal Learning Yake Wei D. Hu Henghui Du Ji-Rong Wen 26 7 0 15 Oct 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning Shuai Wang Zheng-Shou Chen Kong Aik Lee Yan-min Qian Haizhou Li 39 4 0 21 Jul 2024
Disentangled Representation Learning for Environment-agnostic Speaker Recognition Kihyun Nam Hee-Soo Heo Jee-weon Jung Joon Son Chung 50 0 0 20 Jun 2024
Speaker Characterization by means of Attention Pooling Federico Costa Miquel India Javier Hernando 25 1 0 07 May 2024
The VoicePrivacy 2024 Challenge Evaluation Plan N. Tomashenko Xiaoxiao Miao Pierre Champion Sarina Meyer Xin Wang Emmanuel Vincent Michele Panariello Nicholas W. D. Evans Junichi Yamagishi Massimiliano Todisco 36 21 0 03 Apr 2024
Are Paralinguistic Representations all that is needed for Speech Emotion Recognition? Orchid Chetia Phukan Gautam Siddharth Kashyap Arun Balaji Buduru Rajesh Sharma 29 0 0 02 Feb 2024
Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition Ismail Rasim Ulgen Zongyang Du Carlos Busso Berrak Sisman 21 2 0 19 Jan 2024
Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement Soumya Dutta Sriram Ganapathy 18 1 0 09 Jan 2024
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Ziyang Ma Wen Wu Zhisheng Zheng Yiwei Guo Qian Chen Shiliang Zhang Xie Chen 27 15 0 19 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures J. Yip Dianwen Ng Bin Ma Chng Eng Siong 23 0 0 14 Sep 2023
Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios Morgan Sandler Arun Ross CVBM 18 0 0 13 May 2023
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition Orchid Chetia Phukan Arun Balaji Buduru Rajesh Sharma 28 6 0 22 Apr 2023
Evaluation of Speaker Anonymization on Emotional Speech Hubert Nourtel Pierre Champion D. Jouvet Anthony Larcher Marie Tahon 32 8 0 15 Apr 2023
On the Impact of Voice Anonymization on Speech Diagnostic Applications: a Case Study on COVID-19 Detection Yi Zhu Mohamed Imoussaïne-Aïkous Carolyn Côté-Lussier Tiago H. Falk 18 4 0 05 Apr 2023
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective Spandan Dey Md. Sahidullah G. Saha 17 20 0 30 Nov 2022
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition Morgan Sandler Arun Ross 14 0 0 15 Nov 2022
Distribution-based Emotion Recognition in Conversation Wen Wu C. Zhang P. Woodland 19 4 0 09 Nov 2022
Disentangled representation learning for multilingual speaker recognition Kihyun Nam You-kyong. Kim Jaesung Huh Hee-Soo Heo Jee-weon Jung Joon Son Chung 50 6 0 01 Nov 2022
Model Compression for DNN-based Speaker Verification Using Weight Quantization Jingyu Li W. Liu Zhaoyang Zhang Jiong Wang Tan Lee MQ 18 3 0 31 Oct 2022
Training speech emotion classifier without categorical annotations Meysam Shamsi Marie Tahon 18 2 0 14 Oct 2022
Analysis of impact of emotions on target speech extraction and speech separation Jan vSvec Katevrina vZmolíková M. Kocour Marc Delcroix Tsubasa Ochiai Ladislav Movsner JanHonza'' vCernocký 15 4 0 15 Aug 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech Jaejin Cho Jesús Villalba Laureano Moro Velázquez Najim Dehak SSL 31 16 0 10 Aug 2022
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations Jaejin Cho R. Pappagari Piotr Żelasko Laureano Moro Velázquez Jesús Villalba Najim Dehak SSL 15 13 0 10 Aug 2022
Nonwords Pronunciation Classification in Language Development Tests for Preschool Children Ilja Baumann Dominik Wagner Sebastian P. Bayerl Tobias Bocklet 19 5 0 16 Jun 2022
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features Xulong Zhang Jianzong Wang Ning Cheng Jing Xiao 16 16 0 24 May 2022
Balanced Multimodal Learning via On-the-fly Gradient Modulation Xiaokang Peng Yake Wei Andong Deng Dong Wang Di Hu 19 194 0 29 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors Wen Wu C. Zhang Xixin Wu P. Woodland 48 14 0 08 Mar 2022
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models Sarala Padi S. O. Sadjadi Tianyi Zhou Ram D. Sriram 28 36 0 16 Feb 2022
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition Ayoub Ghriss Bo Yang Viktor Rozgic Elizabeth Shriberg Chao Wang 22 21 0 27 Jan 2022
Emotional Speaker Identification using a Novel Capsule Nets Model Ali Bou Nassif I. Shahin A. Elnagar Divya Velayudhan A. Alhudhaif K. Polat 14 28 0 09 Jan 2022
X-Vector based voice activity detection for multi-genre broadcast speech-to-text Misa Ogura Matt Haynes 9 0 0 09 Dec 2021
Beyond Isolated Utterances: Conversational Emotion Recognition R. Pappagari Piotr Żelasko Jesús Villalba Laureano Moro Velázquez Najim Dehak 14 4 0 13 Sep 2021
Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments L. F. Parra-Gallego J. Orozco-Arroyave 6 17 0 26 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation Sarala Padi S. O. Sadjadi Tianyi Zhou Ram D. Sriram 16 34 0 05 Aug 2021
The Role of Phonetic Units in Speech Emotion Recognition Jiahong Yuan Xingyu Cai Renjie Zheng Liang Huang Kenneth Ward Church 15 15 0 02 Aug 2021
Significance of Speaker Embeddings and Temporal Context for Depression Detection Sri Harsha Dumpala Sebastian Rodriguez S. Rempel Rudolf Uher Sageev Oore 12 4 0 24 Jul 2021
An Attribute-Aligned Strategy for Learning Speech Representation Yu-Lin Huang Bo-Hao Su Y.-W. Peter Hong Chi-Chun Lee 13 5 0 05 Jun 2021
Multi-Modal Emotion Detection with Transfer Learning Amith Ananthram Kailash Saravanakumar Jessica Huynh Homayoon Beigi 13 3 0 13 Nov 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition R. Pappagari Jesús Villalba Piotr Żelasko Laureano Moro Velázquez Najim Dehak 6 39 0 27 Oct 2020
Leveraging speaker attribute information using multi task learning for speaker verification and diarization Chau Luu P. Bell Steve Renals 22 8 0 27 Oct 2020
Emotion recognition by fusing time synchronous and time asynchronous representations Wen Wu Chao Zhang P. Woodland 14 67 0 27 Oct 2020
End-to-end Triplet Loss based Emotion Embedding System for Speech Emotion Recognition Puneet Kumar S. Jain Balasubramanian Raman P. Roy Masakazu Iwamura 10 24 0 13 Oct 2020
They are wearing a mask! Identification of Subjects Wearing a Surgical Mask from their Speech by means of x-vectors and Fisher Vectors José Vicente Egas López 8 0 0 23 Aug 2020
Transformer based unsupervised pre-training for acoustic representation learning Ruixiong Zhang Haiwei Wu Wubo Li Dongwei Jiang Wei Zou Xiangang Li SSL ViT 25 27 0 29 Jul 2020
Double Multi-Head Attention for Speaker Verification Miquel India Pooyan Safari Javier Hernando 28 18 0 26 Jul 2020
Pathological speech detection using x-vector embeddings Catarina Botelho Francisco Teixeira T. Rolland A. Abad Isabel Trancoso 13 11 0 02 Mar 2020
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications Chao Zhang Zichao Yang Xiaodong He Li Deng HAI AI4TS 27 321 0 10 Nov 2019
Probing the Information Encoded in X-vectors Desh Raj David Snyder Daniel Povey Sanjeev Khudanpur 37 84 0 13 Sep 2019