ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.05039
  4. Cited By
x-vectors meet emotions: A study on dependencies between emotion and
  speaker recognition

x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

12 February 2020
R. Pappagari
Tianzi Wang
Jesus Villalba
Nanxin Chen
Najim Dehak
ArXivPDFHTML

Papers citing "x-vectors meet emotions: A study on dependencies between emotion and speaker recognition"

49 / 49 papers shown
Title
Improving speaker verification robustness with synthetic emotional utterances
Nikhil Kumar Koditala
C. Ju
Ruirui Li
Minho Jin
Aman Chadha
A. Stolcke
57
0
0
30 Nov 2024
On-the-fly Modulation for Balanced Multimodal Learning
On-the-fly Modulation for Balanced Multimodal Learning
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
26
7
0
15 Oct 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
39
4
0
21 Jul 2024
Disentangled Representation Learning for Environment-agnostic Speaker
  Recognition
Disentangled Representation Learning for Environment-agnostic Speaker Recognition
Kihyun Nam
Hee-Soo Heo
Jee-weon Jung
Joon Son Chung
50
0
0
20 Jun 2024
Speaker Characterization by means of Attention Pooling
Speaker Characterization by means of Attention Pooling
Federico Costa
Miquel India
Javier Hernando
25
1
0
07 May 2024
The VoicePrivacy 2024 Challenge Evaluation Plan
The VoicePrivacy 2024 Challenge Evaluation Plan
N. Tomashenko
Xiaoxiao Miao
Pierre Champion
Sarina Meyer
Xin Wang
Emmanuel Vincent
Michele Panariello
Nicholas W. D. Evans
Junichi Yamagishi
Massimiliano Todisco
36
21
0
03 Apr 2024
Are Paralinguistic Representations all that is needed for Speech Emotion
  Recognition?
Are Paralinguistic Representations all that is needed for Speech Emotion Recognition?
Orchid Chetia Phukan
Gautam Siddharth Kashyap
Arun Balaji Buduru
Rajesh Sharma
29
0
0
02 Feb 2024
Revealing Emotional Clusters in Speaker Embeddings: A Contrastive
  Learning Strategy for Speech Emotion Recognition
Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition
Ismail Rasim Ulgen
Zongyang Du
Carlos Busso
Berrak Sisman
21
2
0
19 Jan 2024
Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Soumya Dutta
Sriram Ganapathy
18
1
0
09 Jan 2024
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion
  Recognition
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Ziyang Ma
Wen Wu
Zhisheng Zheng
Yiwei Guo
Qian Chen
Shiliang Zhang
Xie Chen
27
15
0
19 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional
  Speech Mixtures
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
J. Yip
Dianwen Ng
Bin Ma
Chng Eng Siong
23
0
0
14 Sep 2023
Vocal Style Factorization for Effective Speaker Recognition in Affective
  Scenarios
Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Morgan Sandler
Arun Ross
CVBM
18
0
0
13 May 2023
A Comparative Study of Pre-trained Speech and Audio Embeddings for
  Speech Emotion Recognition
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition
Orchid Chetia Phukan
Arun Balaji Buduru
Rajesh Sharma
28
6
0
22 Apr 2023
Evaluation of Speaker Anonymization on Emotional Speech
Evaluation of Speaker Anonymization on Emotional Speech
Hubert Nourtel
Pierre Champion
D. Jouvet
Anthony Larcher
Marie Tahon
32
8
0
15 Apr 2023
On the Impact of Voice Anonymization on Speech Diagnostic Applications:
  a Case Study on COVID-19 Detection
On the Impact of Voice Anonymization on Speech Diagnostic Applications: a Case Study on COVID-19 Detection
Yi Zhu
Mohamed Imoussaïne-Aïkous
Carolyn Côté-Lussier
Tiago H. Falk
18
4
0
05 Apr 2023
An Overview of Indian Spoken Language Recognition from Machine Learning
  Perspective
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
Spandan Dey
Md. Sahidullah
G. Saha
17
20
0
30 Nov 2022
Is Style All You Need? Dependencies Between Emotion and GST-based
  Speaker Recognition
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition
Morgan Sandler
Arun Ross
14
0
0
15 Nov 2022
Distribution-based Emotion Recognition in Conversation
Distribution-based Emotion Recognition in Conversation
Wen Wu
C. Zhang
P. Woodland
19
4
0
09 Nov 2022
Disentangled representation learning for multilingual speaker
  recognition
Disentangled representation learning for multilingual speaker recognition
Kihyun Nam
You-kyong. Kim
Jaesung Huh
Hee-Soo Heo
Jee-weon Jung
Joon Son Chung
50
6
0
01 Nov 2022
Model Compression for DNN-based Speaker Verification Using Weight
  Quantization
Model Compression for DNN-based Speaker Verification Using Weight Quantization
Jingyu Li
W. Liu
Zhaoyang Zhang
Jiong Wang
Tan Lee
MQ
18
3
0
31 Oct 2022
Training speech emotion classifier without categorical annotations
Training speech emotion classifier without categorical annotations
Meysam Shamsi
Marie Tahon
18
2
0
14 Oct 2022
Analysis of impact of emotions on target speech extraction and speech
  separation
Analysis of impact of emotions on target speech extraction and speech separation
Jan vSvec
Katevrina vZmolíková
M. Kocour
Marc Delcroix
Tsubasa Ochiai
Ladislav Movsner
JanHonza'' vCernocký
15
4
0
15 Aug 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information
  Extraction from Speech
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
Jaejin Cho
Jesús Villalba
Laureano Moro Velázquez
Najim Dehak
SSL
31
16
0
10 Aug 2022
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech
  Representations
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Jaejin Cho
R. Pappagari
Piotr Żelasko
Laureano Moro Velázquez
Jesús Villalba
Najim Dehak
SSL
15
13
0
10 Aug 2022
Nonwords Pronunciation Classification in Language Development Tests for
  Preschool Children
Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Ilja Baumann
Dominik Wagner
Sebastian P. Bayerl
Tobias Bocklet
19
5
0
16 Jun 2022
Singer Identification for Metaverse with Timbral and Middle-Level
  Perceptual Features
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
16
16
0
24 May 2022
Balanced Multimodal Learning via On-the-fly Gradient Modulation
Balanced Multimodal Learning via On-the-fly Gradient Modulation
Xiaokang Peng
Yake Wei
Andong Deng
Dong Wang
Di Hu
19
194
0
29 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with
  Utterance-Specific Dirichlet Priors
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors
Wen Wu
C. Zhang
Xixin Wu
P. Woodland
48
14
0
08 Mar 2022
Multimodal Emotion Recognition using Transfer Learning from Speaker
  Recognition and BERT-based models
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models
Sarala Padi
S. O. Sadjadi
Tianyi Zhou
Ram D. Sriram
28
36
0
16 Feb 2022
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced
  Speech Emotion Recognition
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition
Ayoub Ghriss
Bo Yang
Viktor Rozgic
Elizabeth Shriberg
Chao Wang
22
21
0
27 Jan 2022
Emotional Speaker Identification using a Novel Capsule Nets Model
Emotional Speaker Identification using a Novel Capsule Nets Model
Ali Bou Nassif
I. Shahin
A. Elnagar
Divya Velayudhan
A. Alhudhaif
K. Polat
14
28
0
09 Jan 2022
X-Vector based voice activity detection for multi-genre broadcast
  speech-to-text
X-Vector based voice activity detection for multi-genre broadcast speech-to-text
Misa Ogura
Matt Haynes
9
0
0
09 Dec 2021
Beyond Isolated Utterances: Conversational Emotion Recognition
Beyond Isolated Utterances: Conversational Emotion Recognition
R. Pappagari
Piotr Żelasko
Jesús Villalba
Laureano Moro Velázquez
Najim Dehak
14
4
0
13 Sep 2021
Classification of Emotions and Evaluation of Customer Satisfaction from
  Speech in Real World Acoustic Environments
Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments
L. F. Parra-Gallego
J. Orozco-Arroyave
6
17
0
26 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and
  Spectrogram Augmentation
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Sarala Padi
S. O. Sadjadi
Tianyi Zhou
Ram D. Sriram
16
34
0
05 Aug 2021
The Role of Phonetic Units in Speech Emotion Recognition
The Role of Phonetic Units in Speech Emotion Recognition
Jiahong Yuan
Xingyu Cai
Renjie Zheng
Liang Huang
Kenneth Ward Church
15
15
0
02 Aug 2021
Significance of Speaker Embeddings and Temporal Context for Depression
  Detection
Significance of Speaker Embeddings and Temporal Context for Depression Detection
Sri Harsha Dumpala
Sebastian Rodriguez
S. Rempel
Rudolf Uher
Sageev Oore
12
4
0
24 Jul 2021
An Attribute-Aligned Strategy for Learning Speech Representation
An Attribute-Aligned Strategy for Learning Speech Representation
Yu-Lin Huang
Bo-Hao Su
Y.-W. Peter Hong
Chi-Chun Lee
13
5
0
05 Jun 2021
Multi-Modal Emotion Detection with Transfer Learning
Multi-Modal Emotion Detection with Transfer Learning
Amith Ananthram
Kailash Saravanakumar
Jessica Huynh
Homayoon Beigi
13
3
0
13 Nov 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition
CopyPaste: An Augmentation Method for Speech Emotion Recognition
R. Pappagari
Jesús Villalba
Piotr Żelasko
Laureano Moro Velázquez
Najim Dehak
6
39
0
27 Oct 2020
Leveraging speaker attribute information using multi task learning for
  speaker verification and diarization
Leveraging speaker attribute information using multi task learning for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
22
8
0
27 Oct 2020
Emotion recognition by fusing time synchronous and time asynchronous
  representations
Emotion recognition by fusing time synchronous and time asynchronous representations
Wen Wu
Chao Zhang
P. Woodland
14
67
0
27 Oct 2020
End-to-end Triplet Loss based Emotion Embedding System for Speech
  Emotion Recognition
End-to-end Triplet Loss based Emotion Embedding System for Speech Emotion Recognition
Puneet Kumar
S. Jain
Balasubramanian Raman
P. Roy
Masakazu Iwamura
10
24
0
13 Oct 2020
They are wearing a mask! Identification of Subjects Wearing a Surgical
  Mask from their Speech by means of x-vectors and Fisher Vectors
They are wearing a mask! Identification of Subjects Wearing a Surgical Mask from their Speech by means of x-vectors and Fisher Vectors
José Vicente Egas López
8
0
0
23 Aug 2020
Transformer based unsupervised pre-training for acoustic representation
  learning
Transformer based unsupervised pre-training for acoustic representation learning
Ruixiong Zhang
Haiwei Wu
Wubo Li
Dongwei Jiang
Wei Zou
Xiangang Li
SSL
ViT
25
27
0
29 Jul 2020
Double Multi-Head Attention for Speaker Verification
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020
Pathological speech detection using x-vector embeddings
Catarina Botelho
Francisco Teixeira
T. Rolland
A. Abad
Isabel Trancoso
13
11
0
02 Mar 2020
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
27
321
0
10 Nov 2019
Probing the Information Encoded in X-vectors
Probing the Information Encoded in X-vectors
Desh Raj
David Snyder
Daniel Povey
Sanjeev Khudanpur
37
84
0
13 Sep 2019
1