In defence of metric learning for speaker recognition

26 March 2020

Joon Son Chung

Papers citing "In defence of metric learning for speaker recognition"

Showing 50 of 79 citing papers

USED: Universal Speaker Extraction and Diarization Junyi Ao Mehmet Sinan Yildirim Ruijie Tao Mengyao Ge Shuai Wang Yan-min Qian Haizhou Li 41 6 0 17 Jan 2025
Exploring synthetic data for cross-speaker style transfer in style representation based TTS Lucas Ueda Leonardo B. de M. M. Marques Flávio O. Simões Mário Uliani Neto Fernando Runstein Bianca Dal Bó Paula D. P. Costa 26 0 0 25 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction Bang Zeng Ming Li 37 2 0 04 Sep 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning Shuai Wang Zheng-Shou Chen Kong Aik Lee Yan-min Qian Haizhou Li 39 4 0 21 Jul 2024
Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models Victor Miara Theo Lepage Reda Dehak 37 1 0 04 Jun 2024
CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild Xingqun Qi Hengyuan Zhang Yatian Wang J. Pan Chen Liu ... Qixun Zhang Shanghang Zhang Wenhan Luo Qifeng Liu Qi-fei Liu DiffM SLR 110 5 0 27 May 2024
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution Tien-Hong Lo Fu-An Chao Tzu-I Wu Yao-Ting Sung Berlin Chen 23 3 0 11 Apr 2024
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices Beltrán Labrador Manuel Otero-Gonzalez Alicia Lozano-Diez D. Ramos-Castro Doroteo T. Toledano Joaquín González-Rodríguez 21 0 0 20 Dec 2023
NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification Hyunjun Heo U.H. Shin Ran Lee YoungJu Cheon Hyung-Min Park 26 9 0 14 Dec 2023
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation Xingqun Qi Jiahao Pan Peng Li Ruibin Yuan Xiaowei Chi ... Wenhan Luo Wei Xue Shanghang Zhang Qi-fei Liu Yi-Ting Guo SLR 34 11 0 29 Nov 2023
Deep Neural Networks for Automatic Speaker Recognition Do Not Learn Supra-Segmental Temporal Features Daniel Neururer Volker Dellwo Thilo Stadelmann 21 2 0 01 Nov 2023
An Initial Investigation of Neural Replay Simulator for Over-the-Air Adversarial Perturbations to Automatic Speaker Verification Jiaqi Li Li Wang Liumeng Xue Lei Wang Zhizheng Wu AAML 27 3 0 09 Oct 2023
PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification Wonbin Kim Hyun-Seo Shin Ju-ho Kim Ju-Sung Heo Chanmann Lim Ha-Jin Yu 23 0 0 20 Jul 2023
Exploring Binary Classification Loss For Speaker Verification Bing Han Zhengyang Chen Y. Qian CVBM 24 10 0 17 Jul 2023
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction Rohit Paturi S. Srinivasan Xiang Li 18 13 0 15 Jun 2023
Experimenting with Additive Margins for Contrastive Self-Supervised Speaker Verification Theo Lepage Reda Dehak SSL 13 3 0 06 Jun 2023
Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting Systems Manuele Rusci Tinne Tuytelaars 27 5 0 03 Jun 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings L. Serafini Samuele Cornell Giovanni Morrone Enrico Zovato A. Brutti S. Squartini 47 9 0 29 May 2023
A Study on Bias and Fairness In Deep Speaker Recognition Amirhossein Hajavi Ali Etemad 27 2 0 14 Mar 2023
I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification Challenge Jagabandhu Mishra Mrinmoy Bhattacharjee S. M. I. S. R. Mahadeva Prasanna 16 1 0 26 Feb 2023
Interpretable Spectrum Transformation Attacks to Speaker Recognition Jiadi Yao H. Luo Xiao-Lei Zhang AAML 32 1 0 21 Feb 2023
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge Jaesung Huh A. Brown Jee-weon Jung Joon Son Chung Arsha Nagrani D. Garcia-Romero Andrew Zisserman 23 26 0 20 Feb 2023
Residual Information in Deep Speaker Embedding Architectures Adriana Stan 34 5 0 06 Feb 2023
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection Rahul Sharma Shrikanth Narayanan 37 8 0 01 Dec 2022
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities Yikang Wang Xingming Wang Hiromitsu Nishizaki Ming Li 24 6 0 12 Nov 2022
High-resolution embedding extractor for speaker diarisation Hee-Soo Heo Youngki Kwon Bong-Jin Lee You Jin Kim Jee-weon Jung 29 5 0 08 Nov 2022
LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification Xingqi Chen Jie Wang Xiaoli Zhang Weiqiang Zhang Kunde Yang AAML 26 7 0 02 Nov 2022
Metric Learning for User-defined Keyword Spotting Jaemin Jung You-kyong Kim Jihwan Park Youshin Lim Byeong-Yeol Kim Youngjoon Jang Joon Son Chung 40 9 0 01 Nov 2022
Symmetric Saliency-based Adversarial Attack To Speaker Identification Jiadi Yao Xing Chen Xiao-Lei Zhang Weiqiang Zhang Kunde Yang AAML 31 8 0 30 Oct 2022
Speaker Representation Learning via Contrastive Loss with Maximal Speaker Separability Zhe Li Man-Wai Mak SSL 23 6 0 29 Oct 2022
Privacy-preserving Automatic Speaker Diarization Francisco Teixeira A. Abad Bhiksha Raj Isabel Trancoso 27 4 0 26 Oct 2022
Deepfake audio detection by speaker verification Alessandro Pianese D. Cozzolino Giovanni Poggi L. Verdoliva 38 38 0 28 Sep 2022
Unsupervised active speaker detection in media content using cross-modal information Rahul Sharma Shrikanth Narayanan 21 3 0 24 Sep 2022
Disentangled Speaker Representation Learning via Mutual Information Minimization Sung Hwan Mun Mingrui Han Minchan Kim Dongjune Lee N. Kim DRL 41 9 0 17 Aug 2022
Generating gender-ambiguous voices for privacy-preserving speech recognition Dimitrios Stoidis Andrea Cavallaro 36 14 0 03 Jul 2022
Personalized Keyword Spotting through Multi-task Learning Seunghan Yang Byeonggeun Kim Inseop Chung Simyung Chang 23 8 0 28 Jun 2022
Domain Agnostic Few-shot Learning for Speaker Verification Seunghan Yang Debasmit Das Jang Hyun Cho Hyoungwoo Park Sungrack Yun OOD 19 7 0 28 Jun 2022
Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification Byeonggeun Kim Seunghan Yang Jangho Kim Hyunsin Park Juntae Lee Simyung Chang 43 28 0 24 Jun 2022
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification Nan Zhang Jianzong Wang Zhenhou Hong Chendong Zhao Xiaoyang Qu Jing Xiao 34 5 0 26 May 2022
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions Wonjune Kang M. Hasegawa-Johnson D. Roy 32 8 0 19 May 2022
Efficient dynamic filter for robust and low computational feature extraction Donghyeon Kim Gwantae Kim Bokyeung Lee Jeong-gi Kwak D. Han Hanseok Ko 28 3 0 03 May 2022
Baselines and Protocols for Household Speaker Recognition A. Sholokhov Xuechen Liu Md. Sahidullah Tomi Kinnunen 25 4 0 30 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches Zifeng Zhao Dongchao Yang Rongzhi Gu Haoran Zhang Yuexian Zou 23 16 0 04 Apr 2022
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification Sung Hwan Mun Jee-weon Jung Min Hyun Han N. Kim 50 21 0 03 Apr 2022
Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification Yen-Lun Liao Xuan-Bo Chen Chung-Che Wang J. Jang AAML 41 8 0 31 Mar 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map Seong-Hu Kim Hyeonuk Nam Yong-Hwa Park 22 9 0 29 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification Yang Zhang Zhiqiang Lv Haibin Wu Shanshan Zhang Pengfei Hu Zhiyong Wu Hung-yi Lee Helen Meng ViT 24 130 0 29 Mar 2022
Magnitude-aware Probabilistic Speaker Embeddings Nikita Kuzmin Igor Fedorov A. Sholokhov 27 7 0 28 Feb 2022
Contrastive-mixup learning for improved speaker verification Xin Zhang Minho Jin R. Cheng Ruirui Li Eunjung Han A. Stolcke AAML SSL 23 10 0 22 Feb 2022
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances Tianchi Liu Rohan Kumar Das Kong Aik Lee Haizhou Li 21 69 0 03 Feb 2022