Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

14 April 2018

Papers citing "Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System"

50 / 146 papers shown

Title
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models Xiaoxiao Miao Xin Wang Erica Cooper Junichi Yamagishi N. Tomashenko 64 25 0 26 Feb 2022
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge Weiqing Wang Xiaoyi Qin Ming Li 19 27 0 06 Feb 2022
Graph attentive feature aggregation for text-independent speaker verification Hye-jin Shim Ju-Sung Heo Jae-han Park Gareth Lee Ha-Jin Yu 35 16 0 23 Dec 2021
Explore Long-Range Context feature for Speaker Verification Zhuo Li 33 6 0 14 Dec 2021
Self-Supervised Speaker Verification with Simple Siamese Network and Self-Supervised Regularization Mufan Sang Haoqi Li F. Liu Andrew O. Arnold Li Wan SSL 16 40 0 08 Dec 2021
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone Edresson Casanova Julian Weber C. Shulby Arnaldo Cândido Júnior Eren Golge M. Ponti 185 382 0 04 Dec 2021
A Study on Decoupled Probabilistic Linear Discriminant Analysis Ding Wang Lantian Li Hongzhi Yu Dong Wang 11 0 0 24 Nov 2021
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Haozhe Zhang Zexin Cai Xiaoyi Qin Ming Li 54 15 0 06 Nov 2021
A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data Madina Abdrakhmanova Siwen Guo Yerbolat Khassanov Shohreh Haddadan 11 5 0 23 Oct 2021
Real Additive Margin Softmax for Speaker Verification Lantian Li Ruiqian Nai Dong Wang 6 14 0 18 Oct 2021
Simple Attention Module based Speaker Verification with Iterative noisy label detection Xiaoyi Qin Na Li Chao Weng Dan Su Ming Li NoLa 65 50 0 13 Oct 2021
Multi-View Self-Attention Based Transformer for Speaker Recognition Rui Wang Junyi Ao Long Zhou Shujie Liu Zhihua Wei Tom Ko Qing Li Yu Zhang ViT 14 31 0 11 Oct 2021
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification Qingjian Lin Lin Yang Xuyang Wang Xiaoyi Qin Junjie Wang Ming Li 30 21 0 09 Oct 2021
Multi-task Voice Activated Framework using Self-supervised Learning Shehzeen Samarah Hussain V. Nguyen Shuhua Zhang Erik M. Visser SSL 27 12 0 03 Oct 2021
Beijing ZKJ-NPU Speaker Verification System for VoxCeleb Speaker Recognition Challenge 2021 Li Zhang Huan Zhao Qinling Meng Yanli Chen Min Liu Lei Xie 32 10 0 08 Sep 2021
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge Danwei Cai Ming Li 11 15 0 07 Sep 2021
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints Ji-Hoon Kim Sang-Hoon Lee Ji-Hyun Lee Hong G Jung Seong-Whan Lee 47 6 0 16 Aug 2021
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds Xuan Shi Erica Cooper Junichi Yamagishi 33 7 0 24 Jul 2021
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge Zhuo Li Ce Fang Runqiu Xiao Zhigao Chen Wenchao Wang Yonghong Yan 25 2 0 03 Jul 2021
Adaptive Margin Circle Loss for Speaker Verification Runqiu Xiao 33 11 0 15 Jun 2021
Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions Roman Bedyakin N. Mikhaylovskiy 14 5 0 31 May 2021
Accent Recognition with Hybrid Phonetic Features Zhan Zhang Xi Chen Yuehai Wang Jianyi Yang 24 18 0 05 May 2021
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Yaogen Yang Haozhe Zhang Xiaoyi Qin Shanshan Liang Huahua Cui Mingyang Xu Ming Li 53 4 0 22 Apr 2021
Binary Neural Network for Speaker Verification Tinglong Zhu Xiaoyi Qin Ming Li MQ 21 12 0 06 Apr 2021
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model Edresson Casanova C. Shulby Eren Golge Nicolas Müller F. S. Oliveira Arnaldo Cândido Júnior A. S. Soares S. Aluísio M. Ponti 16 98 0 02 Apr 2021
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge Weiqing Wang Qingjian Lin Danwei Cai Lin Yang Ming Li 13 8 0 06 Feb 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition Lantian Li Dong Wang Jiawen Kang Renyu Wang Jingqian Wu Zhendong Gao Xiao Chen 13 7 0 23 Dec 2020
CN-Celeb: multi-genre speaker recognition Lantian Li Ruiqi Liu Jiawen Kang Yue Fan Hao Cui Yunqi Cai Ravichander Vipperla T. Zheng Dong Wang 33 119 0 23 Dec 2020
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning Mufan Sang Wei Xia John H. L. Hansen OOD DRL 21 23 0 12 Dec 2020
Exploring wav2vec 2.0 on speaker verification and language identification Zhiyun Fan Meng Li Shiyu Zhou Bo Xu 117 202 0 11 Dec 2020
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Paul-Gauthier Noé Mohammad MohammadAmini D. Matrouf Titouan Parcollet Andreas Nautsch J. Bonastre 29 28 0 08 Dec 2020
A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data Weicheng Cai Ming Li 23 3 0 01 Dec 2020
Look who's not talking Youngki Kwon Hee-Soo Heo Jaesung Huh Bong-Jin Lee Joon Son Chung 4 29 0 30 Nov 2020
Deep Discriminative Feature Learning for Accent Recognition Wei Wang Chao Zhang Xiao-pei Wu 34 2 0 25 Nov 2020
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification Xiaoyi Qin Yaogen Yang Lin Yang Xuyang Wang Junjie Wang Ming Li 24 0 0 21 Nov 2020
Supervised attention for speaker recognition Seong Min Kye Joon Son Chung Hoirin Kim 15 11 0 10 Nov 2020
Non-local convolutional neural networks (nlcnn) for speaker recognition Haici Yang Hongda Mao Ruirui Li C. Ju Oguz H. Elibol 20 0 0 07 Nov 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training Yunqi Cai Lantian Li Dong Wang Andrew Abel 11 6 0 30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020 Yoohwan Kwon Hee-Soo Heo Bong-Jin Lee Joon Son Chung 26 59 0 29 Oct 2020
Playing a Part: Speaker Verification at the Movies A. Brown Jaesung Huh Arsha Nagrani Joon Son Chung Andrew Zisserman 18 23 0 29 Oct 2020
An iterative framework for self-supervised deep speaker representation learning Danwei Cai Weiqing Wang Ming Li SSL 19 37 0 25 Oct 2020
Learning Speaker Embedding from Text-to-Speech Jaejin Cho Piotr Żelasko Jesus Villalba Shinji Watanabe Najim Dehak 31 10 0 21 Oct 2020
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20) Muhammad Umair Ahmed Khan Javier Hernando DRL 6 3 0 21 Oct 2020
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm Jennifer Williams Yi Zhao Erica Cooper Junichi Yamagishi SSL 25 23 0 21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020 Shufan Shen Ran Miao Yi Wang Zhihua Wei 16 0 0 20 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments Youngmoon Jung Yeunju Choi Hyungjun Lim Hoirin Kim 19 13 0 06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020 Hee-Soo Heo Bong-Jin Lee Jaesung Huh Joon Son Chung 13 132 0 29 Sep 2020
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias Mufan Sang Wei Xia John H. L. Hansen 33 17 0 21 Sep 2020
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification Zhenyu Wang Wei Xia John H. L. Hansen 20 12 0 05 Sep 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning Amirhossein Hajavi Ali Etemad 24 2 0 03 Sep 2020