Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

14 April 2018

Papers citing "Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System"

46 / 146 papers shown

Title
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations Wei Xia John H. L. Hansen 22 9 0 02 Sep 2020
Cross attentive pooling for speaker verification Seong Min Kye Yoohwan Kwon Joon Son Chung 25 9 0 13 Aug 2020
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling Haiwei Wu Lin Zhang Lin Yang Xuyang Wang Junjie Wang Dong Zhang Ming Li 14 2 0 12 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning Jing-Xuan Zhang Zhenhua Ling Lirong Dai 15 6 0 05 Aug 2020
Intra-class variation reduction of speaker representation in disentanglement framework Yoohwan Kwon Soo-Whan Chung Hong-Goo Kang DRL 14 21 0 04 Aug 2020
Self-attention encoding and pooling for speaker recognition Pooyan Safari Miquel India Javier Hernando ViT 22 81 0 03 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations Ranya Aloufi Hamed Haddadi David E. Boyle DRL 28 58 0 29 Jul 2020
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System Soonshin Seo Ji-Hwan Kim 26 0 0 27 Jul 2020
Double Multi-Head Attention for Speaker Verification Miquel India Pooyan Safari Javier Hernando 28 18 0 26 Jul 2020
Deep multi-metric learning for text-independent speaker verification Jiwei Xu Xinggang Wang Bin Feng Wenyu Liu 49 25 0 17 Jul 2020
ResNeXt and Res2Net Structures for Speaker Verification Tianyan Zhou Yong Zhao Jian Wu 12 27 0 06 Jul 2020
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge Xiaoyi Qin Ming Li Hui Bu Wei Rao Rohan Kumar Das Shrikanth Narayanan Haizhou Li 42 47 0 16 May 2020
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction Yi Zhao Haoyu Li Cheng-I Jeff Lai Jennifer Williams Erica Cooper Junichi Yamagishi 42 18 0 16 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint Zexin Cai Chuxiong Zhang Ming Li 24 41 0 10 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition Shaojin Ding Tianlong Chen Xinyu Gong Weiwei Zha Zhangyang Wang 28 57 0 07 May 2020
Deep Normalization for Speaker Vectors Yunqi Cai Lantian Li Dong Wang Andrew Abel 42 25 0 07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances Youngmoon Jung Seong Min Kye Yeunju Choi Myunghun Jung Hoirin Kim 23 36 0 07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs Seong Min Kye Youngmoon Jung Haebeom Lee Sung Ju Hwang Hoirin Kim 38 49 0 06 Apr 2020
In defence of metric learning for speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun Minjae Lee Hee-Soo Heo Soyeon Choe Chiheon Ham Sung-Ye Jung Bong-Jin Lee Icksang Han 32 433 0 26 Mar 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team Qingjian Lin Weicheng Cai Lin Yang Junjie Wang J. Zhang Ming Li VLM 13 18 0 23 Feb 2020
An end-to-end approach for the verification problem: learning the right distance João Monteiro Isabela Albuquerque Md. Jahangir Alam R. Devon Hjelm T. Falk 32 6 0 21 Feb 2020
Within-sample variability-invariant loss for robust speaker recognition under noisy environments Danwei Cai Weicheng Cai Ming Li 32 47 0 03 Feb 2020
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding Soonshin Seo Ji-Hwan Kim 20 0 0 28 Jan 2020
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines Xiaoyi Qin Hui Bu Ming Li 36 67 0 03 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey Shervin Minaee AmirAli Abdolrashidi Hang Su Bennamoun David C. Zhang 29 84 0 30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation Jianwei Tai Xiaoqi Jia Qingjia Huang Weijuan Zhang Haichao Du Shengzhi Zhang 24 1 0 27 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification Zhongxin Bai Xiao-Lei Zhang Jingdong Chen 23 29 0 19 Nov 2019
Delving into VoxCeleb: environment invariant speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun 33 50 0 24 Oct 2019
Frequency and temporal convolutional attention for text-independent speaker recognition Sarthak Yadav A. Rai 60 58 0 16 Oct 2019
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification Youngmoon Jung Yeunju Choi Hoirin Kim 12 17 0 26 Sep 2019
VAE-based Domain Adaptation for Speaker Verification Xueyi Wang Lantian Li Dong Wang 24 16 0 27 Aug 2019
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization Qingjian Lin Ruiqing Yin Ming Li H. Bredin C. Barras 11 90 0 23 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors N. Maghsoodi Hossein Sameti Hossein Zeinali Themos Stafylakis 22 13 0 13 Jul 2019
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion Weicheng Cai Haiwei Wu Danwei Cai Ming Li 8 53 0 05 Jul 2019
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition Shaoshi Ling Julian Salazar Yuzong Liu Katrin Kirchhoff SSL 30 28 0 30 Jun 2019
Self Multi-Head Attention for Speaker Recognition Miquel India Pooyan Safari Javier Hernando 19 110 0 24 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification Youngmoon Jung Younggwan Kim Hyungjun Lim Yeunju Choi Hoirin Kim 21 32 0 19 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition Xu Xiang Shuai Wang Houjun Huang Y. Qian Kai Yu DRL 24 142 0 18 Jun 2019
STC Speaker Recognition Systems for the VOiCES From a Distance Challenge Sergey Novoselov Aleksei Gusev Artem Ivanov Timur Pekhovsky Andrey Shulipa G. Lavrentyeva V. Volokhov Alexander Kozlov 22 25 0 12 Apr 2019
VAE-based regularization for deep speaker embedding Yang Zhang Lantian Li Dong Wang DRL BDL 19 19 0 07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification Suwon Shon Hao Tang James R. Glass VLM 11 87 0 07 Apr 2019
Contrastive Predictive Coding Based Feature for Automatic Speaker Verification Cheng-I Jeff Lai SSL 32 27 0 01 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild Weidi Xie Arsha Nagrani Joon Son Chung Andrew Zisserman 19 343 0 26 Feb 2019
Utterance-level end-to-end language identification using attention-based CNN-BLSTM Weicheng Cai Danwei Cai Shen Huang Ming Li 14 50 0 20 Feb 2019
Unified Hypersphere Embedding for Speaker Recognition Mahdi Hajibabaei Dengxin Dai 24 86 0 22 Jul 2018
Analysis of Length Normalization in End-to-End Speaker Verification System Weicheng Cai Jinkun Chen Ming Li VLM 22 39 0 08 Jun 2018