Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.05160
Cited By
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System
14 April 2018
Weicheng Cai
Jinkun Chen
Ming Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System"
46 / 146 papers shown
Title
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
22
9
0
02 Sep 2020
Cross attentive pooling for speaker verification
Seong Min Kye
Yoohwan Kwon
Joon Son Chung
25
9
0
13 Aug 2020
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling
Haiwei Wu
Lin Zhang
Lin Yang
Xuyang Wang
Junjie Wang
Dong Zhang
Ming Li
14
2
0
12 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
14
21
0
04 Aug 2020
Self-attention encoding and pooling for speaker recognition
Pooyan Safari
Miquel India
Javier Hernando
ViT
22
81
0
03 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
28
58
0
29 Jul 2020
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System
Soonshin Seo
Ji-Hwan Kim
26
0
0
27 Jul 2020
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020
Deep multi-metric learning for text-independent speaker verification
Jiwei Xu
Xinggang Wang
Bin Feng
Wenyu Liu
49
25
0
17 Jul 2020
ResNeXt and Res2Net Structures for Speaker Verification
Tianyan Zhou
Yong Zhao
Jian Wu
12
27
0
06 Jul 2020
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge
Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
42
47
0
16 May 2020
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction
Yi Zhao
Haoyu Li
Cheng-I Jeff Lai
Jennifer Williams
Erica Cooper
Junichi Yamagishi
42
18
0
16 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint
Zexin Cai
Chuxiong Zhang
Ming Li
24
41
0
10 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding
Tianlong Chen
Xinyu Gong
Weiwei Zha
Zhangyang Wang
28
57
0
07 May 2020
Deep Normalization for Speaker Vectors
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
42
25
0
07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Youngmoon Jung
Seong Min Kye
Yeunju Choi
Myunghun Jung
Hoirin Kim
23
36
0
07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye
Youngmoon Jung
Haebeom Lee
Sung Ju Hwang
Hoirin Kim
38
49
0
06 Apr 2020
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
32
433
0
26 Mar 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team
Qingjian Lin
Weicheng Cai
Lin Yang
Junjie Wang
J. Zhang
Ming Li
VLM
13
18
0
23 Feb 2020
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
32
6
0
21 Feb 2020
Within-sample variability-invariant loss for robust speaker recognition under noisy environments
Danwei Cai
Weicheng Cai
Ming Li
32
47
0
03 Feb 2020
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding
Soonshin Seo
Ji-Hwan Kim
20
0
0
28 Jan 2020
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines
Xiaoyi Qin
Hui Bu
Ming Li
36
67
0
03 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
29
84
0
30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation
Jianwei Tai
Xiaoqi Jia
Qingjia Huang
Weijuan Zhang
Haichao Du
Shengzhi Zhang
24
1
0
27 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
Zhongxin Bai
Xiao-Lei Zhang
Jingdong Chen
23
29
0
19 Nov 2019
Delving into VoxCeleb: environment invariant speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
33
50
0
24 Oct 2019
Frequency and temporal convolutional attention for text-independent speaker recognition
Sarthak Yadav
A. Rai
60
58
0
16 Oct 2019
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Youngmoon Jung
Yeunju Choi
Hoirin Kim
12
17
0
26 Sep 2019
VAE-based Domain Adaptation for Speaker Verification
Xueyi Wang
Lantian Li
Dong Wang
24
16
0
27 Aug 2019
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization
Qingjian Lin
Ruiqing Yin
Ming Li
H. Bredin
C. Barras
11
90
0
23 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors
N. Maghsoodi
Hossein Sameti
Hossein Zeinali
Themos Stafylakis
22
13
0
13 Jul 2019
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion
Weicheng Cai
Haiwei Wu
Danwei Cai
Ming Li
8
53
0
05 Jul 2019
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
30
28
0
30 Jun 2019
Self Multi-Head Attention for Speaker Recognition
Miquel India
Pooyan Safari
Javier Hernando
19
110
0
24 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Xu Xiang
Shuai Wang
Houjun Huang
Y. Qian
Kai Yu
DRL
24
142
0
18 Jun 2019
STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
Sergey Novoselov
Aleksei Gusev
Artem Ivanov
Timur Pekhovsky
Andrey Shulipa
G. Lavrentyeva
V. Volokhov
Alexander Kozlov
22
25
0
12 Apr 2019
VAE-based regularization for deep speaker embedding
Yang Zhang
Lantian Li
Dong Wang
DRL
BDL
19
19
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
11
87
0
07 Apr 2019
Contrastive Predictive Coding Based Feature for Automatic Speaker Verification
Cheng-I Jeff Lai
SSL
32
27
0
01 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
19
343
0
26 Feb 2019
Utterance-level end-to-end language identification using attention-based CNN-BLSTM
Weicheng Cai
Danwei Cai
Shen Huang
Ming Li
14
50
0
20 Feb 2019
Unified Hypersphere Embedding for Speaker Recognition
Mahdi Hajibabaei
Dengxin Dai
24
86
0
22 Jul 2018
Analysis of Length Normalization in End-to-End Speaker Verification System
Weicheng Cai
Jinkun Chen
Ming Li
VLM
22
39
0
08 Jun 2018
Previous
1
2
3