ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.05160
  4. Cited By
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and
  Language Recognition System

Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

14 April 2018
Weicheng Cai
Jinkun Chen
Ming Li
ArXivPDFHTML

Papers citing "Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System"

46 / 146 papers shown
Title
Speaker Representation Learning using Global Context Guided Channel and
  Time-Frequency Transformations
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
22
9
0
02 Sep 2020
Cross attentive pooling for speaker verification
Cross attentive pooling for speaker verification
Seong Min Kye
Yoohwan Kwon
Joon Son Chung
25
9
0
13 Aug 2020
Mask Detection and Breath Monitoring from Speech: on Data Augmentation,
  Feature Representation and Modeling
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling
Haiwei Wu
Lin Zhang
Lin Yang
Xuyang Wang
Junjie Wang
Dong Zhang
Ming Li
14
2
0
12 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Intra-class variation reduction of speaker representation in
  disentanglement framework
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
14
21
0
04 Aug 2020
Self-attention encoding and pooling for speaker recognition
Self-attention encoding and pooling for speaker recognition
Pooyan Safari
Miquel India
Javier Hernando
ViT
22
81
0
03 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
28
58
0
29 Jul 2020
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and
  Normalization for End-to-End Speaker Verification System
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System
Soonshin Seo
Ji-Hwan Kim
26
0
0
27 Jul 2020
Double Multi-Head Attention for Speaker Verification
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020
Deep multi-metric learning for text-independent speaker verification
Deep multi-metric learning for text-independent speaker verification
Jiwei Xu
Xinggang Wang
Bin Feng
Wenyu Liu
49
25
0
17 Jul 2020
ResNeXt and Res2Net Structures for Speaker Verification
ResNeXt and Res2Net Structures for Speaker Verification
Tianyan Zhou
Yong Zhao
Jian Wu
12
27
0
06 Jul 2020
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge
Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
42
47
0
16 May 2020
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE
  Speech Waveform Reconstruction
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction
Yi Zhao
Haoyu Li
Cheng-I Jeff Lai
Jennifer Williams
Erica Cooper
Junichi Yamagishi
42
18
0
16 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep
  Transfer with Feedback Constraint
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint
Zexin Cai
Chuxiong Zhang
Ming Li
24
41
0
10 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding
Tianlong Chen
Xinyu Gong
Weiwei Zha
Zhangyang Wang
28
57
0
07 May 2020
Deep Normalization for Speaker Vectors
Deep Normalization for Speaker Vectors
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
42
25
0
07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for
  Robust Speaker Verification of Variable-Duration Utterances
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Youngmoon Jung
Seong Min Kye
Yeunju Choi
Myunghun Jung
Hoirin Kim
23
36
0
07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance
  Length Pairs
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye
Youngmoon Jung
Haebeom Lee
Sung Ju Hwang
Hoirin Kim
38
49
0
06 Apr 2020
In defence of metric learning for speaker recognition
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
32
433
0
26 Mar 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the
  DKU-LENOVO Team
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team
Qingjian Lin
Weicheng Cai
Lin Yang
Junjie Wang
J. Zhang
Ming Li
VLM
13
18
0
23 Feb 2020
An end-to-end approach for the verification problem: learning the right
  distance
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
32
6
0
21 Feb 2020
Within-sample variability-invariant loss for robust speaker recognition
  under noisy environments
Within-sample variability-invariant loss for robust speaker recognition under noisy environments
Danwei Cai
Weicheng Cai
Ming Li
32
47
0
03 Feb 2020
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding
Soonshin Seo
Ji-Hwan Kim
20
0
0
28 Jan 2020
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and
  the Baselines
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines
Xiaoyi Qin
Hui Bu
Ming Li
36
67
0
03 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
29
84
0
30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial
  Learning based Disentangled Representation
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation
Jianwei Tai
Xiaoqi Jia
Qingjia Huang
Weijuan Zhang
Haichao Du
Shengzhi Zhang
24
1
0
27 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center
  learning for text-independent speaker verification
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
Zhongxin Bai
Xiao-Lei Zhang
Jingdong Chen
23
29
0
19 Nov 2019
Delving into VoxCeleb: environment invariant speaker recognition
Delving into VoxCeleb: environment invariant speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
33
50
0
24 Oct 2019
Frequency and temporal convolutional attention for text-independent
  speaker recognition
Frequency and temporal convolutional attention for text-independent speaker recognition
Sarthak Yadav
A. Rai
60
58
0
16 Oct 2019
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks
  for Robust Speaker Verification
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Youngmoon Jung
Yeunju Choi
Hoirin Kim
12
17
0
26 Sep 2019
VAE-based Domain Adaptation for Speaker Verification
VAE-based Domain Adaptation for Speaker Verification
Xueyi Wang
Lantian Li
Dong Wang
24
16
0
27 Aug 2019
LSTM based Similarity Measurement with Spectral Clustering for Speaker
  Diarization
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization
Qingjian Lin
Ruiqing Yin
Ming Li
H. Bredin
C. Barras
11
90
0
23 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty
  Normalized HMM-based i-vectors
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors
N. Maghsoodi
Hossein Sameti
Hossein Zeinali
Themos Stafylakis
22
13
0
13 Jul 2019
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data
  Augmentation, Feature Representation, Classification, and Fusion
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion
Weicheng Cai
Haiwei Wu
Danwei Cai
Ming Li
8
53
0
05 Jul 2019
BERTphone: Phonetically-Aware Encoder Representations for
  Utterance-Level Speaker and Language Recognition
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
30
28
0
30 Jun 2019
Self Multi-Head Attention for Speaker Recognition
Self Multi-Head Attention for Speaker Recognition
Miquel India
Pooyan Safari
Javier Hernando
19
110
0
24 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for
  Text-Independent Speaker Verification
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network
  Embeddings for Speaker Recognition
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Xu Xiang
Shuai Wang
Houjun Huang
Y. Qian
Kai Yu
DRL
24
142
0
18 Jun 2019
STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
Sergey Novoselov
Aleksei Gusev
Artem Ivanov
Timur Pekhovsky
Andrey Shulipa
G. Lavrentyeva
V. Volokhov
Alexander Kozlov
22
25
0
12 Apr 2019
VAE-based regularization for deep speaker embedding
VAE-based regularization for deep speaker embedding
Yang Zhang
Lantian Li
Dong Wang
DRL
BDL
19
19
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
11
87
0
07 Apr 2019
Contrastive Predictive Coding Based Feature for Automatic Speaker
  Verification
Contrastive Predictive Coding Based Feature for Automatic Speaker Verification
Cheng-I Jeff Lai
SSL
32
27
0
01 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
19
343
0
26 Feb 2019
Utterance-level end-to-end language identification using attention-based
  CNN-BLSTM
Utterance-level end-to-end language identification using attention-based CNN-BLSTM
Weicheng Cai
Danwei Cai
Shen Huang
Ming Li
14
50
0
20 Feb 2019
Unified Hypersphere Embedding for Speaker Recognition
Unified Hypersphere Embedding for Speaker Recognition
Mahdi Hajibabaei
Dengxin Dai
24
86
0
22 Jul 2018
Analysis of Length Normalization in End-to-End Speaker Verification
  System
Analysis of Length Normalization in End-to-End Speaker Verification System
Weicheng Cai
Jinkun Chen
Ming Li
VLM
22
39
0
08 Jun 2018
Previous
123