ResearchTrend.AI
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

14 April 2018
Weicheng Cai
Jinkun Chen
Ming Li

Papers citing "Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System"

50 / 146 papers shown
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
64
25
0
26 Feb 2022
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge
Weiqing Wang
Xiaoyi Qin
Ming Li
19
27
0
06 Feb 2022
Graph attentive feature aggregation for text-independent speaker verification
Hye-jin Shim
Ju-Sung Heo
Jae-han Park
Gareth Lee
Ha-Jin Yu
35
16
0
23 Dec 2021
Explore Long-Range Context feature for Speaker Verification
Zhuo Li
33
6
0
14 Dec 2021
Self-Supervised Speaker Verification with Simple Siamese Network and Self-Supervised Regularization
Mufan Sang
Haoqi Li
F. Liu
Andrew O. Arnold
Li Wan
SSL
16
40
0
08 Dec 2021
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
185
382
0
04 Dec 2021
A Study on Decoupled Probabilistic Linear Discriminant Analysis
Ding Wang
Lantian Li
Hongzhi Yu
Dong Wang
11
0
0
24 Nov 2021
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines
Haozhe Zhang
Zexin Cai
Xiaoyi Qin
Ming Li
54
15
0
06 Nov 2021
A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data
Madina Abdrakhmanova
Siwen Guo
Yerbolat Khassanov
Shohreh Haddadan
11
5
0
23 Oct 2021
Real Additive Margin Softmax for Speaker Verification
Lantian Li
Ruiqian Nai
Dong Wang
6
14
0
18 Oct 2021
Simple Attention Module based Speaker Verification with Iterative noisy label detection
Xiaoyi Qin
Na Li
Chao Weng
Dan Su
Ming Li
NoLa
65
50
0
13 Oct 2021
Multi-View Self-Attention Based Transformer for Speaker Recognition
Rui Wang
Junyi Ao
Long Zhou
Shujie Liu
Zhihua Wei
Tom Ko
Qing Li
Yu Zhang
ViT
14
31
0
11 Oct 2021
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification
Qingjian Lin
Lin Yang
Xuyang Wang
Xiaoyi Qin
Junjie Wang
Ming Li
30
21
0
09 Oct 2021
Multi-task Voice Activated Framework using Self-supervised Learning
Shehzeen Samarah Hussain
V. Nguyen
Shuhua Zhang
Erik M. Visser
SSL
27
12
0
03 Oct 2021
Beijing ZKJ-NPU Speaker Verification System for VoxCeleb Speaker Recognition Challenge 2021
Li Zhang
Huan Zhao
Qinling Meng
Yanli Chen
Min Liu
Lei Xie
32
10
0
08 Sep 2021
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge
Danwei Cai
Ming Li
11
15
0
07 Sep 2021
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Ji-Hoon Kim
Sang-Hoon Lee
Ji-Hyun Lee
Hong G Jung
Seong-Whan Lee
47
6
0
16 Aug 2021
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds
Xuan Shi
Erica Cooper
Junichi Yamagishi
33
7
0
24 Jul 2021
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge
Zhuo Li
Ce Fang
Runqiu Xiao
Zhigao Chen
Wenchao Wang
Yonghong Yan
25
2
0
03 Jul 2021
Adaptive Margin Circle Loss for Speaker Verification
Runqiu Xiao
33
11
0
15 Jun 2021
Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions
Roman Bedyakin
N. Mikhaylovskiy
14
5
0
31 May 2021
Accent Recognition with Hybrid Phonetic Features
Zhan Zhang
Xi Chen
Yuehai Wang
Jianyi Yang
24
18
0
05 May 2021
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss
Yaogen Yang
Haozhe Zhang
Xiaoyi Qin
Shanshan Liang
Huahua Cui
Mingyang Xu
Ming Li
53
4
0
22 Apr 2021
Binary Neural Network for Speaker Verification
Tinglong Zhu
Xiaoyi Qin
Ming Li
MQ
21
12
0
06 Apr 2021
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
Edresson Casanova
C. Shulby
Eren Golge
Nicolas Müller
F. S. Oliveira
Arnaldo Cândido Júnior
A. S. Soares
S. Aluísio
M. Ponti
16
98
0
02 Apr 2021
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge
Weiqing Wang
Qingjian Lin
Danwei Cai
Lin Yang
Ming Li
13
8
0
06 Feb 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
13
7
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
T. Zheng
Dong Wang
33
119
0
23 Dec 2020
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning
Mufan Sang
Wei Xia
John H. L. Hansen
OOD
DRL
21
23
0
12 Dec 2020
Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan
Meng Li
Shiyu Zhou
Bo Xu
117
202
0
11 Dec 2020
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation
Paul-Gauthier Noé
Mohammad MohammadAmini
D. Matrouf
Titouan Parcollet
Andreas Nautsch
J. Bonastre
29
28
0
08 Dec 2020
A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data
Weicheng Cai
Ming Li
23
3
0
01 Dec 2020
Look who's not talking
Youngki Kwon
Hee-Soo Heo
Jaesung Huh
Bong-Jin Lee
Joon Son Chung
4
29
0
30 Nov 2020
Deep Discriminative Feature Learning for Accent Recognition
Wei Wang
Chao Zhang
Xiao-pei Wu
34
2
0
25 Nov 2020
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification
Xiaoyi Qin
Yaogen Yang
Lin Yang
Xuyang Wang
Junjie Wang
Ming Li
24
0
0
21 Nov 2020
Supervised attention for speaker recognition
Seong Min Kye
Joon Son Chung
Hoirin Kim
15
11
0
10 Nov 2020
Non-local convolutional neural networks (nlcnn) for speaker recognition
Haici Yang
Hongda Mao
Ruirui Li
C. Ju
Oguz H. Elibol
20
0
0
07 Nov 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
11
6
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
26
59
0
29 Oct 2020
Playing a Part: Speaker Verification at the Movies
A. Brown
Jaesung Huh
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
18
23
0
29 Oct 2020
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
19
37
0
25 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
31
10
0
21 Oct 2020
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)
Muhammad Umair Ahmed Khan
Javier Hernando
DRL
6
3
0
21 Oct 2020
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
25
23
0
21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020
Shufan Shen
Ran Miao
Yi Wang
Zhihua Wei
16
0
0
20 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Youngmoon Jung
Yeunju Choi
Hyungjun Lim
Hoirin Kim
19
13
0
06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
13
132
0
29 Sep 2020
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias
Mufan Sang
Wei Xia
John H. L. Hansen
33
17
0
21 Sep 2020
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification
Zhenyu Wang
Wei Xia
John H. L. Hansen
20
12
0
05 Sep 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
Amirhossein Hajavi
Ali Etemad
24
2
0
03 Sep 2020