ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08612
  4. Cited By
VoxCeleb: a large-scale speaker identification dataset
v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown
Title
Speaker Diarization with Region Proposal Network
Speaker Diarization with Region Proposal Network
Zili Huang
Shinji Watanabe
Yusuke Fujita
Leibny Paola García-Perera
Yiwen Shao
Daniel Povey
Sanjeev Khudanpur
59
60
0
14 Feb 2020
Deep Speaker Embeddings for Far-Field Speaker Recognition on Short
  Utterances
Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances
Aleksei Gusev
V. Volokhov
Tseren Andzhukaev
Sergey Novoselov
G. Lavrentyeva
...
Anastasia Avdeeva
Artem Ivanov
Alexander Kozlov
Timur Pekhovsky
Yuri N. Matveev
59
48
0
14 Feb 2020
Self-supervised learning for audio-visual speaker diarization
Self-supervised learning for audio-visual speaker diarization
Yifan Ding
Yong-mei Xu
Shi-Xiong Zhang
Yahuan Cong
Liqiang Wang
VLM
78
29
0
13 Feb 2020
AlignNet: A Unifying Approach to Audio-Visual Alignment
AlignNet: A Unifying Approach to Audio-Visual Alignment
Jianren Wang
Zhaoyuan Fang
Hang Zhao
57
37
0
12 Feb 2020
NPLDA: A Deep Neural PLDA Model for Speaker Verification
NPLDA: A Deep Neural PLDA Model for Speaker Verification
Shreyas Ramoji
Prashant Krishnan
Sriram Ganapathy
47
31
0
10 Feb 2020
An empirical analysis of information encoded in disentangled neural
  speaker representations
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
75
14
0
10 Feb 2020
$M^3$T: Multi-Modal Continuous Valence-Arousal Estimation in the Wild
M3M^3M3T: Multi-Modal Continuous Valence-Arousal Estimation in the Wild
Yuanhang Zhang
Rulin Huang
Jiabei Zeng
Shiguang Shan
Xilin Chen
CVBM
70
27
0
07 Feb 2020
LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis
LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis
Shreyas Ramoji
Prashant Krishnan
Bhargavram Mysore
Prachi Singh
Sriram Ganapathy
51
2
0
07 Feb 2020
An initial investigation on optimizing tandem speaker verification and
  countermeasure systems using reinforcement learning
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
28
2
0
06 Feb 2020
Within-sample variability-invariant loss for robust speaker recognition
  under noisy environments
Within-sample variability-invariant loss for robust speaker recognition under noisy environments
Danwei Cai
Weicheng Cai
Ming Li
64
47
0
03 Feb 2020
DropClass and DropAdapt: Dropping classes for deep speaker
  representation learning
DropClass and DropAdapt: Dropping classes for deep speaker representation learning
Chau Luu
P. Bell
Steve Renals
VLM
65
3
0
02 Feb 2020
Analysis of Deep Feature Loss based Enhancement for Speaker Verification
Analysis of Deep Feature Loss based Enhancement for Speaker Verification
Saurabh Kataria
P. S. Nidadavolu
Jesús Villalba
Najim Dehak
103
13
0
01 Feb 2020
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding
Soonshin Seo
Ji-Hwan Kim
24
0
0
28 Jan 2020
Pairwise Discriminative Neural PLDA for Speaker Verification
Pairwise Discriminative Neural PLDA for Speaker Verification
Shreyas Ramoji
Prashant Krishnan
Prachi Singh
Sriram Ganapathy
48
7
0
20 Jan 2020
Everybody's Talkin': Let Me Talk as You Want
Everybody's Talkin': Let Me Talk as You Want
Linsen Song
Wayne Wu
Chao Qian
Ran He
Chen Change Loy
DiffMVGen
79
147
0
15 Jan 2020
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Yanpei Shi
Qiang Huang
Thomas Hain
86
26
0
14 Jan 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
75
161
0
14 Jan 2020
Gaussian speaker embedding learning for text-independent speaker
  verification
Gaussian speaker embedding learning for text-independent speaker verification
Bin Gu
Wu Guo
BDL
57
1
0
14 Jan 2020
On the Resilience of Biometric Authentication Systems against Random
  Inputs
On the Resilience of Biometric Authentication Systems against Random Inputs
Benjamin Zi Hao Zhao
Hassan Jameel Asghar
M. Kâafar
AAML
133
23
0
13 Jan 2020
Learning Speaker Embedding with Momentum Contrast
Learning Speaker Embedding with Momentum Contrast
Ke Ding
Xuanji He
Guanglu Wan
SSL
108
10
0
07 Jan 2020
Destruction of Image Steganography using Generative Adversarial Networks
Destruction of Image Steganography using Generative Adversarial Networks
Isaac Corley
Jonathan Lwowski
Justin Hoffman
AAML
42
13
0
20 Dec 2019
Large-scale Multi-modal Person Identification in Real Unconstrained
  Environments
Large-scale Multi-modal Person Identification in Real Unconstrained Environments
Jiajie Ye
Y. Guan
Junfa Liu
Xinghong Huang
Kuanqi Cai
28
1
0
17 Dec 2019
Speech-driven facial animation using polynomial fusion of features
Speech-driven facial animation using polynomial fusion of features
Triantafyllos Kefalas
Konstantinos Vougioukas
Yannis Panagakis
Stavros Petridis
Jean Kossaifi
Maja Pantic
24
6
0
12 Dec 2019
Advances in Online Audio-Visual Meeting Transcription
Advances in Online Audio-Visual Meeting Transcription
Takuya Yoshioka
Igor Abramovski
Cem Aksoylar
Zhuo Chen
Moshe David
...
Huaming Wang
Zhenghao Wang
Jun Zhang
Yong Zhao
Tianyan Zhou
95
75
0
10 Dec 2019
A Multi Purpose and Large Scale Speech Corpus in Persian and English for
  Speaker and Speech Recognition: the DeepMine Database
A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Hossein Zeinali
L. Burget
J. Černocký
39
40
0
08 Dec 2019
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge
Joon Son Chung
Arsha Nagrani
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
81
60
0
05 Dec 2019
Smartphone Multi-modal Biometric Authentication: Database and Evaluation
Smartphone Multi-modal Biometric Authentication: Database and Evaluation
Raghavendra Ramachandra
Martin Stokkenes
A. Mohammadi
S. Venkatesh
Kiran Raja
Pankaj Wasnik
Eric Poiret
S´ebastien Marcel
Christoph Busch
CVBM
50
18
0
05 Dec 2019
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and
  the Baselines
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines
Xiaoyi Qin
Hui Bu
Ming Li
112
69
0
03 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Leibny Paola García-Perera
Jesus Villalba
H. Bredin
Jun Du
Diego Castán
...
Wassim Bouaziz
Hadrien Titeux
Emmanuel Dupoux
Kong Aik Lee
Najim Dehak
43
30
0
02 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
113
85
0
30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial
  Learning based Disentangled Representation
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation
Jianwei Tai
Xiaoqi Jia
Qingjia Huang
Weijuan Zhang
Haichao Du
Shengzhi Zhang
43
1
0
27 Nov 2019
Self-Enhanced Convolutional Network for Facial Video Hallucination
Self-Enhanced Convolutional Network for Facial Video Hallucination
Chaowei Fang
Guanbin Li
Xiaoguang Han
Yizhou Yu
SupR
65
9
0
23 Nov 2019
Voice-Face Cross-modal Matching and Retrieval: A Benchmark
Voice-Face Cross-modal Matching and Retrieval: A Benchmark
Chuyu Xiong
Deyuan Zhang
Tao Liu
Xiaoyong Du
CVBM
48
9
0
21 Nov 2019
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen
  Targets
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
S. Ha
Martin Kersner
Beomsu Kim
Seokjun Seo
Dongyoung Kim
CVBM
121
167
0
19 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center
  learning for text-independent speaker verification
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
Zhongxin Bai
Xiao-Lei Zhang
Jingdong Chen
77
29
0
19 Nov 2019
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System
Shuo Liu
Gil Keren
Björn Schuller
70
4
0
16 Nov 2019
Deep learning methods in speaker recognition: a review
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
66
46
0
14 Nov 2019
Adversarial Attacks on GMM i-vector based Speaker Verification Systems
Adversarial Attacks on GMM i-vector based Speaker Verification Systems
Xu Li
Jinghua Zhong
Xixin Wu
Jianwei Yu
Xunying Liu
Helen Meng
AAML
74
79
0
08 Nov 2019
The sound of my voice: speaker representation loss for target voice
  separation
The sound of my voice: speaker representation loss for target voice separation
Seongkyu Mun
Soyeon Choe
Jaesung Huh
Joon Son Chung
57
16
0
06 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted
  and replayed speech
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
83
12
0
05 Nov 2019
Voice Biometrics Security: Extrapolating False Alarm Rate via
  Hierarchical Bayesian Modeling of Speaker Verification Scores
Voice Biometrics Security: Extrapolating False Alarm Rate via Hierarchical Bayesian Modeling of Speaker Verification Scores
A. Sholokhov
Tomi Kinnunen
Ville Vestman
Kong Aik Lee
124
12
0
04 Nov 2019
Robust speaker recognition using unsupervised adversarial invariance
Robust speaker recognition using unsupervised adversarial invariance
Raghuveer Peri
Monisankha Pal
Arindam Jati
Krishna Somandepalli
Shrikanth Narayanan
56
24
0
03 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
Guangke Chen
Sen Chen
Lingling Fan
Xiaoning Du
Zhe Zhao
Fu Song
Yang Liu
AAML
114
197
0
03 Nov 2019
CN-CELEB: a challenging Chinese speaker recognition dataset
CN-CELEB: a challenging Chinese speaker recognition dataset
Yue Fan
Jiawen Kang
Lantian Li
Keliang Li
Haolin Chen
Sitong Cheng
Pengyuan Zhang
Ziya Zhou
Yunqi Cai
Dong Wang
97
206
0
31 Oct 2019
Mixture factorized auto-encoder for unsupervised hierarchical deep
  factorization of speech signal
Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal
Zhiyuan Peng
Siyuan Feng
Tan Lee
47
6
0
30 Oct 2019
Unsupervised Feature Enhancement for speaker verification
Unsupervised Feature Enhancement for speaker verification
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Leibny Paola García-Perera
Najim Dehak
69
18
0
25 Oct 2019
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Najim Dehak
62
24
0
25 Oct 2019
Adaptive blind audio source extraction supervised by dominant speaker
  identification using x-vectors
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors
Jakub Janský
J. Málek
Jaroslav Cmejla
Tomás Kounovský
Zbyněk Koldovský
J. Zdánský
BDL
50
27
0
25 Oct 2019
Channel adversarial training for speaker verification and diarization
Channel adversarial training for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
72
17
0
25 Oct 2019
Structural sparsification for Far-field Speaker Recognition with GNA
Structural sparsification for Far-field Speaker Recognition with GNA
Jingchi Zhang
Jonathan Huang
Michael Deisher
Hai Helen Li
Yiran Chen
31
0
0
25 Oct 2019
Previous
123...1920212223
Next