ResearchTrend.AI
VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
arXiv:1706.08612

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,098 papers shown
Everybody's Talkin': Let Me Talk as You Want
Linsen Song
Wayne Wu
Chao Qian
Ran He
Chen Change Loy
DiffM
VGen
28
143
0
15 Jan 2020
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Yanpei Shi
Qiang Huang
Thomas Hain
27
25
0
14 Jan 2020
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
31
156
0
14 Jan 2020
Gaussian speaker embedding learning for text-independent speaker verification
Bin Gu
Wu Guo
BDL
23
1
0
14 Jan 2020
On the Resilience of Biometric Authentication Systems against Random Inputs
Benjamin Zi Hao Zhao
Hassan Jameel Asghar
M. Kâafar
AAML
39
23
0
13 Jan 2020
Learning Speaker Embedding with Momentum Contrast
Ke Ding
Xuanji He
Guanglu Wan
SSL
25
10
0
07 Jan 2020
Destruction of Image Steganography using Generative Adversarial Networks
Isaac Corley
Jonathan Lwowski
Justin Hoffman
AAML
20
13
0
20 Dec 2019
Large-scale Multi-modal Person Identification in Real Unconstrained Environments
Jiajie Ye
Y. Guan
Junfa Liu
Xinghong Huang
Hong Zhang
18
1
0
17 Dec 2019
Speech-driven facial animation using polynomial fusion of features
Triantafyllos Kefalas
Konstantinos Vougioukas
Yannis Panagakis
Stavros Petridis
Jean Kossaifi
M. Pantic
22
6
0
12 Dec 2019
Advances in Online Audio-Visual Meeting Transcription
Takuya Yoshioka
Igor Abramovski
Cem Aksoylar
Zhuo Chen
Moshe David
...
Huaming Wang
Zhenghao Wang
Jun Zhang
Yong Zhao
Tianyan Zhou
23
74
0
10 Dec 2019
A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Hossein Zeinali
L. Burget
J. Černocký
13
39
0
08 Dec 2019
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge
Joon Son Chung
Arsha Nagrani
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
18
59
0
05 Dec 2019
Smartphone Multi-modal Biometric Authentication: Database and Evaluation
Raghavendra Ramachandra
Martin Stokkenes
A. Mohammadi
S. Venkatesh
Kiran Raja
Pankaj Wasnik
Eric Poiret
Sébastien Marcel
Christoph Busch
CVBM
23
18
0
05 Dec 2019
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines
Xiaoyi Qin
Hui Bu
Ming Li
28
67
0
03 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Leibny Paola García-Perera
Jesus Villalba
H. Bredin
Jun Du
Diego Castán
...
Wassim Bouaziz
Hadrien Titeux
Emmanuel Dupoux
Kong Aik Lee
Najim Dehak
16
29
0
02 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
21
84
0
30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation
Jianwei Tai
Xiaoqi Jia
Qingjia Huang
Weijuan Zhang
Haichao Du
Shengzhi Zhang
16
1
0
27 Nov 2019
Self-Enhanced Convolutional Network for Facial Video Hallucination
Chaowei Fang
Guanbin Li
Xiaoguang Han
Yizhou Yu
SupR
18
9
0
23 Nov 2019
Voice-Face Cross-modal Matching and Retrieval: A Benchmark
Chuyu Xiong
Deyuan Zhang
Tao Liu
Xiaoyong Du
CVBM
28
9
0
21 Nov 2019
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
S. Ha
Martin Kersner
Beomsu Kim
Seokjun Seo
Dongyoung Kim
CVBM
28
163
0
19 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
Zhongxin Bai
Xiao-Lei Zhang
Jingdong Chen
15
29
0
19 Nov 2019
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System
Shuo Liu
Gil Keren
Björn Schuller
12
4
0
16 Nov 2019
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
23
46
0
14 Nov 2019
Adversarial Attacks on GMM i-vector based Speaker Verification Systems
Xu Li
Jinghua Zhong
Xixin Wu
Jianwei Yu
Xunying Liu
Helen Meng
AAML
23
78
0
08 Nov 2019
The sound of my voice: speaker representation loss for target voice separation
Seongkyu Mun
Soyeon Choe
Jaesung Huh
Joon Son Chung
17
16
0
06 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
15
12
0
05 Nov 2019
Voice Biometrics Security: Extrapolating False Alarm Rate via Hierarchical Bayesian Modeling of Speaker Verification Scores
A. Sholokhov
Tomi Kinnunen
Ville Vestman
Kong Aik Lee
16
12
0
04 Nov 2019
Robust speaker recognition using unsupervised adversarial invariance
Raghuveer Peri
Monisankha Pal
Arindam Jati
Krishna Somandepalli
Shrikanth Narayanan
19
23
0
03 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
Guangke Chen
Sen Chen
Lingling Fan
Xiaoning Du
Zhe Zhao
Fu Song
Yang Liu
AAML
19
193
0
03 Nov 2019
CN-CELEB: a challenging Chinese speaker recognition dataset
Yue Fan
Jiawen Kang
Lantian Li
Keliang Li
Haolin Chen
Sitong Cheng
Pengyuan Zhang
Ziya Zhou
Yunqi Cai
Dong Wang
23
203
0
31 Oct 2019
Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal
Zhiyuan Peng
Siyuan Feng
Tan Lee
13
6
0
30 Oct 2019
Unsupervised Feature Enhancement for speaker verification
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Leibny Paola García-Perera
Najim Dehak
30
18
0
25 Oct 2019
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Najim Dehak
15
24
0
25 Oct 2019
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors
Jakub Janský
J. Málek
Jaroslav Cmejla
Tomás Kounovský
Zbyněk Koldovský
J. Zdánský
BDL
17
26
0
25 Oct 2019
Channel adversarial training for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
14
17
0
25 Oct 2019
Structural sparsification for Far-field Speaker Recognition with GNA
Jingchi Zhang
Jonathan Huang
Michael Deisher
Hai Helen Li
Yiran Chen
19
0
0
25 Oct 2019
Delving into VoxCeleb: environment invariant speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
33
50
0
24 Oct 2019
Self-supervised pre-training with acoustic configurations for replay spoofing detection
Hye-jin Shim
Hee-Soo Heo
Jee-weon Jung
Ha-Jin Yu
33
6
0
22 Oct 2019
Label-efficient audio classification through multitask learning and self-supervision
Tyler Lee
Ting Gong
Suchismita Padhy
Andrew Rouditchenko
A. Ndirango
SSL
VLM
33
7
0
19 Oct 2019
Frequency and temporal convolutional attention for text-independent speaker recognition
Sarthak Yadav
A. Rai
60
58
0
16 Oct 2019
Non-native Speaker Verification for Spoken Language Assessment
Linlin Wang
Yu Wang
Mark Gales
12
1
0
30 Sep 2019
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSL
VLM
8
6
0
24 Sep 2019
Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
12
32
0
18 Sep 2019
VAE-based Domain Adaptation for Speaker Verification
Xueyi Wang
Lantian Li
Dong Wang
22
16
0
27 Aug 2019
Unsupervised Learning of Landmarks by Descriptor Vector Exchange
James Thewlis
Samuel Albanie
Hakan Bilen
Andrea Vedaldi
SSL
12
67
0
18 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
34
205
0
16 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
20
27
0
13 Aug 2019
Personal VAD: Speaker-Conditioned Voice Activity Detection
Shaojin Ding
Quan Wang
Shuo-yiin Chang
Li Wan
Ignacio López Moreno
12
73
0
12 Aug 2019
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification
Zhiyong Chen
Zongze Ren
Shugong Xu
13
4
0
12 Aug 2019
BPPSA: Scaling Back-propagation by Parallel Scan Algorithm
Shang Wang
Yifan Bai
Gennady Pekhimenko
20
6
0
23 Jul 2019