VoxCeleb: a large-scale speaker identification dataset

26 June 2017

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown

Title
Speaker Diarization with Region Proposal Network Zili Huang Shinji Watanabe Yusuke Fujita Leibny Paola García-Perera Yiwen Shao Daniel Povey Sanjeev Khudanpur 59 60 0 14 Feb 2020
Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances Aleksei Gusev V. Volokhov Tseren Andzhukaev Sergey Novoselov G. Lavrentyeva ... Anastasia Avdeeva Artem Ivanov Alexander Kozlov Timur Pekhovsky Yuri N. Matveev 59 48 0 14 Feb 2020
Self-supervised learning for audio-visual speaker diarization Yifan Ding Yong-mei Xu Shi-Xiong Zhang Yahuan Cong Liqiang Wang VLM 78 29 0 13 Feb 2020
AlignNet: A Unifying Approach to Audio-Visual Alignment Jianren Wang Zhaoyuan Fang Hang Zhao 57 37 0 12 Feb 2020
NPLDA: A Deep Neural PLDA Model for Speaker Verification Shreyas Ramoji Prashant Krishnan Sriram Ganapathy 47 31 0 10 Feb 2020
An empirical analysis of information encoded in disentangled neural speaker representations Raghuveer Peri Haoqi Li Krishna Somandepalli Arindam Jati Shrikanth Narayanan DRL 75 14 0 10 Feb 2020
$M^3$T: Multi-Modal Continuous Valence-Arousal Estimation in the Wild Yuanhang Zhang Rulin Huang Jiabei Zeng Shiguang Shan Xilin Chen CVBM 70 27 0 07 Feb 2020
LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis Shreyas Ramoji Prashant Krishnan Bhargavram Mysore Prachi Singh Sriram Ganapathy 51 2 0 07 Feb 2020
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning Anssi Kanervisto Ville Hautamaki Tomi Kinnunen Junichi Yamagishi 28 2 0 06 Feb 2020
Within-sample variability-invariant loss for robust speaker recognition under noisy environments Danwei Cai Weicheng Cai Ming Li 64 47 0 03 Feb 2020
DropClass and DropAdapt: Dropping classes for deep speaker representation learning Chau Luu P. Bell Steve Renals VLM 65 3 0 02 Feb 2020
Analysis of Deep Feature Loss based Enhancement for Speaker Verification Saurabh Kataria P. S. Nidadavolu Jesús Villalba Najim Dehak 103 13 0 01 Feb 2020
MCSAE: Masked Cross Self-Attentive Encoding for Speaker Embedding Soonshin Seo Ji-Hwan Kim 24 0 0 28 Jan 2020
Pairwise Discriminative Neural PLDA for Speaker Verification Shreyas Ramoji Prashant Krishnan Prachi Singh Sriram Ganapathy 48 7 0 20 Jan 2020
Everybody's Talkin': Let Me Talk as You Want Linsen Song Wayne Wu Chao Qian Ran He Chen Change Loy DiffM VGen 79 147 0 15 Jan 2020
Robust Speaker Recognition Using Speech Enhancement And Attention Model Yanpei Shi Qiang Huang Thomas Hain 86 26 0 14 Jan 2020
Deep Audio-Visual Learning: A Survey Hao Zhu Mandi Luo Rui Wang A. Zheng Ran He 75 161 0 14 Jan 2020
Gaussian speaker embedding learning for text-independent speaker verification Bin Gu Wu Guo BDL 57 1 0 14 Jan 2020
On the Resilience of Biometric Authentication Systems against Random Inputs Benjamin Zi Hao Zhao Hassan Jameel Asghar M. Kâafar AAML 133 23 0 13 Jan 2020
Learning Speaker Embedding with Momentum Contrast Ke Ding Xuanji He Guanglu Wan SSL 108 10 0 07 Jan 2020
Destruction of Image Steganography using Generative Adversarial Networks Isaac Corley Jonathan Lwowski Justin Hoffman AAML 42 13 0 20 Dec 2019
Large-scale Multi-modal Person Identification in Real Unconstrained Environments Jiajie Ye Y. Guan Junfa Liu Xinghong Huang Kuanqi Cai 28 1 0 17 Dec 2019
Speech-driven facial animation using polynomial fusion of features Triantafyllos Kefalas Konstantinos Vougioukas Yannis Panagakis Stavros Petridis Jean Kossaifi Maja Pantic 24 6 0 12 Dec 2019
Advances in Online Audio-Visual Meeting Transcription Takuya Yoshioka Igor Abramovski Cem Aksoylar Zhuo Chen Moshe David ... Huaming Wang Zhenghao Wang Jun Zhang Yong Zhao Tianyan Zhou 95 75 0 10 Dec 2019
A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database Hossein Zeinali L. Burget J. Černocký 39 40 0 08 Dec 2019
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge Joon Son Chung Arsha Nagrani Ernesto Coto Weidi Xie Mitchell McLaren D. Reynolds Andrew Zisserman 81 60 0 05 Dec 2019
Smartphone Multi-modal Biometric Authentication: Database and Evaluation Raghavendra Ramachandra Martin Stokkenes A. Mohammadi S. Venkatesh Kiran Raja Pankaj Wasnik Eric Poiret Sébastien Marcel Christoph Busch CVBM 50 18 0 05 Dec 2019
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines Xiaoyi Qin Hui Bu Ming Li 112 69 0 03 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019 Leibny Paola García-Perera Jesus Villalba H. Bredin Jun Du Diego Castán ... Wassim Bouaziz Hadrien Titeux Emmanuel Dupoux Kong Aik Lee Najim Dehak 43 30 0 02 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey Shervin Minaee AmirAli Abdolrashidi Hang Su Bennamoun David C. Zhang 113 85 0 30 Nov 2019
SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation Jianwei Tai Xiaoqi Jia Qingjia Huang Weijuan Zhang Haichao Du Shengzhi Zhang 43 1 0 27 Nov 2019
Self-Enhanced Convolutional Network for Facial Video Hallucination Chaowei Fang Guanbin Li Xiaoguang Han Yizhou Yu SupR 65 9 0 23 Nov 2019
Voice-Face Cross-modal Matching and Retrieval: A Benchmark Chuyu Xiong Deyuan Zhang Tao Liu Xiaoyong Du CVBM 48 9 0 21 Nov 2019
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets S. Ha Martin Kersner Beomsu Kim Seokjun Seo Dongyoung Kim CVBM 121 167 0 19 Nov 2019
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification Zhongxin Bai Xiao-Lei Zhang Jingdong Chen 77 29 0 19 Nov 2019
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System Shuo Liu Gil Keren Björn Schuller 70 4 0 16 Nov 2019
Deep learning methods in speaker recognition: a review Dávid Sztahó György Szaszák A. Beke VLM 66 46 0 14 Nov 2019
Adversarial Attacks on GMM i-vector based Speaker Verification Systems Xu Li Jinghua Zhong Xixin Wu Jianwei Yu Xunying Liu Helen Meng AAML 74 79 0 08 Nov 2019
The sound of my voice: speaker representation loss for target voice separation Seongkyu Mun Soyeon Choe Jaesung Huh Joon Son Chung 57 16 0 06 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Xin Wang Junichi Yamagishi Massimiliano Todisco Héctor Delgado A. Nautsch ... J. Bonastre Avashna Govender S. Ronanki Jing-Xuan Zhang Zhenhua Ling 83 12 0 05 Nov 2019
Voice Biometrics Security: Extrapolating False Alarm Rate via Hierarchical Bayesian Modeling of Speaker Verification Scores A. Sholokhov Tomi Kinnunen Ville Vestman Kong Aik Lee 124 12 0 04 Nov 2019
Robust speaker recognition using unsupervised adversarial invariance Raghuveer Peri Monisankha Pal Arindam Jati Krishna Somandepalli Shrikanth Narayanan 56 24 0 03 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems Guangke Chen Sen Chen Lingling Fan Xiaoning Du Zhe Zhao Fu Song Yang Liu AAML 114 197 0 03 Nov 2019
CN-CELEB: a challenging Chinese speaker recognition dataset Yue Fan Jiawen Kang Lantian Li Keliang Li Haolin Chen Sitong Cheng Pengyuan Zhang Ziya Zhou Yunqi Cai Dong Wang 97 206 0 31 Oct 2019
Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal Zhiyuan Peng Siyuan Feng Tan Lee 47 6 0 30 Oct 2019
Unsupervised Feature Enhancement for speaker verification P. S. Nidadavolu Saurabh Kataria Jesús Villalba Leibny Paola García-Perera Najim Dehak 69 18 0 25 Oct 2019
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs P. S. Nidadavolu Saurabh Kataria Jesús Villalba Najim Dehak 62 24 0 25 Oct 2019
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors Jakub Janský J. Málek Jaroslav Cmejla Tomás Kounovský Zbyněk Koldovský J. Zdánský BDL 50 27 0 25 Oct 2019
Channel adversarial training for speaker verification and diarization Chau Luu P. Bell Steve Renals 72 17 0 25 Oct 2019
Structural sparsification for Far-field Speaker Recognition with GNA Jingchi Zhang Jonathan Huang Michael Deisher Hai Helen Li Yiran Chen 31 0 0 25 Oct 2019