VoxCeleb: a large-scale speaker identification dataset

26 June 2017

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown

Title
Speaker De-identification System using Autoencoders and Adversarial Training Fernando M. Espinoza-Cuadros Juan M. Perero-Codosero Javier Antón-Martín L. A. H. Gómez AAML 43 14 0 09 Nov 2020
FRILL: A Non-Semantic Speech Embedding for Mobile Devices J. Peplinski Joel Shor Sachin P. Joglekar Jake Garrison Shwetak N. Patel 68 24 0 09 Nov 2020
Masked Proxy Loss For Text-Independent Speaker Verification Jiachen Lian A. V. Kumar Hira Dhamyal Bhiksha Raj Rita Singh 64 2 0 09 Nov 2020
Non-local convolutional neural networks (nlcnn) for speaker recognition Haici Yang Hongda Mao Ruirui Li C. Ju Oguz H. Elibol 65 0 0 07 Nov 2020
Large-scale multilingual audio visual dubbing Yi Yang Brendan Shillingford Yannis Assael Miaosen Wang Wendi Liu ... Eren Sezener Luis C. Cobo Misha Denil Y. Aytar Nando de Freitas 70 21 0 06 Nov 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription Xiaofei Wang Naoyuki Kanda Yashesh Gaur Zhuo Chen Zhong Meng Takuya Yoshioka 64 13 0 05 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization Desh Raj Zili Huang Sanjeev Khudanpur 105 31 0 05 Nov 2020
Paralinguistic Privacy Protection at the Edge Ranya Aloufi Hamed Haddadi David E. Boyle 64 14 0 04 Nov 2020
Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020 Yu Cheng Chun-Liang Shih Tien-Hong Lo Wen-Ting Tseng Berlin Chen 22 0 0 04 Nov 2020
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Naoyuki Kanda Zhong Meng Liang Lu Yashesh Gaur Xiaofei Wang Zhuo Chen Takuya Yoshioka 71 17 0 03 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis Desh Raj Pavel Denisov Zhuo Chen Hakan Erdogan Zili Huang ... Yi Luo Naoyuki Kanda Jinyu Li Scott Wisdom J. Hershey 63 88 0 03 Nov 2020
ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020 Shen Chen DRL 37 1 0 03 Nov 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020 Xu Xiang 49 14 0 31 Oct 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training Yunqi Cai Lantian Li Dong Wang Andrew Abel 116 6 0 30 Oct 2020
Deep generative LDA Yunqi Cai Dong Wang 66 1 0 30 Oct 2020
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews Rachid Riad Hadrien Titeux Laurie Lemoine Justine Montillot A. Sliwinski J. Bagnou Xuan-Nga Cao Anne-Catherine Bachoud-Lévi Emmanuel Dupoux 35 0 0 30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020 Yoohwan Kwon Hee-Soo Heo Bong-Jin Lee Joon Son Chung 121 61 0 29 Oct 2020
Playing a Part: Speaker Verification at the Movies A. Brown Jaesung Huh Arsha Nagrani Joon Son Chung Andrew Zisserman 73 23 0 29 Oct 2020
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model Yanpei Shi Mingjie Chen Qiang Huang Thomas Hain 48 5 0 29 Oct 2020
Generative Adversarial Networks in Human Emotion Synthesis: A Review Noushin Hajarolasvadi M. A. Ramírez H. Demirel GAN 69 22 0 28 Oct 2020
Leveraging speaker attribute information using multi task learning for speaker verification and diarization Chau Luu P. Bell Steve Renals 54 9 0 27 Oct 2020
Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification Lantian Li Yang Zhang Jiawen Kang Tianshi Zheng Dong Wang 43 5 0 27 Oct 2020
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus Mike Wu J. Nafziger A. Scodary Andrew L. Maas 91 17 0 26 Oct 2020
Speaker Anonymization with Distribution-Preserving X-Vector Generation for the VoicePrivacy Challenge 2020 H.C.M. Turner Giulio Lovisotto Ivan Martinovic 73 21 0 26 Oct 2020
An iterative framework for self-supervised deep speaker representation learning Danwei Cai Weiqing Wang Ming Li SSL 67 37 0 25 Oct 2020
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding Ge Zhu Fei Jiang Z. Duan 91 25 0 24 Oct 2020
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description Jenthe Thienpondt Brecht Desplanques Kris Demuynck 68 49 0 23 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers Zeqian Li Jacob Whitehill 140 11 0 22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge Renyu Wang Ruilin Tong Y. Yeung Xiao Chen 23 1 0 22 Oct 2020
Graph Attention Networks for Speaker Verification Jee-weon Jung Hee-Soo Heo Ha-Jin Yu Joon Son Chung 93 27 0 22 Oct 2020
Momentum Contrast Speaker Representation Learning Jangho Lee Jaihyun Koh Sungroh Yoon SSL 62 3 0 22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning Sung Hwan Mun Woohyun Kang Min Hyun Han N. Kim SSL 90 21 0 22 Oct 2020
Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020 Sung Hwan Mun Woohyun Kang Min Hyun Han N. Kim 36 2 0 22 Oct 2020
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification Jenthe Thienpondt Brecht Desplanques Kris Demuynck 82 84 0 21 Oct 2020
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20) Muhammad Umair Ahmed Khan Javier Hernando DRL 42 3 0 21 Oct 2020
Multi-task Metric Learning for Text-independent Speaker Verification Yafeng Chen Wu Guo Jing Shi Jiajun Qi Tan Liu 335 0 0 21 Oct 2020
Contrastive Learning of General-Purpose Audio Representations Aaqib Saeed David Grangier Neil Zeghidour VLM SSL 91 272 0 21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge 2020 Shufan Shen Ran Miao Yi Wang Zhihua Wei 28 0 0 20 Oct 2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020 Rui Wang Zhihua Wei Yibin Zhan Zhuoxiao Chen 24 0 0 16 Oct 2020
Viewmaker Networks: Learning Views for Unsupervised Representation Learning Alex Tamkin Mike Wu Noah D. Goodman SSL 131 64 0 14 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation Rohan Kumar Das Ruijie Tao Jichen Yang Wei Rao Cheng Yu Haizhou Li 49 11 0 08 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments Youngmoon Jung Yeunju Choi Hyungjun Lim Hoirin Kim 65 13 0 06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020 Hee-Soo Heo Bong-Jin Lee Jaesung Huh Joon Son Chung 60 134 0 29 Sep 2020
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning Tedd Kourkounakis Amirhossein Hajavi Ali Etemad 56 23 0 23 Sep 2020
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias Mufan Sang Wei Xia John H. L. Hansen 73 18 0 21 Sep 2020
Online Speaker Diarization with Relation Network Xiang Li Yucheng Zhao Chong Luo Wenjun Zeng 44 2 0 17 Sep 2020
When Automatic Voice Disguise Meets Automatic Speaker Verification Linlin Zheng Jiakang Li Meng Sun Xiongwei Zhang Tianshi Zheng 57 19 0 15 Sep 2020
Utterance Clustering Using Stereo Audio Channels Yingjun Dong Neil G. MacLaren Yiding Cao F. Yammarino Shelley D. Dionne M. Mumford S. Connelly Hiroki Sayama G. Ruark 23 0 0 10 Sep 2020
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling Songxiang Liu Yuewen Cao Disong Wang Xixin Wu Xunying Liu Helen Meng BDL 116 92 0 06 Sep 2020
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification Zhenyu Wang Wei Xia John H. L. Hansen 45 12 0 05 Sep 2020