v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

ArXiv (abs)PDF HTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown

Title
Delving into VoxCeleb: environment invariant speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun 94 51 0 24 Oct 2019
Self-supervised pre-training with acoustic configurations for replay spoofing detection Hye-jin Shim Hee-Soo Heo Jee-weon Jung Ha-Jin Yu 69 6 0 22 Oct 2019
Label-efficient audio classification through multitask learning and self-supervision Tyler Lee Ting Gong Suchismita Padhy Andrew Rouditchenko A. Ndirango SSL VLM 60 7 0 19 Oct 2019
Frequency and temporal convolutional attention for text-independent speaker recognition Sarthak Yadav A. Rai 120 58 0 16 Oct 2019
Non-native Speaker Verification for Spoken Language Assessment Linlin Wang Yu Wang Mark Gales 17 1 0 30 Sep 2019
Understanding Semantics from Speech Through Pre-training P. Wang Liangchen Wei Yong Cao Jinghui Xie Yuji Cao Zaiqing Nie SSL VLM 33 6 0 24 Sep 2019
Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals Shah Nawaz Muhammad Kamran Janjua I. Gallo Arif Mahmood Alessandro Calefati 67 33 0 18 Sep 2019
VAE-based Domain Adaptation for Speaker Verification Xueyi Wang Lantian Li Dong Wang 56 16 0 27 Aug 2019
Unsupervised Learning of Landmarks by Descriptor Vector Exchange James Thewlis Samuel Albanie Hakan Bilen Andrea Vedaldi SSL 103 68 0 18 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems M. Alam Manar D. Samad Lasitha Vidyaratne Alexander M. Glandon Khan M. Iftekharuddin 3DV VLM AI4TS 100 212 0 16 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning Pavel Denisov Ngoc Thang Vu 55 27 0 13 Aug 2019
Personal VAD: Speaker-Conditioned Voice Activity Detection Shaojin Ding Quan Wang Shuo-yiin Chang Li Wan Ignacio López Moreno 76 75 0 12 Aug 2019
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification Zhiyong Chen Zongze Ren Shugong Xu 34 4 0 12 Aug 2019
BPPSA: Scaling Back-propagation by Parallel Scan Algorithm Shang Wang Yifan Bai Gennady Pekhimenko 60 7 0 23 Jul 2019
A Deep Neural Network for Short-Segment Speaker Recognition Amirhossein Hajavi Ali Etemad 67 75 0 22 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors N. Maghsoodi Hossein Sameti Hossein Zeinali Themos Stafylakis 32 13 0 13 Jul 2019
Self-supervised Learning of Interpretable Keypoints from Unlabelled Videos Tomas Jakab Ankush Gupta Hakan Bilen Andrea Vedaldi SSL 93 9 0 03 Jul 2019
Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification Chieh-Chi Kao Ming Sun Yixin Gao S. Vitaladevuni Chao Wang 62 14 0 02 Jul 2019
Synchronising audio and ultrasound by learning cross-modal embeddings Aciel Eshky M. Ribeiro Korin Richmond Steve Renals 46 5 0 01 Jul 2019
Who said that?: Audio-visual speaker diarisation of real-world meetings Joon Son Chung Bong-Jin Lee Icksang Han 70 46 0 24 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings Shuo Liu Gil Keren Björn Schuller 40 3 0 24 Jun 2019
Self Multi-Head Attention for Speaker Recognition Miquel India Pooyan Safari Javier Hernando 76 111 0 24 Jun 2019
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration Ville Vestman Kong Aik Lee Tomi Kinnunen Takafumi Koshinaka 19 2 0 20 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification Youngmoon Jung Younggwan Kim Hyungjun Lim Yeunju Choi Hoirin Kim 66 32 0 19 Jun 2019
The Second DIHARD Diarization Challenge: Dataset, task, and baselines Neville Ryant Kenneth Church C. Cieri Alejandrina Cristià Jun Du Sriram Ganapathy M. Liberman 58 182 0 18 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition Xu Xiang Shuai Wang Houjun Huang Y. Qian Kai Yu DRL 77 145 0 18 Jun 2019
Voice Mimicry Attacks Assisted by Automatic Speaker Verification Ville Vestman Tomi Kinnunen Rosa González Hautamäki Md. Sahidullah 84 37 0 03 Jun 2019
Speaker Anonymization Using X-vector and Neural Waveform Models Fuming Fang Xin Wang Junichi Yamagishi Isao Echizen Massimiliano Todisco Nicholas W. D. Evans J. Bonastre 65 135 0 30 May 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks Xiaoqi Jia Jianwei Tai Hang Zhou Yakai Li Weijuan Zhang Haichao Du Qingjia Huang GAN 34 6 0 27 May 2019
Speech2Face: Learning the Face Behind a Voice Tae-Hyun Oh Tali Dekel Changil Kim Inbar Mosseri William T. Freeman Michael Rubinstein Wojciech Matusik SSL CVBM 112 164 0 23 May 2019
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models Egor Zakharov Aliaksandra Shysheya Egor Burkov Victor Lempitsky 3DH 178 631 0 20 May 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss Kaizhi Qian Yang Zhang Shiyu Chang Xuesong Yang M. Hasegawa-Johnson 135 471 0 14 May 2019
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss Lele Chen R. Maddox Z. Duan Chenliang Xu CVBM 98 400 0 09 May 2019
Meeting Transcription Using Virtual Microphone Arrays Takuya Yoshioka Zhuo Chen Dimitrios Dimitriadis William Fu-Hinthorn Xuedong Huang A. Stolcke Michael Zeng 76 15 0 03 May 2019
Few Shot Speaker Recognition using Deep Neural Networks Prashant Anand A. Singh Siddharth Srivastava Brejesh Lall 65 40 0 17 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification Jee-weon Jung Hee-Soo Heo Ju-ho Kim Hye-jin Shim Ha-Jin Yu 87 142 0 17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Gene-Ping Yang Chao-I Tuan Hung-yi Lee Lin-Shan Lee 61 25 0 16 Apr 2019
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences Kong Aik Lee Ville Hautamaki Tomi Kinnunen Hitoshi Yamamoto K. Okabe ... Chng Eng Siong Shivesh Ranjan John H. L. Hansen Massimiliano Todisco Nicholas W. D. Evans BDL 49 21 0 16 Apr 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection Massimiliano Todisco Xin Wang Ville Vestman Md. Sahidullah Héctor Delgado A. Nautsch Junichi Yamagishi Nicholas W. D. Evans Tomi Kinnunen Kong Aik Lee 99 617 0 09 Apr 2019
VAE-based regularization for deep speaker embedding Yang Zhang Lantian Li Dong Wang DRL BDL 46 19 0 07 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation Suwon Shon Najim Dehak D. Reynolds James R. Glass 38 26 0 07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification Suwon Shon Hao Tang James R. Glass VLM 73 88 0 07 Apr 2019
Self-supervised speaker embeddings Themos Stafylakis Johan Rohdin Oldrich Plchot Petr Mizera L. Burget SSL 50 48 0 06 Apr 2019
Large Margin Softmax Loss for Speaker Verification Yi Y. Liu Liang He Jia-Wei Liu 68 145 0 06 Apr 2019
ICface: Interpretable and Controllable Face Reenactment Using GANs S. Tripathy Arno Solin Esa Rahtu CVBM 66 90 0 03 Apr 2019
Multi-Task Learning with High-Order Statistics for X-vector based Text-Independent Speaker Verification Lanhua You Wu Guo Lirong Dai Jun Du 44 12 0 28 Mar 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks A. Duarte Francisco Roldan Miquel Tubau Janna Escur Santiago Pascual Amaia Salvador Eva Mohedano Kevin McGuinness Jordi Torres Xavier Giró-i-Nieto GAN CVBM 71 79 0 25 Mar 2019
The VOiCES from a Distance Challenge 2019 Evaluation Plan Mahesh Kumar Nandwana Julien van Hout Mitchell McLaren Colleen Richey A. Lawson M. Barrios 60 92 0 27 Feb 2019
Utterance-level Aggregation For Speaker Recognition In The Wild Weidi Xie Arsha Nagrani Joon Son Chung Andrew Zisserman 74 344 0 26 Feb 2019
End-to-end losses based on speaker basis vectors and all-speaker hard negative mining for speaker verification Hee-Soo Heo Jee-weon Jung Il-Ho Yang Sung-Hyun Yoon Hye-jin Shim Ha-Jin Yu 85 22 0 07 Feb 2019