v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

ArXiv (abs)PDF HTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown

Title
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning Amirhossein Hajavi Ali Etemad 57 2 0 03 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations Wei Xia John H. L. Hansen 57 9 0 02 Sep 2020
Few Shot Text-Independent speaker verification using 3D-CNN Prateek Mishra 65 5 0 25 Aug 2020
asya: Mindful verbal communication using deep learning Ē. Urtāns Ariel Tabaks VLM 111 1 0 20 Aug 2020
Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks Guangming Yao Yi Yuan Tianjia Shao Kun Zhou 3DH CVBM 72 56 0 18 Aug 2020
Adversarial Attack and Defense Strategies for Deep Speaker Recognition Systems Arindam Jati Chin-Cheng Hsu Monisankha Pal Raghuveer Peri Wael AbdAlmageed Shrikanth Narayanan AAML 79 67 0 18 Aug 2020
Cross attentive pooling for speaker verification Seong Min Kye Yoohwan Kwon Joon Son Chung 88 9 0 13 Aug 2020
Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe submission to NIST SRE Challenge 2019 G. Antipov N. Gengembre Olivier Le Blouch Gaël Le Lan CVBM 37 3 0 13 Aug 2020
Compact Speaker Embedding: lrx-vector Munir Georges Jonathan Huang Tobias Bocklet 37 11 0 11 Aug 2020
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer Encoder Narla John Metilda Sagaya Mary S. Umesh Sandesh V Katta 51 31 0 11 Aug 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Naoyuki Kanda Xuankai Chang Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 74 49 0 11 Aug 2020
Neural PLDA Modeling for End-to-End Speaker Verification Shreyas Ramoji Prashant Krishnan Sriram Ganapathy 35 6 0 11 Aug 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss Umut Isik Ritwik Giri Neerad Phansalkar J. Valin Karim Helwani A. Krishnaswamy 72 84 0 11 Aug 2020
Self-Supervised Learning of Audio-Visual Objects from Video Triantafyllos Afouras Andrew Owens Joon Son Chung Andrew Zisserman SSL 126 256 0 10 Aug 2020
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification Vijay Ravi Ruchao Fan Amber Afshan Huanhua Lu Abeer Alwan 55 9 0 08 Aug 2020
Extrapolating false alarm rates in automatic speaker verification A. Sholokhov Tomi Kinnunen Ville Vestman Kong Aik Lee 42 1 0 08 Aug 2020
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning Mathieu Seurin Florian Strub Philippe Preux Olivier Pietquin 49 5 0 07 Aug 2020
Disentangled speaker and nuisance attribute embedding for robust speaker verification Woohyun Kang Sung Hwan Mun Min Hyun Han N. Kim 67 17 0 07 Aug 2020
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions Santi Prieto A. O. Giménez Iván López-Espejo EDUARDO LLEIDA SOLANO 20 1 0 06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning Jing-Xuan Zhang Zhenhua Ling Lirong Dai 81 6 0 05 Aug 2020
Intra-class variation reduction of speaker representation in disentanglement framework Yoohwan Kwon Soo-Whan Chung Hong-Goo Kang DRL 114 21 0 04 Aug 2020
Self-attention encoding and pooling for speaker recognition Pooyan Safari Miquel India Javier Hernando ViT 87 81 0 03 Aug 2020
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings Xuechen Liu Md. Sahidullah Tomi Kinnunen 42 9 0 30 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations Ranya Aloufi Hamed Haddadi David E. Boyle DRL 130 58 0 29 Jul 2020
Families In Wild Multimedia: A Multimodal Database for Recognizing Kinship Joseph P. Robinson Zaid Khan Yu Yin Ming Shao Y. Fu 56 7 0 28 Jul 2020
Detecting and analysing spontaneous oral cancer speech in the wild B. Halpern Rob van Son M. V. D. Brekel O. Scharenborg 38 9 0 28 Jul 2020
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System Soonshin Seo Ji-Hwan Kim 53 0 0 27 Jul 2020
Double Multi-Head Attention for Speaker Verification Miquel India Pooyan Safari Javier Hernando 72 18 0 26 Jul 2020
UIAI System for Short-Duration Speaker Verification Challenge 2020 Md. Sahidullah A. K. Sarkar Ville Vestman Xuechen Liu Romain Serizel Tomi Kinnunen Zheng-Hua Tan Emmanuel Vincent 51 4 0 26 Jul 2020
Augmentation adversarial training for self-supervised speaker recognition Jaesung Huh Hee-Soo Heo Jingu Kang Shinji Watanabe Joon Son Chung SSL 126 76 0 23 Jul 2020
Optimization of data-driven filterbank for automatic speaker verification S. K. Sarangi Md. Sahidullah G. Saha 42 38 0 21 Jul 2020
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping Vinay Kothapally Wei Xia Shahram Ghorbani John H. L. Hansen Wei Xue Jing-ling Huang 67 25 0 17 Jul 2020
Deep multi-metric learning for text-independent speaker verification Jiwei Xu Xinggang Wang Bin Feng Wenyu Liu 96 26 0 17 Jul 2020
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score Normalization Jenthe Thienpondt Brecht Desplanques Kris Demuynck 67 24 0 15 Jul 2020
NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling Shareef Babu Kalluri Deepu Vijayasenan Sriram Ganapathy M. RageshRajan Prashant Krishnan 44 18 0 12 Jul 2020
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals Tomi Kinnunen Héctor Delgado Nicholas W. D. Evans Kong Aik Lee Ville Vestman ... Massimiliano Todisco Xin Wang Md. Sahidullah Junichi Yamagishi D. Reynolds 55 113 0 12 Jul 2020
Federated Learning of User Authentication Models H. Hosseini Sungrack Yun Hyunsin Park Christos Louizos Joseph B. Soriaga Max Welling FedML 56 13 0 09 Jul 2020
X-vectors: New Quantitative Biomarkers for Early Parkinson's Disease Detection from Speech Laetitia Jeancolas Dijana Petrovska – Delacretaz G. Mangone B. Benkelfat J. Corvol M. Vidailhet Stéphane Lehéricy Habib Benali 45 46 0 07 Jul 2020
ResNeXt and Res2Net Structures for Speaker Verification Tianyan Zhou Yong Zhao Jian Wu 58 27 0 06 Jul 2020
Spot the conversation: speaker diarisation in the wild Joon Son Chung Jaesung Huh Arsha Nagrani Triantafyllos Afouras Andrew Zisserman VGen 103 150 0 02 Jul 2020
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity Jordan J. Bird Diego Resende Faria Anikó Ekárt C. Premebida Pedro P. S. Ayrosa 34 5 0 01 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction Jing Shi Jiaming Xu Yusuke Fujita Shinji Watanabe Bo Xu BDL 70 21 0 25 Jun 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Tianyan Zhou Takuya Yoshioka 76 78 0 19 Jun 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 147 9 0 14 Jun 2020
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification Xu Li Na Li Jinghua Zhong Xixin Wu Xunying Liu Jane Polak Scowcroft Dong Yu Helen Meng AAML 93 37 0 11 Jun 2020
Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging Yeqi Bai Tao Ma Lipo Wang Zhenjie Zhang CVBM 27 9 0 10 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox Baihan Lin Xinxin Zhang 98 16 0 08 Jun 2020
Semi-Supervised Contrastive Learning with Generalized Contrastive Loss and Its Application to Speaker Recognition Nakamasa Inoue Keita Goto SSL 144 54 0 08 Jun 2020
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data Mael Fabien Seyyed Saeed Sarfjoo P. Motlícek S. Madikeri 19 3 0 03 Jun 2020
Crossed-Time Delay Neural Network for Speaker Recognition Liang Chen Yanchun Liang Xiaohu Shi You Zhou Chunguo Wu 27 3 0 31 May 2020