VoxCeleb: a large-scale speaker identification dataset

26 June 2017

Arsha Nagrani

Joon Son Chung

Andrew Zisserman

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown

Title
Active Speakers in Context Juan Carlos León Alcázar Fabian Caba Heilbron Long Mai Federico Perazzi Joon-Young Lee Pablo Arbelaez Guohao Li 72 62 0 20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network Tingle Li Qingjian Lin Yuanyuan Bao Ming Li 39 38 0 19 May 2020
Defending Your Voice: Adversarial Attack on Voice Conversion Chien-yu Huang Yist Y. Lin Hung-yi Lee Lin-Shan Lee AAML 87 52 0 18 May 2020
Metric Learning for Keyword Spotting Jaesung Huh Minjae Lee Hee-Soo Heo Seongkyu Mun Joon Son Chung 64 23 0 18 May 2020
End-to-End Lip Synchronisation Based on Pattern Classification You Jin Kim Hee-Soo Heo Soo-Whan Chung Bong-Jin Lee CVBM 40 0 0 18 May 2020
Design Choices for X-vector Based Speaker Anonymization B. M. L. Srivastava N. Tomashenko Xin Wang Emmanuel Vincent Junichi Yamagishi Mohamed Maouche A. Bellet Marc Tommasi 60 63 0 18 May 2020
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild P. S. Nidadavolu Saurabh Kataria Leibny Paola García-Perera Jesús Villalba Najim Dehak 28 3 0 17 May 2020
AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition Afroz Ahamad Ankit Anand Pranesh Bhargava 36 23 0 16 May 2020
Speaker Re-identification with Speaker Dependent Speech Enhancement Yanpei Shi Qiang Huang Thomas Hain 43 4 0 15 May 2020
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification Yanpei Shi Qiang Huang Thomas Hain 51 2 0 15 May 2020
ConVoice: Real-Time Zero-Shot Voice Style Transfer with Convolutional Network Yurii Rebryk Stanislav Beliaev 62 8 0 15 May 2020
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification Brecht Desplanques Jenthe Thienpondt Kris Demuynck 90 1,349 0 14 May 2020
FaR-GAN for One-Shot Face Reenactment Hanxiang Hao Sriram Baireddy A. Reibman Edward J. Delp 3DH CVBM 62 10 0 13 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint Zexin Cai Chuxiong Zhang Ming Li 73 42 0 10 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms Seung-bin Kim Jee-weon Jung Hye-jin Shim Ju-ho Kim Ha-Jin Yu 34 5 0 07 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition Shaojin Ding Tianlong Chen Xinyu Gong Weiwei Zha Zhangyang Wang 72 57 0 07 May 2020
What comprises a good talking-head video generation?: A Survey and Benchmark Lele Chen Guofeng Cui Ziyi Kou Haitian Zheng Chenliang Xu EGVM 54 59 0 07 May 2020
Introducing the VoicePrivacy Initiative N. Tomashenko B. M. L. Srivastava Xin Wang Emmanuel Vincent A. Nautsch ... Nicholas W. D. Evans J. Patino J. Bonastre Paul-Gauthier Noé Massimiliano Todisco 123 132 0 04 May 2020
VGGSound: A Large-scale Audio-Visual Dataset Honglie Chen Weidi Xie Andrea Vedaldi Andrew Zisserman 110 583 0 29 Apr 2020
Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision Soo-Whan Chung Hong-Goo Kang Joon Son Chung SSL 48 39 0 29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective M. S. Saeed Shah Nawaz Pietro Morerio Arif Mahmood I. Gallo Muhammad Haroon Yousaf Alessio Del Bue CVBM 84 26 0 28 Apr 2020
Neural Head Reenactment with Latent Pose Descriptors Egor Burkov I. Pasechnik Artur Grigorev Victor Lempitsky 3DH 122 131 0 24 Apr 2020
Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release Yaowei Han Sheng Li Yang Cao Qiang Ma Masatoshi Yoshikawa 58 45 0 16 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech Hyeong-Seok Choi Changdae Park Kyogu Lee CVBM 48 29 0 13 Apr 2020
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification Xu Li Jinghua Zhong Jianwei Yu Shoukang Hu Xixin Wu Xunying Liu Helen Meng BDL 67 12 0 08 Apr 2020
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech N. Wilkinson A. Biswas Emre Yilmaz Febe de Wet Ewald van der Westhuizen T. Niesler 75 11 0 08 Apr 2020
Motion-supervised Co-Part Segmentation Aliaksandr Siarohin Subhankar Roy Stéphane Lathuilière Sergey Tulyakov Elisa Ricci N. Sebe SSL 48 35 0 07 Apr 2020
Deep Normalization for Speaker Vectors Yunqi Cai Lantian Li Dong Wang Andrew Abel 87 25 0 07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances Youngmoon Jung Seong Min Kye Yeunju Choi Myunghun Jung Hoirin Kim 77 37 0 07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs Seong Min Kye Youngmoon Jung Haebeom Lee Sung Ju Hwang Hoirin Kim 124 51 0 06 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion Mayank Tripathi Divyanshu Singh Seba Susan 28 8 0 05 Apr 2020
Neural i-vectors Ville Vestman Kong Aik Lee Tomi Kinnunen DRL 59 4 0 03 Apr 2020
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection Tharindu Fernando Sridha Sridharan Mitchell McLaren Darshana Priyasad Simon Denman Clinton Fookes 41 5 0 02 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms Jee-weon Jung Seung-bin Kim Hye-jin Shim Ju-ho Kim Ha-Jin Yu 77 60 0 01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition João Antônio Chagas Nunes David Macêdo Cleber Zanchettin 64 23 0 31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification Juan Manuel Coria H. Bredin Sahar Ghannay S. Rosset 50 15 0 31 Mar 2020
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose Xianfang Zeng Yusu Pan Mengmeng Wang Jiangning Zhang Yong Liu CVBM 147 42 0 29 Mar 2020
Learning Inverse Rendering of Faces from Real-world Videos Yuda Qiu Zhangyang Xiong Kai Han Zhongyuan Wang Zixiang Xiong Xiaoguang Han CVBM 3DH 32 2 0 26 Mar 2020
In defence of metric learning for speaker recognition Joon Son Chung Jaesung Huh Seongkyu Mun Minjae Lee Hee-Soo Heo Soyeon Choe Chiheon Ham Sung-Ye Jung Bong-Jin Lee Icksang Han 77 438 0 26 Mar 2020
Improving Embedding Extraction for Speaker Verification with Ladder Network Fei Tao Gokhan Tur 24 3 0 20 Mar 2020
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data Vincent Roger Jérôme Farinas J. Pinquier 57 24 0 09 Mar 2020
Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments I. Vélez C. Rascón Gibran Fuentes Pineda 84 10 0 06 Mar 2020
First Order Motion Model for Image Animation Aliaksandr Siarohin Stéphane Lathuilière Sergey Tulyakov Elisa Ricci N. Sebe VGen DiffM 184 942 0 29 Feb 2020
Bio-Inspired Modality Fusion for Active Speaker Detection Gustavo Assunção Nuno Gonçalves Paulo Menezes 19 3 0 28 Feb 2020
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models Edresson Casanova Arnaldo Cândido Júnior C. Shulby F. S. Oliveira L. Gris Hamilton Pereira da Silva S. Aluísio M. Ponti 16 2 0 25 Feb 2020
Towards Learning a Universal Non-Semantic Representation of Speech Joel Shor A. Jansen Ronnie Maor Oran Lang Omry Tuval Félix de Chaumont Quitry Marco Tagliasacchi Ira Shavitt Dotan Emanuel Yinnon A. Haviv SSL 158 160 0 25 Feb 2020
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose Ran Yi Zipeng Ye Juyong Zhang Hujun Bao Yong Liu CVBM 113 123 0 24 Feb 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team Qingjian Lin Weicheng Cai Lin Yang Junjie Wang J. Zhang Ming Li VLM 59 18 0 23 Feb 2020
An end-to-end approach for the verification problem: learning the right distance João Monteiro Isabela Albuquerque Md. Jahangir Alam R. Devon Hjelm T. Falk 45 6 0 21 Feb 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision Arsha Nagrani Joon Son Chung Samuel Albanie Andrew Zisserman SSL 92 88 0 20 Feb 2020