1706.08612
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,111 papers shown
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
72
62
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
39
38
0
19 May 2020
Defending Your Voice: Adversarial Attack on Voice Conversion
Chien-yu Huang
Yist Y. Lin
Hung-yi Lee
Lin-Shan Lee
AAML
87
52
0
18 May 2020
Metric Learning for Keyword Spotting
Jaesung Huh
Minjae Lee
Hee-Soo Heo
Seongkyu Mun
Joon Son Chung
64
23
0
18 May 2020
End-to-End Lip Synchronisation Based on Pattern Classification
You Jin Kim
Hee-Soo Heo
Soo-Whan Chung
Bong-Jin Lee
CVBM
40
0
0
18 May 2020
Design Choices for X-vector Based Speaker Anonymization
B. M. L. Srivastava
N. Tomashenko
Xin Wang
Emmanuel Vincent
Junichi Yamagishi
Mohamed Maouche
A. Bellet
Marc Tommasi
60
63
0
18 May 2020
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild
P. S. Nidadavolu
Saurabh Kataria
Leibny Paola García-Perera
Jesús Villalba
Najim Dehak
28
3
0
17 May 2020
AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
Afroz Ahamad
Ankit Anand
Pranesh Bhargava
36
23
0
16 May 2020
Speaker Re-identification with Speaker Dependent Speech Enhancement
Yanpei Shi
Qiang Huang
Thomas Hain
43
4
0
15 May 2020
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification
Yanpei Shi
Qiang Huang
Thomas Hain
51
2
0
15 May 2020
ConVoice: Real-Time Zero-Shot Voice Style Transfer with Convolutional Network
Yurii Rebryk
Stanislav Beliaev
62
8
0
15 May 2020
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques
Jenthe Thienpondt
Kris Demuynck
90
1,349
0
14 May 2020
FaR-GAN for One-Shot Face Reenactment
Hanxiang Hao
Sriram Baireddy
A. Reibman
Edward J. Delp
3DH
CVBM
62
10
0
13 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint
Zexin Cai
Chuxiong Zhang
Ming Li
73
42
0
10 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
34
5
0
07 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding
Tianlong Chen
Xinyu Gong
Weiwei Zha
Zhangyang Wang
72
57
0
07 May 2020
What comprises a good talking-head video generation?: A Survey and Benchmark
Lele Chen
Guofeng Cui
Ziyi Kou
Haitian Zheng
Chenliang Xu
EGVM
54
59
0
07 May 2020
Introducing the VoicePrivacy Initiative
N. Tomashenko
B. M. L. Srivastava
Xin Wang
Emmanuel Vincent
A. Nautsch
...
Nicholas W. D. Evans
J. Patino
J. Bonastre
Paul-Gauthier Noé
Massimiliano Todisco
123
132
0
04 May 2020
VGGSound: A Large-scale Audio-Visual Dataset
Honglie Chen
Weidi Xie
Andrea Vedaldi
Andrew Zisserman
110
583
0
29 Apr 2020
Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision
Soo-Whan Chung
Hong-Goo Kang
Joon Son Chung
SSL
48
39
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
84
26
0
28 Apr 2020
Neural Head Reenactment with Latent Pose Descriptors
Egor Burkov
I. Pasechnik
Artur Grigorev
Victor Lempitsky
3DH
122
131
0
24 Apr 2020
Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release
Yaowei Han
Sheng Li
Yang Cao
Qiang Ma
Masatoshi Yoshikawa
58
45
0
16 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Hyeong-Seok Choi
Changdae Park
Kyogu Lee
CVBM
48
29
0
13 Apr 2020
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification
Xu Li
Jinghua Zhong
Jianwei Yu
Shoukang Hu
Xixin Wu
Xunying Liu
Helen Meng
BDL
67
12
0
08 Apr 2020
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech
N. Wilkinson
A. Biswas
Emre Yilmaz
Febe de Wet
Ewald van der Westhuizen
T. Niesler
75
11
0
08 Apr 2020
Motion-supervised Co-Part Segmentation
Aliaksandr Siarohin
Subhankar Roy
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
SSL
48
35
0
07 Apr 2020
Deep Normalization for Speaker Vectors
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
87
25
0
07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Youngmoon Jung
Seong Min Kye
Yeunju Choi
Myunghun Jung
Hoirin Kim
77
37
0
07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye
Youngmoon Jung
Haebeom Lee
Sung Ju Hwang
Hoirin Kim
124
51
0
06 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion
Mayank Tripathi
Divyanshu Singh
Seba Susan
28
8
0
05 Apr 2020
Neural i-vectors
Ville Vestman
Kong Aik Lee
Tomi Kinnunen
DRL
59
4
0
03 Apr 2020
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection
Tharindu Fernando
Sridha Sridharan
Mitchell McLaren
Darshana Priyasad
Simon Denman
Clinton Fookes
41
5
0
02 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
77
60
0
01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition
João Antônio Chagas Nunes
David Macêdo
Cleber Zanchettin
64
23
0
31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification
Juan Manuel Coria
H. Bredin
Sahar Ghannay
S. Rosset
50
15
0
31 Mar 2020
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
Xianfang Zeng
Yusu Pan
Mengmeng Wang
Jiangning Zhang
Yong Liu
CVBM
147
42
0
29 Mar 2020
Learning Inverse Rendering of Faces from Real-world Videos
Yuda Qiu
Zhangyang Xiong
Kai Han
Zhongyuan Wang
Zixiang Xiong
Xiaoguang Han
CVBM
3DH
32
2
0
26 Mar 2020
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
77
437
0
26 Mar 2020
Improving Embedding Extraction for Speaker Verification with Ladder Network
Fei Tao
Gokhan Tur
24
3
0
20 Mar 2020
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data
Vincent Roger
Jérôme Farinas
J. Pinquier
57
24
0
09 Mar 2020
Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments
I. Vélez
C. Rascón
Gibran Fuentes Pineda
84
10
0
06 Mar 2020
First Order Motion Model for Image Animation
Aliaksandr Siarohin
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
VGen
DiffM
184
942
0
29 Feb 2020
Bio-Inspired Modality Fusion for Active Speaker Detection
Gustavo Assunção
Nuno Gonçalves
Paulo Menezes
19
3
0
28 Feb 2020
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models
Edresson Casanova
Arnaldo Cândido Júnior
C. Shulby
F. S. Oliveira
L. Gris
Hamilton Pereira da Silva
S. Aluísio
M. Ponti
16
2
0
25 Feb 2020
Towards Learning a Universal Non-Semantic Representation of Speech
Joel Shor
A. Jansen
Ronnie Maor
Oran Lang
Omry Tuval
Félix de Chaumont Quitry
Marco Tagliasacchi
Ira Shavitt
Dotan Emanuel
Yinnon A. Haviv
SSL
158
160
0
25 Feb 2020
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong Liu
CVBM
113
123
0
24 Feb 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team
Qingjian Lin
Weicheng Cai
Lin Yang
Junjie Wang
J. Zhang
Ming Li
VLM
59
18
0
23 Feb 2020
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
45
6
0
21 Feb 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
92
88
0
20 Feb 2020