arXiv: 1706.08612
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,100 papers shown
Title
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
38
58
0
10 Mar 2021
Am I a Real or Fake Celebrity? Measuring Commercial Face Recognition Web APIs under Deepfake Impersonation Attack
Shahroz Tariq
Sowon Jeon
Simon S. Woo
32
25
0
01 Mar 2021
Learnable MFCCs for Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
32
17
0
20 Feb 2021
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
21
0
0
18 Feb 2021
Biometrics in the Era of COVID-19: Challenges and Opportunities
M. Gomez-Barrero
P. Drozdowski
Christian Rathgeb
J. Patino
Massimiliano Todisco
A. Nautsch
Naser Damer
Jannier Priesnitz
Nicholas W. D. Evans
Christoph Busch
45
54
0
18 Feb 2021
Adversarial defense for automatic speaker verification by cascaded self-supervised learning models
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
32
40
0
14 Feb 2021
A Multi-View Approach To Audio-Visual Speaker Verification
Leda Sari
Kritika Singh
Jiatong Zhou
Lorenzo Torresani
Nayan Singhal
Yatharth Saraf
19
38
0
11 Feb 2021
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A. Nautsch
Xin Wang
Nicholas W. D. Evans
Tomi Kinnunen
Ville Vestman
Massimiliano Todisco
Héctor Delgado
Md. Sahidullah
Junichi Yamagishi
Kong Aik Lee
125
146
0
11 Feb 2021
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Giuseppe Ruggiero
Enrico Zovato
Luigi Di Caro
V. Pollet
DiffM
24
9
0
10 Feb 2021
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge
Weiqing Wang
Qingjian Lin
Danwei Cai
Lin Yang
Ming Li
13
8
0
06 Feb 2021
Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks
Peter Wu
Paul Pu Liang
Jiatong Shi
Ruslan Salakhutdinov
Shinji Watanabe
Louis-Philippe Morency
31
8
0
22 Jan 2021
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
85
144
0
21 Jan 2021
MAAS: Multi-modal Assignation for Active Speaker Detection
Juan Carlos León Alcázar
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
65
51
0
11 Jan 2021
FakeBuster: A DeepFakes Detection Tool for Video Conferencing Scenarios
V. Mehta
Parul Gupta
Ramanathan Subramanian
Abhinav Dhall
CVBM
33
22
0
09 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
196
199
0
08 Jan 2021
What all do audio transformer models hear? Probing Acoustic Representations for Language Delivery and its Structure
Jui Shah
Yaman Kumar Singla
Changyou Chen
R. Shah
33
81
0
02 Jan 2021
Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective
Shen Chen
Mingwei Zhang
Jiamin Cui
Wei Yao
CVBM
30
0
0
31 Dec 2020
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
216
200
0
29 Dec 2020
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
13
7
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
T. Zheng
Dong Wang
33
119
0
23 Dec 2020
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Wei Yao
Shen Chen
Jiamin Cui
Yaolin Lou
29
5
0
21 Dec 2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
24
8
0
17 Dec 2020
HeadGAN: One-shot Neural Head Synthesis and Editing
M. Doukas
S. Zafeiriou
V. Sharmanska
CVBM
3DH
27
125
0
15 Dec 2020
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
29
5
0
14 Dec 2020
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning
Mufan Sang
Wei Xia
John H. L. Hansen
OOD
DRL
21
23
0
12 Dec 2020
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
Arsha Nagrani
Joon Son Chung
Jaesung Huh
Andrew Brown
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
21
74
0
12 Dec 2020
Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan
Meng Li
Shiyu Zhou
Bo Xu
117
202
0
11 Dec 2020
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation
Paul-Gauthier Noé
Mohammad MohammadAmini
D. Matrouf
Titouan Parcollet
Andreas Nautsch
J. Bonastre
29
28
0
08 Dec 2020
A Study of Few-Shot Audio Classification
Piper Wolters
Chris Careaga
Brian Hutchinson
Lauren A. Phillips
27
10
0
02 Dec 2020
Joint gender and age estimation based on speech signals using x-vectors and transfer learning
Damian Kwaśny
Daria Hemmerling
14
11
0
02 Dec 2020
A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data
Weicheng Cai
Ming Li
23
3
0
01 Dec 2020
Low Bandwidth Video-Chat Compression using Deep Generative Models
Maxime Oquab
Pierre Stock
Oran Gafni
Daniel Haziza
Tao Xu
...
Yana Hasson
Patrick Labatut
Bobo Bose-Kolanu
T. Peyronel
Camille Couprie
3DH
42
41
0
01 Dec 2020
Look who's not talking
Youngki Kwon
Hee-Soo Heo
Jaesung Huh
Bong-Jin Lee
Joon Son Chung
4
29
0
30 Nov 2020
How Far Are We from Robust Voice Conversion: A Survey
Tzu-hsien Huang
Jheng-hao Lin
Chien-yu Huang
Hung-yi Lee
24
24
0
24 Nov 2020
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification
Xiaoyi Qin
Yaogen Yang
Lin Yang
Xuyang Wang
Junjie Wang
Ming Li
24
0
0
21 Nov 2020
FoolHD: Fooling speaker identification by Highly imperceptible adversarial Disturbances
Ali Shahin Shamsabadi
Francisco Teixeira
A. Abad
Bhiksha Raj
Andrea Cavallaro
Isabel Trancoso
AAML
25
29
0
17 Nov 2020
Image Animation with Perturbed Masks
Yoav Shalev
Lior Wolf
DiffM
VGen
14
6
0
13 Nov 2020
Supervised attention for speaker recognition
Seong Min Kye
Joon Son Chung
Hoirin Kim
15
11
0
10 Nov 2020
Speaker De-identification System using Autoencoders and Adversarial Training
Fernando M. Espinoza-Cuadros
Juan M. Perero-Codosero
Javier Antón-Martín
L. A. H. Gómez
AAML
14
14
0
09 Nov 2020
FRILL: A Non-Semantic Speech Embedding for Mobile Devices
J. Peplinski
Joel Shor
Sachin P. Joglekar
Jake Garrison
Shwetak N. Patel
26
23
0
09 Nov 2020
Masked Proxy Loss For Text-Independent Speaker Verification
Jiachen Lian
A. V. Kumar
Hira Dhamyal
Bhiksha Raj
Rita Singh
12
2
0
09 Nov 2020
Non-local convolutional neural networks (nlcnn) for speaker recognition
Haici Yang
Hongda Mao
Ruirui Li
C. Ju
Oguz H. Elibol
20
0
0
07 Nov 2020
Large-scale multilingual audio visual dubbing
Yi Yang
Brendan Shillingford
Yannis Assael
Miaosen Wang
Wendi Liu
...
Eren Sezener
Luis C. Cobo
Misha Denil
Y. Aytar
Nando de Freitas
35
20
0
06 Nov 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
9
13
0
05 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
36
30
0
05 Nov 2020
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
19
14
0
04 Nov 2020
Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020
Yu Cheng
Chun-Liang Shih
Tien-Hong Lo
Wen-Ting Tseng
Berlin Chen
12
0
0
04 Nov 2020
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Zhong Meng
Liang Lu
Yashesh Gaur
Xiaofei Wang
Zhuo Chen
Takuya Yoshioka
30
17
0
03 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
13
84
0
03 Nov 2020
ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020
Shen Chen
DRL
23
1
0
03 Nov 2020