Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08612
Cited By
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,098 papers shown
Title
A Deep Neural Network for Short-Segment Speaker Recognition
Amirhossein Hajavi
Ali Etemad
14
74
0
22 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors
N. Maghsoodi
Hossein Sameti
Hossein Zeinali
Themos Stafylakis
14
13
0
13 Jul 2019
Self-supervised Learning of Interpretable Keypoints from Unlabelled Videos
Tomas Jakab
Ankush Gupta
Hakan Bilen
Andrea Vedaldi
SSL
20
9
0
03 Jul 2019
Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification
Chieh-Chi Kao
Ming Sun
Yixin Gao
S. Vitaladevuni
Chao Wang
23
13
0
02 Jul 2019
Synchronising audio and ultrasound by learning cross-modal embeddings
Aciel Eshky
M. Ribeiro
Korin Richmond
Steve Renals
8
5
0
01 Jul 2019
Who said that?: Audio-visual speaker diarisation of real-world meetings
Joon Son Chung
Bong-Jin Lee
Icksang Han
13
44
0
24 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
Shuo Liu
Gil Keren
Björn Schuller
23
3
0
24 Jun 2019
Self Multi-Head Attention for Speaker Recognition
Miquel India
Pooyan Safari
Javier Hernando
19
110
0
24 Jun 2019
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration
Ville Vestman
Kong Aik Lee
Tomi Kinnunen
Takafumi Koshinaka
6
2
0
20 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
The Second DIHARD Diarization Challenge: Dataset, task, and baselines
Neville Ryant
Kenneth Church
C. Cieri
Alejandrina Cristià
Jun Du
Sriram Ganapathy
M. Liberman
12
180
0
18 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Xu Xiang
Shuai Wang
Houjun Huang
Y. Qian
Kai Yu
DRL
21
142
0
18 Jun 2019
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Ville Vestman
Tomi Kinnunen
Rosa González Hautamäki
Md. Sahidullah
34
37
0
03 Jun 2019
Speaker Anonymization Using X-vector and Neural Waveform Models
Fuming Fang
Xin Wang
Junichi Yamagishi
Isao Echizen
Massimiliano Todisco
Nicholas W. D. Evans
J. Bonastre
21
135
0
30 May 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks
Xiaoqi Jia
Jianwei Tai
Hang Zhou
Yakai Li
Weijuan Zhang
Haichao Du
Qingjia Huang
GAN
22
6
0
27 May 2019
Speech2Face: Learning the Face Behind a Voice
Tae-Hyun Oh
Tali Dekel
Changil Kim
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Wojciech Matusik
SSL
CVBM
27
163
0
23 May 2019
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
Egor Zakharov
Aliaksandra Shysheya
Egor Burkov
Victor Lempitsky
3DH
65
625
0
20 May 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
14
461
0
14 May 2019
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss
Lele Chen
R. Maddox
Z. Duan
Chenliang Xu
CVBM
31
391
0
09 May 2019
Meeting Transcription Using Virtual Microphone Arrays
Takuya Yoshioka
Zhuo Chen
Dimitrios Dimitriadis
William Fu-Hinthorn
Xuedong Huang
A. Stolcke
Michael Zeng
29
15
0
03 May 2019
Few Shot Speaker Recognition using Deep Neural Networks
Prashant Anand
A. Singh
Siddharth Srivastava
Brejesh Lall
25
39
0
17 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
17
140
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
Kong Aik Lee
Ville Hautamaki
Tomi Kinnunen
Hitoshi Yamamoto
K. Okabe
...
Chng Eng Siong
Shivesh Ranjan
John H. L. Hansen
Massimiliano Todisco
Nicholas W. D. Evans
BDL
19
21
0
16 Apr 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
27
595
0
09 Apr 2019
VAE-based regularization for deep speaker embedding
Yang Zhang
Lantian Li
Dong Wang
DRL
BDL
11
19
0
07 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation
Suwon Shon
Najim Dehak
D. Reynolds
James R. Glass
19
26
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
9
85
0
07 Apr 2019
Self-supervised speaker embeddings
Themos Stafylakis
Johan Rohdin
Oldrich Plchot
Petr Mizera
L. Burget
SSL
6
48
0
06 Apr 2019
Large Margin Softmax Loss for Speaker Verification
Yi Y. Liu
Liang He
Jia-Wei Liu
27
145
0
06 Apr 2019
ICface: Interpretable and Controllable Face Reenactment Using GANs
S. Tripathy
Arno Solin
Esa Rahtu
CVBM
30
87
0
03 Apr 2019
Multi-Task Learning with High-Order Statistics for X-vector based Text-Independent Speaker Verification
Lanhua You
Wu Guo
Lirong Dai
Jun Du
13
12
0
28 Mar 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
A. Duarte
Francisco Roldan
Miquel Tubau
Janna Escur
Santiago Pascual
Amaia Salvador
Eva Mohedano
Kevin McGuinness
Jordi Torres
Xavier Giró-i-Nieto
GAN
CVBM
33
79
0
25 Mar 2019
The VOiCES from a Distance Challenge 2019 Evaluation Plan
Mahesh Kumar Nandwana
Julien van Hout
Mitchell McLaren
Colleen Richey
A. Lawson
M. Barrios
14
91
0
27 Feb 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
19
343
0
26 Feb 2019
End-to-end losses based on speaker basis vectors and all-speaker hard negative mining for speaker verification
Hee-Soo Heo
Jee-weon Jung
Il-Ho Yang
Sung-Hyun Yoon
Hye-jin Shim
Ha-Jin Yu
19
22
0
07 Feb 2019
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Joseph Roth
Sourish Chaudhuri
Ondˇrej Klejch
Radhika Marvin
Andrew C. Gallagher
...
S. Ramaswamy
Arkadiusz Stopczynski
Cordelia Schmid
Zhonghua Xi
C. Pantofaru
11
143
0
05 Jan 2019
Speech and Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
9
30
0
13 Dec 2018
Theoretical Guarantees of Deep Embedding Losses Under Label Noise
Nam Le
J. Odobez
NoLa
14
1
0
06 Dec 2018
TwoStreamVAN: Improving Motion Modeling in Video Generation
Ximeng Sun
Huijuan Xu
Kate Saenko
DiffM
VGen
18
17
0
03 Dec 2018
Learning Speaker Representations with Mutual Information
Mirco Ravanelli
Yoshua Bengio
SSL
DRL
16
91
0
01 Dec 2018
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion
Suwon Shon
Tae-Hyun Oh
James R. Glass
11
50
0
27 Nov 2018
Interpretable Convolutional Filters with SincNet
Mirco Ravanelli
Yoshua Bengio
21
104
0
23 Nov 2018
iQIYI-VID: A Large Dataset for Multi-modal Person Identification
Yuanliu Liu
Bo Peng
Peipei Shi
He Yan
Yong Zhou
...
Tingwei Gao
G. Wang
Jian Liu
Xiangju Lu
Danming Xie
15
35
0
19 Nov 2018
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection
Tomi Kinnunen
Rosa González Hautamäki
Ville Vestman
Md. Sahidullah
32
5
0
09 Nov 2018
Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search
R. Krishnan
Bilal Soomro
Mahesh Subedar
Ville Hautamaki
Tomi Kinnunen
27
5
0
08 Nov 2018
Gaussian-Constrained training for speaker verification
Lantian Li
Zhiyuan Tang
Ying Shi
Dong Wang
11
26
0
08 Nov 2018
Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training
Christoph Dann
Lihong Li
Wei Wei
17
39
0
07 Nov 2018
Building Corpora for Single-Channel Speech Separation Across Multiple Domains
Aman Rana
Gregory Sell
Leibny Paola García Perera
A. Lowe
Pratik Shah
19
10
0
06 Nov 2018
How to Improve Your Speaker Embeddings Extractor in Generic Toolkits
Christopher Snyder
Lukás Burget
S. Vishwanath
Themos Stafylakis
Jan Cernocky
15
51
0
05 Nov 2018
Previous
1
2
3
...
20
21
22
Next