ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08612
  4. Cited By
VoxCeleb: a large-scale speaker identification dataset
v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown
Title
Delving into VoxCeleb: environment invariant speaker recognition
Delving into VoxCeleb: environment invariant speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
94
51
0
24 Oct 2019
Self-supervised pre-training with acoustic configurations for replay
  spoofing detection
Self-supervised pre-training with acoustic configurations for replay spoofing detection
Hye-jin Shim
Hee-Soo Heo
Jee-weon Jung
Ha-Jin Yu
69
6
0
22 Oct 2019
Label-efficient audio classification through multitask learning and
  self-supervision
Label-efficient audio classification through multitask learning and self-supervision
Tyler Lee
Ting Gong
Suchismita Padhy
Andrew Rouditchenko
A. Ndirango
SSLVLM
60
7
0
19 Oct 2019
Frequency and temporal convolutional attention for text-independent
  speaker recognition
Frequency and temporal convolutional attention for text-independent speaker recognition
Sarthak Yadav
A. Rai
120
58
0
16 Oct 2019
Non-native Speaker Verification for Spoken Language Assessment
Non-native Speaker Verification for Spoken Language Assessment
Linlin Wang
Yu Wang
Mark Gales
17
1
0
30 Sep 2019
Understanding Semantics from Speech Through Pre-training
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSLVLM
33
6
0
24 Sep 2019
Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual
  Signals
Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
67
33
0
18 Sep 2019
VAE-based Domain Adaptation for Speaker Verification
VAE-based Domain Adaptation for Speaker Verification
Xueyi Wang
Lantian Li
Dong Wang
56
16
0
27 Aug 2019
Unsupervised Learning of Landmarks by Descriptor Vector Exchange
Unsupervised Learning of Landmarks by Descriptor Vector Exchange
James Thewlis
Samuel Albanie
Hakan Bilen
Andrea Vedaldi
SSL
103
68
0
18 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DVVLMAI4TS
100
212
0
16 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and
  Transfer Learning
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
55
27
0
13 Aug 2019
Personal VAD: Speaker-Conditioned Voice Activity Detection
Personal VAD: Speaker-Conditioned Voice Activity Detection
Shaojin Ding
Quan Wang
Shuo-yiin Chang
Li Wan
Ignacio López Moreno
76
75
0
12 Aug 2019
A Study on Angular Based Embedding Learning for Text-independent Speaker
  Verification
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification
Zhiyong Chen
Zongze Ren
Shugong Xu
34
4
0
12 Aug 2019
BPPSA: Scaling Back-propagation by Parallel Scan Algorithm
BPPSA: Scaling Back-propagation by Parallel Scan Algorithm
Shang Wang
Yifan Bai
Gennady Pekhimenko
60
7
0
23 Jul 2019
A Deep Neural Network for Short-Segment Speaker Recognition
A Deep Neural Network for Short-Segment Speaker Recognition
Amirhossein Hajavi
Ali Etemad
67
75
0
22 Jul 2019
Speaker Recognition with Random Digit Strings Using Uncertainty
  Normalized HMM-based i-vectors
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors
N. Maghsoodi
Hossein Sameti
Hossein Zeinali
Themos Stafylakis
32
13
0
13 Jul 2019
Self-supervised Learning of Interpretable Keypoints from Unlabelled
  Videos
Self-supervised Learning of Interpretable Keypoints from Unlabelled Videos
Tomas Jakab
Ankush Gupta
Hakan Bilen
Andrea Vedaldi
SSL
93
9
0
03 Jul 2019
Sub-band Convolutional Neural Networks for Small-footprint Spoken Term
  Classification
Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification
Chieh-Chi Kao
Ming Sun
Yixin Gao
S. Vitaladevuni
Chao Wang
62
14
0
02 Jul 2019
Synchronising audio and ultrasound by learning cross-modal embeddings
Synchronising audio and ultrasound by learning cross-modal embeddings
Aciel Eshky
M. Ribeiro
Korin Richmond
Steve Renals
46
5
0
01 Jul 2019
Who said that?: Audio-visual speaker diarisation of real-world meetings
Who said that?: Audio-visual speaker diarisation of real-world meetings
Joon Son Chung
Bong-Jin Lee
Icksang Han
70
46
0
24 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
Shuo Liu
Gil Keren
Björn Schuller
40
3
0
24 Jun 2019
Self Multi-Head Attention for Speaker Recognition
Self Multi-Head Attention for Speaker Recognition
Miquel India
Pooyan Safari
Javier Hernando
76
111
0
24 Jun 2019
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration
Ville Vestman
Kong Aik Lee
Tomi Kinnunen
Takafumi Koshinaka
19
2
0
20 Jun 2019
Spatial Pyramid Encoding with Convex Length Normalization for
  Text-Independent Speaker Verification
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
66
32
0
19 Jun 2019
The Second DIHARD Diarization Challenge: Dataset, task, and baselines
The Second DIHARD Diarization Challenge: Dataset, task, and baselines
Neville Ryant
Kenneth Church
C. Cieri
Alejandrina Cristià
Jun Du
Sriram Ganapathy
M. Liberman
58
182
0
18 Jun 2019
Margin Matters: Towards More Discriminative Deep Neural Network
  Embeddings for Speaker Recognition
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Xu Xiang
Shuai Wang
Houjun Huang
Y. Qian
Kai Yu
DRL
77
145
0
18 Jun 2019
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Ville Vestman
Tomi Kinnunen
Rosa González Hautamäki
Md. Sahidullah
84
37
0
03 Jun 2019
Speaker Anonymization Using X-vector and Neural Waveform Models
Speaker Anonymization Using X-vector and Neural Waveform Models
Fuming Fang
Xin Wang
Junichi Yamagishi
Isao Echizen
Massimiliano Todisco
Nicholas W. D. Evans
J. Bonastre
65
135
0
30 May 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent
  Generative Adversarial Networks
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks
Xiaoqi Jia
Jianwei Tai
Hang Zhou
Yakai Li
Weijuan Zhang
Haichao Du
Qingjia Huang
GAN
34
6
0
27 May 2019
Speech2Face: Learning the Face Behind a Voice
Speech2Face: Learning the Face Behind a Voice
Tae-Hyun Oh
Tali Dekel
Changil Kim
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Wojciech Matusik
SSLCVBM
112
164
0
23 May 2019
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
Egor Zakharov
Aliaksandra Shysheya
Egor Burkov
Victor Lempitsky
3DH
178
631
0
20 May 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
135
471
0
14 May 2019
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise
  Loss
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss
Lele Chen
R. Maddox
Z. Duan
Chenliang Xu
CVBM
98
400
0
09 May 2019
Meeting Transcription Using Virtual Microphone Arrays
Meeting Transcription Using Virtual Microphone Arrays
Takuya Yoshioka
Zhuo Chen
Dimitrios Dimitriadis
William Fu-Hinthorn
Xuedong Huang
A. Stolcke
Michael Zeng
76
15
0
03 May 2019
Few Shot Speaker Recognition using Deep Neural Networks
Few Shot Speaker Recognition using Deep Neural Networks
Prashant Anand
A. Singh
Siddharth Srivastava
Brejesh Lall
65
40
0
17 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for
  text-independent speaker verification
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
87
142
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint
  Embedding and Clustering
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
61
25
0
16 Apr 2019
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared
  Experiences
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
Kong Aik Lee
Ville Hautamaki
Tomi Kinnunen
Hitoshi Yamamoto
K. Okabe
...
Chng Eng Siong
Shivesh Ranjan
John H. L. Hansen
Massimiliano Todisco
Nicholas W. D. Evans
BDL
49
21
0
16 Apr 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
99
617
0
09 Apr 2019
VAE-based regularization for deep speaker embedding
VAE-based regularization for deep speaker embedding
Yang Zhang
Lantian Li
Dong Wang
DRLBDL
46
19
0
07 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification
  Challenge Evaluation
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation
Suwon Shon
Najim Dehak
D. Reynolds
James R. Glass
38
26
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
73
88
0
07 Apr 2019
Self-supervised speaker embeddings
Self-supervised speaker embeddings
Themos Stafylakis
Johan Rohdin
Oldrich Plchot
Petr Mizera
L. Burget
SSL
50
48
0
06 Apr 2019
Large Margin Softmax Loss for Speaker Verification
Large Margin Softmax Loss for Speaker Verification
Yi Y. Liu
Liang He
Jia-Wei Liu
68
145
0
06 Apr 2019
ICface: Interpretable and Controllable Face Reenactment Using GANs
ICface: Interpretable and Controllable Face Reenactment Using GANs
S. Tripathy
Arno Solin
Esa Rahtu
CVBM
66
90
0
03 Apr 2019
Multi-Task Learning with High-Order Statistics for X-vector based
  Text-Independent Speaker Verification
Multi-Task Learning with High-Order Statistics for X-vector based Text-Independent Speaker Verification
Lanhua You
Wu Guo
Lirong Dai
Jun Du
44
12
0
28 Mar 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial
  Networks
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
A. Duarte
Francisco Roldan
Miquel Tubau
Janna Escur
Santiago Pascual
Amaia Salvador
Eva Mohedano
Kevin McGuinness
Jordi Torres
Xavier Giró-i-Nieto
GANCVBM
71
79
0
25 Mar 2019
The VOiCES from a Distance Challenge 2019 Evaluation Plan
The VOiCES from a Distance Challenge 2019 Evaluation Plan
Mahesh Kumar Nandwana
Julien van Hout
Mitchell McLaren
Colleen Richey
A. Lawson
M. Barrios
60
92
0
27 Feb 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
74
344
0
26 Feb 2019
End-to-end losses based on speaker basis vectors and all-speaker hard
  negative mining for speaker verification
End-to-end losses based on speaker basis vectors and all-speaker hard negative mining for speaker verification
Hee-Soo Heo
Jee-weon Jung
Il-Ho Yang
Sung-Hyun Yoon
Hye-jin Shim
Ha-Jin Yu
85
22
0
07 Feb 2019
Previous
123...20212223
Next