ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08612
  4. Cited By
VoxCeleb: a large-scale speaker identification dataset
v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown
Title
Speaker De-identification System using Autoencoders and Adversarial
  Training
Speaker De-identification System using Autoencoders and Adversarial Training
Fernando M. Espinoza-Cuadros
Juan M. Perero-Codosero
Javier Antón-Martín
L. A. H. Gómez
AAML
43
14
0
09 Nov 2020
FRILL: A Non-Semantic Speech Embedding for Mobile Devices
FRILL: A Non-Semantic Speech Embedding for Mobile Devices
J. Peplinski
Joel Shor
Sachin P. Joglekar
Jake Garrison
Shwetak N. Patel
68
24
0
09 Nov 2020
Masked Proxy Loss For Text-Independent Speaker Verification
Masked Proxy Loss For Text-Independent Speaker Verification
Jiachen Lian
A. V. Kumar
Hira Dhamyal
Bhiksha Raj
Rita Singh
64
2
0
09 Nov 2020
Non-local convolutional neural networks (nlcnn) for speaker recognition
Non-local convolutional neural networks (nlcnn) for speaker recognition
Haici Yang
Hongda Mao
Ruirui Li
C. Ju
Oguz H. Elibol
65
0
0
07 Nov 2020
Large-scale multilingual audio visual dubbing
Large-scale multilingual audio visual dubbing
Yi Yang
Brendan Shillingford
Yannis Assael
Miaosen Wang
Wendi Liu
...
Eren Sezener
Luis C. Cobo
Misha Denil
Y. Aytar
Nando de Freitas
70
21
0
06 Nov 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting
  Transcription
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
64
13
0
05 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
105
31
0
05 Nov 2020
Paralinguistic Privacy Protection at the Edge
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
64
14
0
04 Nov 2020
Query Expansion System for the VoxCeleb Speaker Recognition Challenge
  2020
Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020
Yu Cheng
Chun-Liang Shih
Tien-Hong Lo
Wen-Ting Tseng
Berlin Chen
22
0
0
04 Nov 2020
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Zhong Meng
Liang Lu
Yashesh Gaur
Xiaofei Wang
Zhuo Chen
Takuya Yoshioka
71
17
0
03 Nov 2020
Integration of speech separation, diarization, and recognition for
  multi-speaker meetings: System description, comparison, and analysis
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
63
88
0
03 Nov 2020
ShaneRun System Description to VoxCeleb Speaker Recognition Challenge
  2020
ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020
Shen Chen
DRL
37
1
0
03 Nov 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
Xu Xiang
49
14
0
31 Oct 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
116
6
0
30 Oct 2020
Deep generative LDA
Deep generative LDA
Yunqi Cai
Dong Wang
66
1
0
30 Oct 2020
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol
  for conversational Clinical Interviews
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews
Rachid Riad
Hadrien Titeux
Laurie Lemoine
Justine Montillot
A. Sliwinski
J. Bagnou
Xuan-Nga Cao
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
35
0
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
121
61
0
29 Oct 2020
Playing a Part: Speaker Verification at the Movies
Playing a Part: Speaker Verification at the Movies
A. Brown
Jaesung Huh
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
73
23
0
29 Oct 2020
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical
  Transformer Model
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model
Yanpei Shi
Mingjie Chen
Qiang Huang
Thomas Hain
48
5
0
29 Oct 2020
Generative Adversarial Networks in Human Emotion Synthesis:A Review
Generative Adversarial Networks in Human Emotion Synthesis:A Review
Noushin Hajarolasvadi
M. A. Ramírez
H. Demirel
GAN
69
22
0
28 Oct 2020
Leveraging speaker attribute information using multi task learning for
  speaker verification and diarization
Leveraging speaker attribute information using multi task learning for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
54
9
0
27 Oct 2020
Squeezing value of cross-domain labels: a decoupled scoring approach for
  speaker verification
Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification
Lantian Li
Yang Zhang
Jiawen Kang
Tianshi Zheng
Dong Wang
43
5
0
27 Oct 2020
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus
Mike Wu
J. Nafziger
A. Scodary
Andrew L. Maas
91
17
0
26 Oct 2020
Speaker Anonymization with Distribution-Preserving X-Vector Generation
  for the VoicePrivacy Challenge 2020
Speaker Anonymization with Distribution-Preserving X-Vector Generation for the VoicePrivacy Challenge 2020
H.C.M. Turner
Giulio Lovisotto
Ivan Martinovic
73
21
0
26 Oct 2020
An iterative framework for self-supervised deep speaker representation
  learning
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
67
37
0
25 Oct 2020
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
Ge Zhu
Fei Jiang
Z. Duan
91
25
0
24 Oct 2020
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
68
49
0
23 Oct 2020
Compositional embedding models for speaker identification and
  diarization with simultaneous speech from 2+ speakers
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
140
11
0
22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker
  Diarisation Challenge
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
23
1
0
22 Oct 2020
Graph Attention Networks for Speaker Verification
Graph Attention Networks for Speaker Verification
Jee-weon Jung
Hee-Soo Heo
Ha-Jin Yu
Joon Son Chung
93
27
0
22 Oct 2020
Momentum Contrast Speaker Representation Learning
Momentum Contrast Speaker Representation Learning
Jangho Lee
Jaihyun Koh
Sungroh Yoon
SSL
62
3
0
22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via
  Contrastive Equilibrium Learning
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
90
21
0
22 Oct 2020
Robust Text-Dependent Speaker Verification via Character-Level
  Information Preservation for the SdSV Challenge 2020
Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
36
2
0
22 Oct 2020
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and
  Quality-Aware Score Calibration in DNN Based Speaker Verification
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
82
84
0
21 Oct 2020
The UPC Speaker Verification System Submitted to VoxCeleb Speaker
  Recognition Challenge 2020 (VoxSRC-20)
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)
Muhammad Umair Ahmed Khan
Javier Hernando
DRL
42
3
0
21 Oct 2020
Multi-task Metric Learning for Text-independent Speaker Verification
Yafeng Chen
Wu Guo
Jing Shi
Jiajun Qi
Tan Liu
335
0
0
21 Oct 2020
Contrastive Learning of General-Purpose Audio Representations
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLMSSL
91
272
0
21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker
  Recognition Challenge2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020
Shufan Shen
Ran Miao
Yi Wang
Zhihua Wei
28
0
0
20 Oct 2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge
  2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020
Rui Wang
Zhihua Wei
Yibin Zhan
Zhuoxiao Chen
24
0
0
16 Oct 2020
Viewmaker Networks: Learning Views for Unsupervised Representation
  Learning
Viewmaker Networks: Learning Views for Unsupervised Representation Learning
Alex Tamkin
Mike Wu
Noah D. Goodman
SSL
131
64
0
14 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition
  Evaluation
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation
Rohan Kumar Das
Ruijie Tao
Jichen Yang
Wei Rao
Cheng Yu
Haizhou Li
49
11
0
08 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker
  Verification in Adverse Environments
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Youngmoon Jung
Yeunju Choi
Hyungjun Lim
Hoirin Kim
65
13
0
06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge
  2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
60
134
0
29 Sep 2020
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning
Tedd Kourkounakis
Amirhossein Hajavi
Ali Etemad
56
23
0
23 Sep 2020
Open-set Short Utterance Forensic Speaker Verification using
  Teacher-Student Network with Explicit Inductive Bias
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias
Mufan Sang
Wei Xia
John H. L. Hansen
73
18
0
21 Sep 2020
Online Speaker Diarization with Relation Network
Xiang Li
Yucheng Zhao
Chong Luo
Wenjun Zeng
44
2
0
17 Sep 2020
When Automatic Voice Disguise Meets Automatic Speaker Verification
When Automatic Voice Disguise Meets Automatic Speaker Verification
Linlin Zheng
Jiakang Li
Meng Sun
Xiongwei Zhang
Tianshi Zheng
57
19
0
15 Sep 2020
Utterance Clustering Using Stereo Audio Channels
Utterance Clustering Using Stereo Audio Channels
Yingjun Dong
Neil G. MacLaren
Yiding Cao
F. Yammarino
Shelley D. Dionne
M. Mumford
S. Connelly
Hiroki Sayama
G. Ruark
23
0
0
10 Sep 2020
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence
  Modeling
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling
Songxiang Liu
Yuewen Cao
Disong Wang
Xixin Wu
Xunying Liu
Helen Meng
BDL
116
92
0
06 Sep 2020
Cross-domain Adaptation with Discrepancy Minimization for
  Text-independent Forensic Speaker Verification
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification
Zhenyu Wang
Wei Xia
John H. L. Hansen
45
12
0
05 Sep 2020
Previous
123...161718...212223
Next