arXiv:1706.08612
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Papers citing "VoxCeleb: a large-scale speaker identification dataset"
50 / 1,098 papers shown
Title
Deep generative LDA
Yunqi Cai
Dong Wang
30
1
0
30 Oct 2020
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews
Rachid Riad
Hadrien Titeux
Laurie Lemoine
Justine Montillot
A. Sliwinski
J. Bagnou
Xuan-Nga Cao
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
15
0
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
26
59
0
29 Oct 2020
Playing a Part: Speaker Verification at the Movies
A. Brown
Jaesung Huh
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
18
23
0
29 Oct 2020
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model
Yanpei Shi
Mingjie Chen
Qiang Huang
Thomas Hain
21
5
0
29 Oct 2020
Generative Adversarial Networks in Human Emotion Synthesis: A Review
Noushin Hajarolasvadi
M. A. Ramírez
H. Demirel
GAN
17
20
0
28 Oct 2020
Leveraging speaker attribute information using multi task learning for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
27
9
0
27 Oct 2020
Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification
Lantian Li
Yang Zhang
Jiawen Kang
T. Zheng
Dong Wang
11
4
0
27 Oct 2020
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus
Mike Wu
J. Nafziger
A. Scodary
Andrew L. Maas
31
17
0
26 Oct 2020
Speaker Anonymization with Distribution-Preserving X-Vector Generation for the VoicePrivacy Challenge 2020
H.C.M. Turner
Giulio Lovisotto
Ivan Martinovic
8
21
0
26 Oct 2020
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
22
37
0
25 Oct 2020
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
Ge Zhu
Fei Jiang
Z. Duan
16
25
0
24 Oct 2020
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
14
49
0
23 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
28
11
0
22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
11
1
0
22 Oct 2020
Graph Attention Networks for Speaker Verification
Jee-weon Jung
Hee-Soo Heo
Ha-Jin Yu
Joon Son Chung
25
26
0
22 Oct 2020
Momentum Contrast Speaker Representation Learning
Jangho Lee
Jaihyun Koh
Sungroh Yoon
SSL
21
3
0
22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
49
21
0
22 Oct 2020
Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
24
2
0
22 Oct 2020
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
22
83
0
21 Oct 2020
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)
Muhammad Umair Ahmed Khan
Javier Hernando
DRL
6
3
0
21 Oct 2020
Multi-task Metric Learning for Text-independent Speaker Verification
Yafeng Chen
Wu Guo
Jing Shi
Jiajun Qi
Tan Liu
174
0
0
21 Oct 2020
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLM
SSL
24
262
0
21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge 2020
Shufan Shen
Ran Miao
Yi Wang
Zhihua Wei
16
0
0
20 Oct 2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020
Rui Wang
Zhihua Wei
Yibin Zhan
Zhuoxiao Chen
16
0
0
16 Oct 2020
Viewmaker Networks: Learning Views for Unsupervised Representation Learning
Alex Tamkin
Mike Wu
Noah D. Goodman
SSL
30
64
0
14 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation
Rohan Kumar Das
Ruijie Tao
Jichen Yang
Wei Rao
Cheng Yu
Haizhou Li
30
11
0
08 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Youngmoon Jung
Yeunju Choi
Hyungjun Lim
Hoirin Kim
19
13
0
06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
13
132
0
29 Sep 2020
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning
Tedd Kourkounakis
Amirhossein Hajavi
Ali Etemad
19
22
0
23 Sep 2020
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias
Mufan Sang
Wei Xia
John H. L. Hansen
33
17
0
21 Sep 2020
Online Speaker Diarization with Relation Network
Xiang Li
Yucheng Zhao
Chong Luo
Wenjun Zeng
10
2
0
17 Sep 2020
When Automatic Voice Disguise Meets Automatic Speaker Verification
Linlin Zheng
Jiakang Li
Meng Sun
Xiongwei Zhang
T. Zheng
29
17
0
15 Sep 2020
Utterance Clustering Using Stereo Audio Channels
Yingjun Dong
Neil G. MacLaren
Yiding Cao
F. Yammarino
Shelley D. Dionne
M. Mumford
S. Connelly
Hiroki Sayama
G. Ruark
6
0
0
10 Sep 2020
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling
Songxiang Liu
Yuewen Cao
Disong Wang
Xixin Wu
Xunying Liu
Helen Meng
BDL
29
88
0
06 Sep 2020
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification
Zhenyu Wang
Wei Xia
John H. L. Hansen
20
12
0
05 Sep 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
Amirhossein Hajavi
Ali Etemad
24
2
0
03 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
22
9
0
02 Sep 2020
Few Shot Text-Independent speaker verification using 3D-CNN
Prateek Mishra
27
5
0
25 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
33
1
0
20 Aug 2020
Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks
Guangming Yao
Yi Yuan
Tianjia Shao
Kun Zhou
3DH
CVBM
35
56
0
18 Aug 2020
Adversarial Attack and Defense Strategies for Deep Speaker Recognition Systems
Arindam Jati
Chin-Cheng Hsu
Monisankha Pal
Raghuveer Peri
Wael AbdAlmageed
Shrikanth Narayanan
AAML
27
65
0
18 Aug 2020
Cross attentive pooling for speaker verification
Seong Min Kye
Yoohwan Kwon
Joon Son Chung
25
9
0
13 Aug 2020
Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe submission to NIST SRE Challenge 2019
G. Antipov
N. Gengembre
Olivier Le Blouch
Gaël Le Lan
CVBM
20
3
0
13 Aug 2020
Compact Speaker Embedding: lrx-vector
Munir Georges
Jonathan Huang
Tobias Bocklet
20
11
0
11 Aug 2020
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer Encoder
Narla John Metilda Sagaya Mary
S. Umesh
Sandesh V Katta
8
31
0
11 Aug 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
22
48
0
11 Aug 2020
Neural PLDA Modeling for End-to-End Speaker Verification
Shreyas Ramoji
Prashant Krishnan
Sriram Ganapathy
24
5
0
11 Aug 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
21
83
0
11 Aug 2020
Self-Supervised Learning of Audio-Visual Objects from Video
Triantafyllos Afouras
Andrew Owens
Joon Son Chung
Andrew Zisserman
SSL
19
253
0
10 Aug 2020