1706.08612
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,100 papers shown
Title
Adaptive Margin Circle Loss for Speaker Verification
Runqiu Xiao
33
11
0
15 Jun 2021
Voting for the right answer: Adversarial defense for speaker verification
Haibin Wu
Yang Zhang
Zhiyong Wu
Dong Wang
Hung-yi Lee
AAML
33
25
0
15 Jun 2021
Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection
Zhenyu Zhang
Yanhao Ge
Renwang Chen
Ying Tai
Yan Yan
Jian Yang
Chengjie Wang
Jilin Li
Feiyue Huang
CVBM
3DH
16
26
0
15 Jun 2021
Learning Audio-Visual Dereverberation
Changan Chen
Wei-Ju Sun
David Harwath
Kristen Grauman
31
32
0
14 Jun 2021
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
Tomi Kinnunen
A. Nautsch
Md. Sahidullah
Nicholas W. D. Evans
Xin Wang
Massimiliano Todisco
Héctor Delgado
Junichi Yamagishi
Kong Aik Lee
13
1
0
11 Jun 2021
Unsupervised Co-part Segmentation through Assembly
Qingzhe Gao
Bin Wang
Libin Liu
Baoquan Chen
18
13
0
10 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
24
752
0
08 Jun 2021
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios
E. Tsunoo
Kentarou Shibata
Chaitanya Narisetty
Yosuke Kashiwagi
Shinji Watanabe
27
12
0
07 Jun 2021
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
Beáta Lőrincz
Adriana Stan
M. Giurgiu
21
2
0
03 Jun 2021
APES: Audiovisual Person Search in Untrimmed Video
Juan Carlos León Alcázar
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbeláez
Guohao Li
Fabian Caba Heilbron
36
6
0
03 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
SSL
55
29
0
01 Jun 2021
X-Vectors with Multi-Scale Aggregation for Speaker Diarization
Myung-Jae Kim
V. Apsingekar
Divya Neelagiri
19
0
0
16 May 2021
Move2Hear: Active Audio-Visual Source Separation
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
21
44
0
15 May 2021
Study on the temporal pooling used in deep neural networks for speaker verification
Mickael Rouvier
Pierre-Michel Bousquet
J. Duret
25
6
0
10 May 2021
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
13
32
0
10 May 2021
SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Yi-Chen Chen
Po-Han Chi
Shu-Wen Yang
Kai-Wei Chang
Jheng-hao Lin
Sung-Feng Huang
Da-Rong Liu
Chi-Liang Liu
Cheng-Kuang Lee
Hung-yi Lee
MoE
29
17
0
07 May 2021
A Good Image Generator Is What You Need for High-Resolution Video Synthesis
Yu Tian
Jian Ren
Menglei Chai
Kyle Olszewski
Xi Peng
Dimitris N. Metaxas
Sergey Tulyakov
VGen
65
183
0
30 Apr 2021
Personalized Keyphrase Detection using Speaker and Environment Information
R. Rikhye
Quan Wang
Qiao Liang
Yanzhang He
Ding Zhao
Yiteng Huang
A. Narayanan
Ian McGraw
26
11
0
28 Apr 2021
Multimodal Self-Supervised Learning of General Audio Representations
Luyu Wang
Pauline Luc
Adrià Recasens
Jean-Baptiste Alayrac
Aaron van den Oord
SSL
78
41
0
26 Apr 2021
Motion Representations for Articulated Animation
Aliaksandr Siarohin
Oliver J. Woodford
Jian Ren
Menglei Chai
Sergey Tulyakov
OCL
108
261
0
22 Apr 2021
Voice2Mesh: Cross-Modal 3D Face Model Generation from Voices
Cho-Ying Wu
Ke Xu
Chin-Cheng Hsu
Ulrich Neumann
CVBM
3DH
50
4
0
21 Apr 2021
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
Zuxuan Wu
Wenhao Ouyang
Xintong Han
Jingjing Chen
Ser-Nam Lim
Yu-Gang Jiang
ViT
110
258
0
20 Apr 2021
Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization
Prachi Singh
Sriram Ganapathy
SSL
34
9
0
19 Apr 2021
Federated Learning of User Verification Models Without Sharing Embeddings
H. Hosseini
Hyunsin Park
Sungrack Yun
Christos Louizos
Joseph B. Soriaga
Max Welling
FedML
30
23
0
18 Apr 2021
Conditional independence for pretext task selection in Self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
SSL
6
4
0
15 Apr 2021
Speaker Attentive Speech Emotion Recognition
Clément Le Moine
Nicolas Obin
Axel Roebel
24
12
0
15 Apr 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System
Ju-ho Kim
Hye-jin Shim
Jee-weon Jung
Ha-Jin Yu
28
1
0
14 Apr 2021
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation
Fengpeng Yue
Yan Deng
Lei He
Tom Ko
17
8
0
08 Apr 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Yihui Fu
Luyao Cheng
Shubo Lv
Yukai Jv
Yuxiang Kong
...
Jian Wu
Hui Bu
Xin Xu
Jun Du
Jingdong Chen
25
85
0
08 Apr 2021
Single Source One Shot Reenactment using Weighted motion From Paired Feature Points
S. Tripathy
Arno Solin
Esa Rahtu
3DH
DiffM
10
7
0
07 Apr 2021
Adapting Speaker Embeddings for Speaker Diarisation
Youngki Kwon
Jee-weon Jung
Hee-Soo Heo
You Jin Kim
Bong-Jin Lee
Joon Son Chung
21
13
0
07 Apr 2021
Speaker embeddings by modeling channel-wise correlations
Themos Stafylakis
Johan Rohdin
L. Burget
19
9
0
06 Apr 2021
Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings
Kiran Karra
A. McCree
6
2
0
06 Apr 2021
Binary Neural Network for Speaker Verification
Tinglong Zhu
Xiaoyi Qin
Ming Li
MQ
21
12
0
06 Apr 2021
End-to-End Speaker-Attributed ASR with Transformer
Naoyuki Kanda
Guoli Ye
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
29
47
0
05 Apr 2021
Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
29
19
0
05 Apr 2021
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances
Chang Zeng
Xin Wang
Erica Cooper
Xiaoxiao Miao
Junichi Yamagishi
44
20
0
04 Apr 2021
Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio
Jeffrey Tumminia
Amanda Kuznecov
Sophia Tsilerides
Ilana Weinstein
Brian McFee
M. Picheny
A. Kaufman
37
1
0
03 Apr 2021
Configurable Privacy-Preserving Automatic Speech Recognition
Ranya Aloufi
Hamed Haddadi
David E. Boyle
32
10
0
01 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
32
307
0
01 Apr 2021
Improved Meta-Learning Training for Speaker Verification
Yafeng Chen
Wu Guo
Bin Gu
26
7
0
29 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
26
13
0
27 Mar 2021
EfficientTDNN: Efficient Architecture Search for Speaker Recognition
Rui Wang
Zhihua Wei
Haoran Duan
S. Ji
Yang Long
Zhenhou Hong
25
17
0
25 Mar 2021
PriorityCut: Occlusion-guided Regularization for Warp-based Image Animation
Wai Ting Cheung
Gyeongsu Chae
VGen
24
1
0
22 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
16
28
0
19 Mar 2021
KoDF: A Large-scale Korean DeepFake Detection Dataset
Patrick Kwon
J. You
Gyuhyeon Nam
Sungwoo Park
Gyeongsu Chae
29
100
0
18 Mar 2021
Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning
Siyang Yuan
Pengyu Cheng
Ruiyi Zhang
Weituo Hao
Zhe Gan
Lawrence Carin
DRL
22
60
0
17 Mar 2021
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association
Peisong Wen
Qianqian Xu
Yangbangyan Jiang
Zhiyong Yang
Yuan He
Qingming Huang
CVBM
23
32
0
12 Mar 2021
Learning spectro-temporal representations of complex sounds with parameterized neural networks
Rachid Riad
Julien Karadayi
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
29
7
0
12 Mar 2021
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
38
175
0
11 Mar 2021