Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08612
Cited By
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,100 papers shown
Title
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
Ahmad Aloradi
Wolfgang Mack
Mohamed Elminshawi
Emanuel Habets
40
5
0
28 Jun 2022
Domain Agnostic Few-shot Learning for Speaker Verification
Seunghan Yang
Debasmit Das
Jang Hyun Cho
Hyoungwoo Park
Sungrack Yun
OOD
19
7
0
28 Jun 2022
Extended U-Net for Speaker Verification in Noisy Environments
Ju-ho Kim
Ju-Sung Heo
Hye-jin Shim
Ha-Jin Yu
19
16
0
27 Jun 2022
Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Yusheng Tian
Jingyu Li
Tan Lee
19
1
0
26 Jun 2022
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Florian Lux
Julia Koch
Ngoc Thang Vu
42
19
0
24 Jun 2022
Modeling Continuous Time Sequences with Intermittent Observations using Marked Temporal Point Processes
Vinayak Gupta
Srikanta J. Bedathur
Sourangshu Bhattacharya
A. De
AI4TS
43
13
0
23 Jun 2022
Towards End-to-End Private Automatic Speaker Recognition
Francisco Teixeira
A. Abad
Bhiksha Raj
Isabel Trancoso
40
10
0
23 Jun 2022
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion
Haibin Wu
Jiawen Kang
Lingwei Meng
Yang Zhang
Xixin Wu
Zhiyong Wu
Hung-yi Lee
Helen Meng
41
9
0
18 Jun 2022
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Danwei Cai
Zexin Cai
Ming Li
25
10
0
18 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
29
7
0
18 Jun 2022
HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware Inpainting
Chaeyeon Chung
Taewoo Kim
Hyelin Nam
Seunghwan Choi
Gyojung Gu
S. Park
Jaegul Choo
3DH
28
7
0
17 Jun 2022
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection
Joanna Hong
Minsu Kim
Y. Ro
CVBM
DiffM
36
8
0
15 Jun 2022
The Influence of Dataset Partitioning on Dysfluency Detection Systems
Sebastian P. Bayerl
Dominik Wagner
Elmar Nöth
Tobias Bocklet
Korbinian Riedhammer
44
20
0
07 Jun 2022
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Feng Wang
Jiashui Wang
AAML
30
37
0
07 Jun 2022
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification
Nan Zhang
Jianzong Wang
Zhenhou Hong
Chendong Zhao
Xiaoyang Qu
Jing Xiao
44
5
0
26 May 2022
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
38
34
0
22 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
354
0
21 May 2022
Dynamic Recognition of Speakers for Consent Management by Contrastive Embedding Replay
Arash Shahmansoori
U. Roedig
33
1
0
17 May 2022
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
30
5
0
17 May 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
Rongjie Huang
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
OODD
VLM
119
34
0
15 May 2022
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Bowen Shi
Abdel-rahman Mohamed
Wei-Ning Hsu
SSL
34
17
0
15 May 2022
The VoicePrivacy 2020 Challenge Evaluation Plan
N. Tomashenko
B. M. L. Srivastava
Xin Wang
Emmanuel Vincent
A. Nautsch
...
Nicholas W. D. Evans
J. Patino
J. Bonastre
Paul-Gauthier Noé
Massimiliano Todisco
46
43
0
14 May 2022
Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Joonas Kalda
Tanel Alumäe
23
3
0
14 May 2022
Gamified Speaker Comparison by Listening
Sandip Ghimire
Tomi Kinnunen
Rosa González Hautamäki
13
0
0
10 May 2022
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Chiyu Feng
Po-Chun Hsu
Hung-yi Lee
SSL
36
8
0
08 May 2022
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution
Liangbin Xie
Honglun Zhang
Chao Dong
Ying Shan
CVBM
11
75
0
06 May 2022
SVTS: Scalable Video-to-Speech Synthesis
Rodrigo Mira
A. Haliassos
Stavros Petridis
Björn W. Schuller
Maja Pantic
22
32
0
04 May 2022
Baselines and Protocols for Household Speaker Recognition
A. Sholokhov
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
33
4
0
30 Apr 2022
Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Boqing Zhu
Kele Xu
Changjian Wang
Zheng Qin
Tao Sun
Huaimin Wang
Yuxing Peng
SSL
42
17
0
28 Apr 2022
ATST: Audio Representation Learning with Teacher-Student Transformer
Xian Li
Xiaofei Li
ViT
23
20
0
26 Apr 2022
Back-ends Selection for Deep Speaker Embeddings
Zhuo Li
Runqiu Xiao
Zi-qiang Zhang
Zhenduo Zhao
Wenchao Wang
Pengyuan Zhang
19
0
0
25 Apr 2022
Unifying Cosine and PLDA Back-ends for Speaker Verification
Zhiyuan Peng
Xuanji He
Ke Ding
Tan Lee
Guanglu Wan
25
4
0
22 Apr 2022
Conditional Injective Flows for Bayesian Imaging
AmirEhsan Khorashadizadeh
K. Kothari
Leonardo Salsi
Ali Aghababaei Harandi
Maarten V. de Hoop
Ivan Dokmanić
MedIm
31
16
0
15 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
41
53
0
15 Apr 2022
The effect of speech pathology on automatic speaker verification -- a large-scale study
Soroosh Tayebi Arasteh
Tobias Weise
Maria Schuster
E. Noeth
Andreas Maier
Seung Hee Yang
32
8
0
13 Apr 2022
Structure-Aware Motion Transfer with Deformable Anchor Model
Jiale Tao
Biao Wang
Borun Xu
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
34
40
0
11 Apr 2022
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
Salah Zaiem
Titouan Parcollet
S. Essid
SSL
12
6
0
08 Apr 2022
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?
Qiongqiong Wang
Kong Aik Lee
Tianchi Liu
30
16
0
08 Apr 2022
Detecting Vocal Fatigue with Neural Embeddings
Sebastian P. Bayerl
Dominik Wagner
Ilja Baumann
Korbinian Riedhammer
Tobias Bocklet
26
11
0
07 Apr 2022
Design Guidelines for Inclusive Speaker Verification Evaluation Datasets
W. Hutiri
Lauriane Gorce
Aaron Yi Ding
24
7
0
05 Apr 2022
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification
Sung Hwan Mun
Jee-weon Jung
Min Hyun Han
N. Kim
50
21
0
03 Apr 2022
Improved Relation Networks for End-to-End Speaker Verification and Identification
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
27
3
0
31 Mar 2022
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset
Zehui Yang
Yifan Chen
Lei Luo
Runyan Yang
Lingxuan Ye
...
Yaohui Jin
Qingqing Zhang
Pengyuan Zhang
Lei Xie
Yonghong Yan
25
47
0
31 Mar 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
29
13
0
31 Mar 2022
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation
Kuan Po Huang
Yuanbin Fu
Yu Zhang
Hung-yi Lee
26
28
0
30 Mar 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map
Seong-Hu Kim
Hyeonuk Nam
Yong-Hwa Park
25
9
0
29 Mar 2022
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification
Shankhanil Ghosh
Chhanda Saha
Nagamani Molakathaala
6
2
0
29 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
39
131
0
29 Mar 2022
Robust Speaker Recognition with Transformers Using wav2vec 2.0
Sergey Novoselov
G. Lavrentyeva
Anastasia Avdeeva
V. Volokhov
Aleksei Gusev
ViT
21
18
0
28 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
34
10
0
28 Mar 2022
Previous
1
2
3
...
10
11
12
...
20
21
22
Next