VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
arXiv:1706.08612

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,100 papers shown
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
Ahmad Aloradi
Wolfgang Mack
Mohamed Elminshawi
Emanuel Habets
40
5
0
28 Jun 2022
Domain Agnostic Few-shot Learning for Speaker Verification
Seunghan Yang
Debasmit Das
Jang Hyun Cho
Hyoungwoo Park
Sungrack Yun
OOD
19
7
0
28 Jun 2022
Extended U-Net for Speaker Verification in Noisy Environments
Ju-ho Kim
Ju-Sung Heo
Hye-jin Shim
Ha-Jin Yu
19
16
0
27 Jun 2022
Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Yusheng Tian
Jingyu Li
Tan Lee
19
1
0
26 Jun 2022
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Florian Lux
Julia Koch
Ngoc Thang Vu
42
19
0
24 Jun 2022
Modeling Continuous Time Sequences with Intermittent Observations using Marked Temporal Point Processes
Vinayak Gupta
Srikanta J. Bedathur
Sourangshu Bhattacharya
A. De
AI4TS
43
13
0
23 Jun 2022
Towards End-to-End Private Automatic Speaker Recognition
Francisco Teixeira
A. Abad
Bhiksha Raj
Isabel Trancoso
40
10
0
23 Jun 2022
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion
Haibin Wu
Jiawen Kang
Lingwei Meng
Yang Zhang
Xixin Wu
Zhiyong Wu
Hung-yi Lee
Helen Meng
41
9
0
18 Jun 2022
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Danwei Cai
Zexin Cai
Ming Li
25
10
0
18 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
29
7
0
18 Jun 2022
HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware Inpainting
Chaeyeon Chung
Taewoo Kim
Hyelin Nam
Seunghwan Choi
Gyojung Gu
S. Park
Jaegul Choo
3DH
28
7
0
17 Jun 2022
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection
Joanna Hong
Minsu Kim
Y. Ro
CVBM
DiffM
36
8
0
15 Jun 2022
The Influence of Dataset Partitioning on Dysfluency Detection Systems
Sebastian P. Bayerl
Dominik Wagner
Elmar Nöth
Tobias Bocklet
Korbinian Riedhammer
44
20
0
07 Jun 2022
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Feng Wang
Jiashui Wang
AAML
30
37
0
07 Jun 2022
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification
Nan Zhang
Jianzong Wang
Zhenhou Hong
Chendong Zhao
Xiaoyang Qu
Jing Xiao
44
5
0
26 May 2022
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
38
34
0
22 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
354
0
21 May 2022
Dynamic Recognition of Speakers for Consent Management by Contrastive Embedding Replay
Arash Shahmansoori
U. Roedig
33
1
0
17 May 2022
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
30
5
0
17 May 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
Rongjie Huang
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
OODD
VLM
119
34
0
15 May 2022
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Bowen Shi
Abdel-rahman Mohamed
Wei-Ning Hsu
SSL
34
17
0
15 May 2022
The VoicePrivacy 2020 Challenge Evaluation Plan
N. Tomashenko
B. M. L. Srivastava
Xin Wang
Emmanuel Vincent
A. Nautsch
...
Nicholas W. D. Evans
J. Patino
J. Bonastre
Paul-Gauthier Noé
Massimiliano Todisco
46
43
0
14 May 2022
Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Joonas Kalda
Tanel Alumäe
23
3
0
14 May 2022
Gamified Speaker Comparison by Listening
Sandip Ghimire
Tomi Kinnunen
Rosa González Hautamäki
13
0
0
10 May 2022
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Chiyu Feng
Po-Chun Hsu
Hung-yi Lee
SSL
36
8
0
08 May 2022
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution
Liangbin Xie
Honglun Zhang
Chao Dong
Ying Shan
CVBM
11
75
0
06 May 2022
SVTS: Scalable Video-to-Speech Synthesis
Rodrigo Mira
A. Haliassos
Stavros Petridis
Björn W. Schuller
Maja Pantic
22
32
0
04 May 2022
Baselines and Protocols for Household Speaker Recognition
A. Sholokhov
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
33
4
0
30 Apr 2022
Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Boqing Zhu
Kele Xu
Changjian Wang
Zheng Qin
Tao Sun
Huaimin Wang
Yuxing Peng
SSL
42
17
0
28 Apr 2022
ATST: Audio Representation Learning with Teacher-Student Transformer
Xian Li
Xiaofei Li
ViT
23
20
0
26 Apr 2022
Back-ends Selection for Deep Speaker Embeddings
Zhuo Li
Runqiu Xiao
Zi-qiang Zhang
Zhenduo Zhao
Wenchao Wang
Pengyuan Zhang
19
0
0
25 Apr 2022
Unifying Cosine and PLDA Back-ends for Speaker Verification
Zhiyuan Peng
Xuanji He
Ke Ding
Tan Lee
Guanglu Wan
25
4
0
22 Apr 2022
Conditional Injective Flows for Bayesian Imaging
AmirEhsan Khorashadizadeh
K. Kothari
Leonardo Salsi
Ali Aghababaei Harandi
Maarten V. de Hoop
Ivan Dokmanić
MedIm
31
16
0
15 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
41
53
0
15 Apr 2022
The effect of speech pathology on automatic speaker verification -- a large-scale study
Soroosh Tayebi Arasteh
Tobias Weise
Maria Schuster
E. Noeth
Andreas Maier
Seung Hee Yang
32
8
0
13 Apr 2022
Structure-Aware Motion Transfer with Deformable Anchor Model
Jiale Tao
Biao Wang
Borun Xu
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
34
40
0
11 Apr 2022
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
Salah Zaiem
Titouan Parcollet
S. Essid
SSL
12
6
0
08 Apr 2022
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?
Qiongqiong Wang
Kong Aik Lee
Tianchi Liu
30
16
0
08 Apr 2022
Detecting Vocal Fatigue with Neural Embeddings
Sebastian P. Bayerl
Dominik Wagner
Ilja Baumann
Korbinian Riedhammer
Tobias Bocklet
26
11
0
07 Apr 2022
Design Guidelines for Inclusive Speaker Verification Evaluation Datasets
W. Hutiri
Lauriane Gorce
Aaron Yi Ding
24
7
0
05 Apr 2022
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification
Sung Hwan Mun
Jee-weon Jung
Min Hyun Han
N. Kim
50
21
0
03 Apr 2022
Improved Relation Networks for End-to-End Speaker Verification and Identification
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
27
3
0
31 Mar 2022
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset
Zehui Yang
Yifan Chen
Lei Luo
Runyan Yang
Lingxuan Ye
...
Yaohui Jin
Qingqing Zhang
Pengyuan Zhang
Lei Xie
Yonghong Yan
25
47
0
31 Mar 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
29
13
0
31 Mar 2022
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation
Kuan Po Huang
Yuanbin Fu
Yu Zhang
Hung-yi Lee
26
28
0
30 Mar 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map
Seong-Hu Kim
Hyeonuk Nam
Yong-Hwa Park
25
9
0
29 Mar 2022
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification
Shankhanil Ghosh
Chhanda Saha
Nagamani Molakathaala
6
2
0
29 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
39
131
0
29 Mar 2022
Robust Speaker Recognition with Transformers Using wav2vec 2.0
Sergey Novoselov
G. Lavrentyeva
Anastasia Avdeeva
V. Volokhov
Aleksei Gusev
ViT
21
18
0
28 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
34
10
0
28 Mar 2022