ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08612
  4. Cited By
VoxCeleb: a large-scale speaker identification dataset
v1v2 (latest)

VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown
Title
Fine-grained Early Frequency Attention for Deep Speaker Representation
  Learning
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
Amirhossein Hajavi
Ali Etemad
57
2
0
03 Sep 2020
Speaker Representation Learning using Global Context Guided Channel and
  Time-Frequency Transformations
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia
John H. L. Hansen
57
9
0
02 Sep 2020
Few Shot Text-Independent speaker verification using 3D-CNN
Few Shot Text-Independent speaker verification using 3D-CNN
Prateek Mishra
65
5
0
25 Aug 2020
asya: Mindful verbal communication using deep learning
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
111
1
0
20 Aug 2020
Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks
Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks
Guangming Yao
Yi Yuan
Tianjia Shao
Kun Zhou
3DHCVBM
72
56
0
18 Aug 2020
Adversarial Attack and Defense Strategies for Deep Speaker Recognition
  Systems
Adversarial Attack and Defense Strategies for Deep Speaker Recognition Systems
Arindam Jati
Chin-Cheng Hsu
Monisankha Pal
Raghuveer Peri
Wael AbdAlmageed
Shrikanth Narayanan
AAML
79
67
0
18 Aug 2020
Cross attentive pooling for speaker verification
Cross attentive pooling for speaker verification
Seong Min Kye
Yoohwan Kwon
Joon Son Chung
88
9
0
13 Aug 2020
Automatic Quality Assessment for Audio-Visual Verification Systems. The
  LOVe submission to NIST SRE Challenge 2019
Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe submission to NIST SRE Challenge 2019
G. Antipov
N. Gengembre
Olivier Le Blouch
Gaël Le Lan
CVBM
37
3
0
13 Aug 2020
Compact Speaker Embedding: lrx-vector
Compact Speaker Embedding: lrx-vector
Munir Georges
Jonathan Huang
Tobias Bocklet
37
11
0
11 Aug 2020
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based
  on Transformer Encoder
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer Encoder
Narla John Metilda Sagaya Mary
S. Umesh
Sandesh V Katta
51
31
0
11 Aug 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous
  Multi-Talker Recordings
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
74
49
0
11 Aug 2020
Neural PLDA Modeling for End-to-End Speaker Verification
Neural PLDA Modeling for End-to-End Speaker Verification
Shreyas Ramoji
Prashant Krishnan
Sriram Ganapathy
35
6
0
11 Aug 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings,
  Semi-Supervised Conversational Data, and Biased Loss
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
72
84
0
11 Aug 2020
Self-Supervised Learning of Audio-Visual Objects from Video
Self-Supervised Learning of Audio-Visual Objects from Video
Triantafyllos Afouras
Andrew Owens
Joon Son Chung
Andrew Zisserman
SSL
126
256
0
10 Aug 2020
Exploring the Use of an Unsupervised Autoregressive Model as a Shared
  Encoder for Text-Dependent Speaker Verification
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Vijay Ravi
Ruchao Fan
Amber Afshan
Huanhua Lu
Abeer Alwan
55
9
0
08 Aug 2020
Extrapolating false alarm rates in automatic speaker verification
Extrapolating false alarm rates in automatic speaker verification
A. Sholokhov
Tomi Kinnunen
Ville Vestman
Kong Aik Lee
42
1
0
08 Aug 2020
A Machine of Few Words -- Interactive Speaker Recognition with
  Reinforcement Learning
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
49
5
0
07 Aug 2020
Disentangled speaker and nuisance attribute embedding for robust speaker
  verification
Disentangled speaker and nuisance attribute embedding for robust speaker verification
Woohyun Kang
Sung Hwan Mun
Min Hyun Han
N. Kim
67
17
0
07 Aug 2020
Shouted Speech Compensation for Speaker Verification Robust to Vocal
  Effort Conditions
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
Santi Prieto
A. O. Giménez
Iván López-Espejo
EDUARDO LLEIDA SOLANO
20
1
0
06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
81
6
0
05 Aug 2020
Intra-class variation reduction of speaker representation in
  disentanglement framework
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
114
21
0
04 Aug 2020
Self-attention encoding and pooling for speaker recognition
Self-attention encoding and pooling for speaker recognition
Pooyan Safari
Miquel India
Javier Hernando
ViT
87
81
0
03 Aug 2020
A Comparative Re-Assessment of Feature Extractors for Deep Speaker
  Embeddings
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
42
9
0
30 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
130
58
0
29 Jul 2020
Families In Wild Multimedia: A Multimodal Database for Recognizing
  Kinship
Families In Wild Multimedia: A Multimodal Database for Recognizing Kinship
Joseph P. Robinson
Zaid Khan
Yu Yin
Ming Shao
Y. Fu
56
7
0
28 Jul 2020
Detecting and analysing spontaneous oral cancer speech in the wild
Detecting and analysing spontaneous oral cancer speech in the wild
B. Halpern
Rob van Son
M. V. D. Brekel
O. Scharenborg
38
9
0
28 Jul 2020
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and
  Normalization for End-to-End Speaker Verification System
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System
Soonshin Seo
Ji-Hwan Kim
53
0
0
27 Jul 2020
Double Multi-Head Attention for Speaker Verification
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
72
18
0
26 Jul 2020
UIAI System for Short-Duration Speaker Verification Challenge 2020
UIAI System for Short-Duration Speaker Verification Challenge 2020
Md. Sahidullah
A. K. Sarkar
Ville Vestman
Xuechen Liu
Romain Serizel
Tomi Kinnunen
Zheng-Hua Tan
Emmanuel Vincent
51
4
0
26 Jul 2020
Augmentation adversarial training for self-supervised speaker
  recognition
Augmentation adversarial training for self-supervised speaker recognition
Jaesung Huh
Hee-Soo Heo
Jingu Kang
Shinji Watanabe
Joon Son Chung
SSL
126
76
0
23 Jul 2020
Optimization of data-driven filterbank for automatic speaker
  verification
Optimization of data-driven filterbank for automatic speaker verification
S. K. Sarangi
Md. Sahidullah
G. Saha
42
38
0
21 Jul 2020
SkipConvNet: Skip Convolutional Neural Network for Speech
  Dereverberation using Optimally Smoothed Spectral Mapping
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping
Vinay Kothapally
Wei Xia
Shahram Ghorbani
John H. L. Hansen
Wei Xue
Jing-ling Huang
67
25
0
17 Jul 2020
Deep multi-metric learning for text-independent speaker verification
Deep multi-metric learning for text-independent speaker verification
Jiwei Xu
Xinggang Wang
Bin Feng
Wenyu Liu
96
26
0
17 Jul 2020
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype
  Mining and Language-Dependent Score Normalization
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score Normalization
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
67
24
0
15 Jul 2020
NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling
NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling
Shareef Babu Kalluri
Deepu Vijayasenan
Sriram Ganapathy
M. RageshRajan
Prashant Krishnan
44
18
0
12 Jul 2020
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker
  Verification: Fundamentals
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals
Tomi Kinnunen
Héctor Delgado
Nicholas W. D. Evans
Kong Aik Lee
Ville Vestman
...
Massimiliano Todisco
Xin Wang
Md. Sahidullah
Junichi Yamagishi
D. Reynolds
55
113
0
12 Jul 2020
Federated Learning of User Authentication Models
Federated Learning of User Authentication Models
H. Hosseini
Sungrack Yun
Hyunsin Park
Christos Louizos
Joseph B. Soriaga
Max Welling
FedML
56
13
0
09 Jul 2020
X-vectors: New Quantitative Biomarkers for Early Parkinson's Disease
  Detection from Speech
X-vectors: New Quantitative Biomarkers for Early Parkinson's Disease Detection from Speech
Laetitia Jeancolas
Dijana Petrovska – Delacretaz
G. Mangone
B. Benkelfat
J. Corvol
M. Vidailhet
Stéphane Lehéricy
Habib Benali
45
46
0
07 Jul 2020
ResNeXt and Res2Net Structures for Speaker Verification
ResNeXt and Res2Net Structures for Speaker Verification
Tianyan Zhou
Yong Zhao
Jian Wu
58
27
0
06 Jul 2020
Spot the conversation: speaker diarisation in the wild
Spot the conversation: speaker diarisation in the wild
Joon Son Chung
Jaesung Huh
Arsha Nagrani
Triantafyllos Afouras
Andrew Zisserman
VGen
103
150
0
02 Jul 2020
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker
  Recognition to Overcome Data Scarcity
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity
Jordan J. Bird
Diego Resende Faria
Anikó Ekárt
C. Premebida
Pedro P. S. Ayrosa
34
5
0
01 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification
  for Overlapped Speech of Any Number of Speakers
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
76
78
0
19 Jun 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6
  Challenge
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
147
9
0
14 Jun 2020
Investigating Robustness of Adversarial Samples Detection for Automatic
  Speaker Verification
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
Xu Li
Na Li
Jinghua Zhong
Xixin Wu
Xunying Liu
Jane Polak Scowcroft
Dong Yu
Helen Meng
AAML
93
37
0
11 Jun 2020
Speech Fusion to Face: Bridging the Gap Between Human's Vocal
  Characteristics and Facial Imaging
Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
Yeqi Bai
Tao Ma
Lipo Wang
Zhenjie Zhang
CVBM
27
9
0
10 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Baihan Lin
Xinxin Zhang
98
16
0
08 Jun 2020
Semi-Supervised Contrastive Learning with Generalized Contrastive Loss
  and Its Application to Speaker Recognition
Semi-Supervised Contrastive Learning with Generalized Contrastive Loss and Its Application to Speaker Recognition
Nakamasa Inoue
Keita Goto
SSL
144
54
0
08 Jun 2020
Graph2Speak: Improving Speaker Identification using Network Knowledge in
  Criminal Conversational Data
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data
Mael Fabien
Seyyed Saeed Sarfjoo
P. Motlícek
S. Madikeri
19
3
0
03 Jun 2020
Crossed-Time Delay Neural Network for Speaker Recognition
Crossed-Time Delay Neural Network for Speaker Recognition
Liang Chen
Yanchun Liang
Xiaohu Shi
You Zhou
Chunguo Wu
27
3
0
31 May 2020
Previous
123...171819...212223
Next