ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.05622
  4. Cited By
VoxCeleb2: Deep Speaker Recognition

VoxCeleb2: Deep Speaker Recognition

14 June 2018
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
ArXivPDFHTML

Papers citing "VoxCeleb2: Deep Speaker Recognition"

50 / 759 papers shown
Title
Learning Facial Representations from the Cycle-consistency of Face
Learning Facial Representations from the Cycle-consistency of Face
Jia-Ren Chang
Yonghao Chen
W. Chiu
CVBM
22
29
0
07 Aug 2021
UniCon: Unified Context Network for Robust Active Speaker Detection
UniCon: Unified Context Network for Robust Active Speaker Detection
Yuanhang Zhang
Susan Liang
Shuang Yang
Xiao-Chang Liu
Zhongqin Wu
Shiguang Shan
Xilin Chen
CVBM
29
36
0
05 Aug 2021
Proposal-based Few-shot Sound Event Detection for Speech and
  Environmental Sounds with Perceivers
Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers
Piper Wolters
Logan Sizemore
Chris Daw
Brian Hutchinson
Lauren A. Phillips
29
11
0
28 Jul 2021
Use of speaker recognition approaches for learning and evaluating
  embedding representations of musical instrument sounds
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds
Xuan Shi
Erica Cooper
Junichi Yamagishi
24
7
0
24 Jul 2021
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Hongning Zhu
Kong Aik Lee
Haizhou Li
33
15
0
14 Jul 2021
Speech2Video: Cross-Modal Distillation for Speech to Video Generation
Speech2Video: Cross-Modal Distillation for Speech to Video Generation
Shijing Si
Jianzong Wang
Xiaoyang Qu
Ning Cheng
Wenqi Wei
Xinghua Zhu
Jing Xiao
VGen
16
15
0
10 Jul 2021
A Comparative Study of Modular and Joint Approaches for
  Speaker-Attributed ASR on Monaural Long-Form Audio
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
19
14
0
06 Jul 2021
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Xi Zhang
Xiaolin Wu
CVBM
19
13
0
05 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and
  Channel Information? A Layer-wise and Neuron-level Analysis
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
36
12
0
01 Jul 2021
Adversarial Sample Detection for Speaker Verification by Neural Vocoders
Adversarial Sample Detection for Speaker Verification by Neural Vocoders
Haibin Wu
Po-Chun Hsu
Ji Gao
Shanshan Zhang
Shen Huang
Jian Kang
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
25
20
0
01 Jul 2021
Adaptive Margin Circle Loss for Speaker Verification
Adaptive Margin Circle Loss for Speaker Verification
Runqiu Xiao
17
11
0
15 Jun 2021
Voting for the right answer: Adversarial defense for speaker
  verification
Voting for the right answer: Adversarial defense for speaker verification
Haibin Wu
Yang Zhang
Zhiyong Wu
Dong Wang
Hung-yi Lee
AAML
25
25
0
15 Jun 2021
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker
  Detection in the Wild
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
Okan Kopuklu
Maja Taseska
Gerhard Rigoll
3DV
19
45
0
07 Jun 2021
VidFace: A Full-Transformer Solver for Video FaceHallucination with
  Unaligned Tiny Snapshots
VidFace: A Full-Transformer Solver for Video FaceHallucination with Unaligned Tiny Snapshots
Y. Gan
Yawei Luo
Xin Yu
Bang Zhang
Yi Yang
ViT
CVBM
25
3
0
31 May 2021
Voice activity detection in the wild: A data-driven approach using
  teacher-student training
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
11
32
0
10 May 2021
Learned Spatial Representations for Few-shot Talking-Head Synthesis
Learned Spatial Representations for Few-shot Talking-Head Synthesis
Moustafa Meshry
Saksham Suri
Larry S. Davis
Abhinav Shrivastava
21
42
0
29 Apr 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized
  Audio-Visual Representation
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
28
360
0
22 Apr 2021
Everything's Talkin': Pareidolia Face Reenactment
Everything's Talkin': Pareidolia Face Reenactment
Linsen Song
Wayne Wu
Chaoyou Fu
Chao Qian
Chen Change Loy
Ran He
CVBM
32
12
0
07 Apr 2021
EasyCall corpus: a dysarthric speech dataset
EasyCall corpus: a dysarthric speech dataset
Rosanna Turrisi
Arianna Braccia
M. Emanuele
Simone Giulietti
M. Pugliatti
M. Sensi
Luciano Fadiga
Leonardo Badino
17
22
0
06 Apr 2021
Attention Back-end for Automatic Speaker Verification with Multiple
  Enrollment Utterances
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances
Chang Zeng
Xin Wang
Erica Cooper
Xiaoxiao Miao
Junichi Yamagishi
38
20
0
04 Apr 2021
Improved Meta-Learning Training for Speaker Verification
Improved Meta-Learning Training for Speaker Verification
Yafeng Chen
Wu Guo
Bin Gu
18
7
0
29 Mar 2021
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
Yinan He
Bei Gan
Siyu Chen
Yichun Zhou
Guojun Yin
Luchuan Song
Lu Sheng
Jing Shao
Ziwei Liu
AAML
26
130
0
09 Mar 2021
Am I a Real or Fake Celebrity? Measuring Commercial Face Recognition Web
  APIs under Deepfake Impersonation Attack
Am I a Real or Fake Celebrity? Measuring Commercial Face Recognition Web APIs under Deepfake Impersonation Attack
Shahroz Tariq
Sowon Jeon
Simon S. Woo
32
25
0
01 Mar 2021
Adversarial defense for automatic speaker verification by cascaded
  self-supervised learning models
Adversarial defense for automatic speaker verification by cascaded self-supervised learning models
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
29
40
0
14 Feb 2021
Hand-Based Person Identification using Global and Part-Aware Deep
  Feature Representation Learning
Hand-Based Person Identification using Global and Part-Aware Deep Feature Representation Learning
N. L. Baisa
Bryan M. Williams
Hossein Rahmani
Plamen Angelov
Sue Black
42
15
0
13 Jan 2021
MAAS: Multi-modal Assignation for Active Speaker Detection
MAAS: Multi-modal Assignation for Active Speaker Detection
Juan Carlos León Alcázar
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
62
51
0
11 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker
  diarization: theory, implementation and analysis on standard tasks
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
216
199
0
29 Dec 2020
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
22
5
0
14 Dec 2020
Self-supervised Text-independent Speaker Verification using Prototypical
  Momentum Contrastive Learning
Self-supervised Text-independent Speaker Verification using Prototypical Momentum Contrastive Learning
Wei Xia
Chunlei Zhang
Chao Weng
Meng Yu
Dong Yu
SSL
20
77
0
13 Dec 2020
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for
  Robust Speaker Representation Learning
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning
Mufan Sang
Wei Xia
John H. L. Hansen
OOD
DRL
8
23
0
12 Dec 2020
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
Arsha Nagrani
Joon Son Chung
Jaesung Huh
Andrew Brown
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
15
74
0
12 Dec 2020
Monocular Real-time Full Body Capture with Inter-part Correlations
Monocular Real-time Full Body Capture with Inter-part Correlations
Yuxiao Zhou
Marc Habermann
I. Habibie
A. Tewari
Christian Theobalt
F. Xu
CVBM
3DH
46
62
0
11 Dec 2020
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech
  Synthesis
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
Anurag Chowdhury
Arun Ross
Prabu David
8
5
0
09 Dec 2020
Adversarial Disentanglement of Speaker Representation for
  Attribute-Driven Privacy Preservation
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation
Paul-Gauthier Noé
Mohammad MohammadAmini
D. Matrouf
Titouan Parcollet
Andreas Nautsch
J. Bonastre
24
27
0
08 Dec 2020
Learning an Animatable Detailed 3D Face Model from In-The-Wild Images
Learning an Animatable Detailed 3D Face Model from In-The-Wild Images
Yao Feng
Haiwen Feng
Michael J. Black
Timo Bolkart
CVBM
3DH
41
568
0
07 Dec 2020
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
37
469
0
30 Nov 2020
Empowering Things with Intelligence: A Survey of the Progress,
  Challenges, and Opportunities in Artificial Intelligence of Things
Empowering Things with Intelligence: A Survey of the Progress, Challenges, and Opportunities in Artificial Intelligence of Things
Jing Zhang
Dacheng Tao
31
462
0
17 Nov 2020
Facial Keypoint Sequence Generation from Audio
Facial Keypoint Sequence Generation from Audio
Prateek Manocha
Prithwijit Guha
3DH
VGen
23
0
0
02 Nov 2020
Leveraging speaker attribute information using multi task learning for
  speaker verification and diarization
Leveraging speaker attribute information using multi task learning for speaker verification and diarization
Chau Luu
P. Bell
Steve Renals
22
8
0
27 Oct 2020
Contrastive Unsupervised Learning for Audio Fingerprinting
Contrastive Unsupervised Learning for Audio Fingerprinting
Zhesong Yu
Xingjian Du
Bilei Zhu
Zejun Ma
28
9
0
26 Oct 2020
Combination of Deep Speaker Embeddings for Diarisation
Combination of Deep Speaker Embeddings for Diarisation
Guangzhi Sun
Chao Zhang
P. Woodland
17
20
0
22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker
  Diarisation Challenge
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
6
1
0
22 Oct 2020
Graph Attention Networks for Speaker Verification
Graph Attention Networks for Speaker Verification
Jee-weon Jung
Hee-Soo Heo
Ha-Jin Yu
Joon Son Chung
17
26
0
22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via
  Contrastive Equilibrium Learning
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
41
20
0
22 Oct 2020
Multi-task Metric Learning for Text-independent Speaker Verification
Yafeng Chen
Wu Guo
Jing Shi
Jiajun Qi
Tan Liu
110
0
0
21 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge
  2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
11
132
0
29 Sep 2020
When Automatic Voice Disguise Meets Automatic Speaker Verification
When Automatic Voice Disguise Meets Automatic Speaker Verification
Linlin Zheng
Jiakang Li
Meng Sun
Xiongwei Zhang
T. Zheng
16
17
0
15 Sep 2020
Self-Supervised Learning of Audio-Visual Objects from Video
Self-Supervised Learning of Audio-Visual Objects from Video
Triantafyllos Afouras
Andrew Owens
Joon Son Chung
Andrew Zisserman
SSL
19
252
0
10 Aug 2020
Double Multi-Head Attention for Speaker Verification
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020
Deep multi-metric learning for text-independent speaker verification
Deep multi-metric learning for text-independent speaker verification
Jiwei Xu
Xinggang Wang
Bin Feng
Wenyu Liu
41
25
0
17 Jul 2020
Previous
123...13141516
Next