VoxCeleb2: Deep Speaker Recognition

14 June 2018
Joon Son Chung
Arsha Nagrani
Andrew Zisserman

Papers citing "VoxCeleb2: Deep Speaker Recognition"

50 / 762 papers shown
Disentangling Style and Speaker Attributes for TTS Style Transfer
Xiaochun An
Frank Soong
Lei Xie
56
18
0
24 Jan 2022
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training
J. Yang
Lei He
28
11
0
20 Jan 2022
Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
A. Haliassos
Rodrigo Mira
Stavros Petridis
M. Pantic
CVBM
40
126
0
18 Jan 2022
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
A. Brown
Jaesung Huh
Joon Son Chung
Arsha Nagrani
Daniel Garcia-Romero
Andrew Zisserman
31
40
0
12 Jan 2022
Winning solutions and post-challenge analyses of the ChaLearn AutoDL challenge 2019
Zhengying Liu
Adrien Pavao
Zhen Xu
Sergio Escalera
Fabio Ferreira
...
Peng Wang
Chenglin Wu
Youcheng Xiong
Arber Zela
Yang Zhang
AAML
37
26
0
11 Jan 2022
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
Hao Jiang
Calvin Murdock
V. Ithapu
EgoV
27
40
0
06 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
36
90
0
05 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
29
48
0
27 Dec 2021
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
T. Zhao
Tao Mei
EGVM
27
44
0
27 Dec 2021
Graph attentive feature aggregation for text-independent speaker verification
Hye-jin Shim
Ju-Sung Heo
Jae-han Park
Gareth Lee
Ha-Jin Yu
35
16
0
23 Dec 2021
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification
Sung Hwan Mun
Min Hyun Han
Dongjune Lee
Jihwan Kim
N. Kim
SSL
41
3
0
16 Dec 2021
Textless Speech-to-Speech Translation on Real Data
Ann Lee
Hongyu Gong
Paul-Ambroise Duquenne
Holger Schwenk
Peng-Jen Chen
...
Sravya Popuri
Yossi Adi
J. Pino
Jiatao Gu
Wei-Ning Hsu
28
142
0
15 Dec 2021
Explore Long-Range Context feature for Speaker Verification
Zhuo Li
33
6
0
14 Dec 2021
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Yicheng Hsu
Yonghan Lee
M. Bai
19
10
0
10 Dec 2021
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
38
23
0
09 Dec 2021
Self-Supervised Speaker Verification with Simple Siamese Network and Self-Supervised Regularization
Mufan Sang
Haoqi Li
F. Liu
Andrew O. Arnold
Li Wan
SSL
16
38
0
08 Dec 2021
How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey
Zahra Khanjani
Gabrielle Watson
V. P. Janeja
25
25
0
28 Nov 2021
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment
Kangyeol Kim
S. Park
Jaeseong Lee
Sunghyo Chung
Junsoo Lee
Jaegul Choo
3DH
CVBM
27
13
0
15 Nov 2021
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Ladislav Mošner
Oldrich Plchot
L. Burget
J. Černocký
32
7
0
11 Nov 2021
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech
Sung-Feng Huang
Chyi-Jiunn Lin
Da-Rong Liu
Yi-Chen Chen
Hung-yi Lee
16
56
0
07 Nov 2021
Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Victoria Mingote
A. Miguel
A. O. Giménez
Eduardo Lleida Solano
39
10
0
06 Nov 2021
Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis
Haozhe Wu
Jia Jia
Haoyu Wang
Yishun Dou
Chao Duan
Qingshan Deng
CVBM
11
73
0
30 Oct 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Michael Zeng
Xiangzhan Yu
Furu Wei
SSL
113
1,704
0
26 Oct 2021
Optimizing Multi-Taper Features for Deep Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
18
1
0
21 Oct 2021
Talking Head Generation with Audio and Speech Related Facial Action Units
Sen Chen
Zhilei Liu
Jiaxing Liu
Zhengxiang Yan
Longbiao Wang
CVBM
21
14
0
19 Oct 2021
Rep Works in Speaker Verification
Yufeng Ma
Miao Zhao
Yiwei Ding
Yu Zheng
Min Liu
Minqiang Xu
32
8
0
19 Oct 2021
AE-StyleGAN: Improved Training of Style-Based Auto-Encoders
Ligong Han
S. Musunuri
Martin Renqiang Min
Ruijiang Gao
Yu Tian
Dimitris N. Metaxas
DRL
36
14
0
17 Oct 2021
Sub-word Level Lip Reading With Visual Attention
Prajwal K R
Triantafyllos Afouras
Andrew Zisserman
12
92
0
14 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
250
1,024
0
13 Oct 2021
Duality Temporal-channel-frequency Attention Enhanced Speaker Representation Learning
Li Zhang
Qing Wang
Lei Xie
42
17
0
13 Oct 2021
Simple Attention Module based Speaker Verification with Iterative noisy label detection
Xiaoyi Qin
Na Li
Chao Weng
Dan Su
Ming Li
NoLa
62
49
0
13 Oct 2021
A bridge between features and evidence for binary attribute-driven perfect privacy
Paul-Gauthier Noé
A. Nautsch
D. Matrouf
Pierre-Michel Bousquet
J. Bonastre
44
6
0
12 Oct 2021
Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Zhengyang Chen
Sanyuan Chen
Yu-Huan Wu
Yao Qian
Chengyi Wang
Shujie Liu
Y. Qian
Michael Zeng
SSL
26
124
0
12 Oct 2021
Poformer: A simple pooling transformer for speaker verification
Yufeng Ma
Yiwei Ding
Miao Zhao
Yu Zheng
Min Liu
Minqiang Xu
ViT
21
2
0
10 Oct 2021
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification
Qingjian Lin
Lin Yang
Xuyang Wang
Xiaoyi Qin
Junjie Wang
Ming Li
27
21
0
09 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
Ge Zhu
Frank Cwitkowitz
Z. Duan
22
2
0
08 Oct 2021
Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity
You Jin Kim
Hee-Soo Heo
Jee-weon Jung
Youngki Kwon
Bong-Jin Lee
Joon Son Chung
29
3
0
07 Oct 2021
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Jen-Hao Rick Chang
A. Shrivastava
H. Koppula
Xiaoshuai Zhang
Oncel Tuzel
DiffM
51
16
0
06 Oct 2021
Fine-tuning wav2vec2 for speaker recognition
Nik Vaessen
David A. van Leeuwen
39
107
0
30 Sep 2021
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
34
41
0
30 Sep 2021
NimbRo Avatar: Interactive Immersive Telepresence with Force-Feedback Telemanipulation
Max Schwarz
C. Lenz
Andre Rochow
M. Schreiber
Sven Behnke
22
55
0
28 Sep 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
32
6
0
24 Sep 2021
Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization
Prachi Singh
Sriram Ganapathy
SSL
29
7
0
14 Sep 2021
TEASEL: A Transformer-Based Speech-Prefixed Language Model
Mehdi Arjmand
M. Dousti
H. Moradi
33
18
0
12 Sep 2021
Evaluation of an Audio-Video Multimodal Deepfake Dataset using Unimodal and Multimodal Detectors
Hasam Khalid
Minhan Kim
Shahroz Tariq
Simon S. Woo
23
82
0
07 Sep 2021
The SpeakIn System for VoxCeleb Speaker Recognition Challange 2021
Miao Zhao
Yufeng Ma
Min Liu
Minqiang Xu
33
59
0
05 Sep 2021
The VoicePrivacy 2020 Challenge: Results and findings
N. Tomashenko
Xin Wang
Emmanuel Vincent
J. Patino
B. M. L. Srivastava
...
Benjamin O’Brien
Anais Chanclu
J. Bonastre
Massimiliano Todisco
Mohamed Maouche
25
105
0
01 Sep 2021
Sparse to Dense Motion Transfer for Face Image Animation
Ruiqi Zhao
Tianyi Wu
Guodong Guo
3DH
CVBM
27
27
0
01 Sep 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
44
20
0
17 Aug 2021
Xi-Vector Embedding for Speaker Recognition
Kong Aik Lee
Qiongqiong Wang
Takafumi Koshinaka
BDL
11
27
0
12 Aug 2021