1806.05622
VoxCeleb2: Deep Speaker Recognition
14 June 2018
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
Papers citing
"VoxCeleb2: Deep Speaker Recognition"
50 / 759 papers shown
Title
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
35
5
0
15 Sep 2023
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
Ju-Chieh Chou
Chung-Ming Chien
Karen Livescu
DiffM
21
4
0
14 Sep 2023
DiariST: Streaming Speech Translation with Speaker Diarization
Muqiao Yang
Naoyuki Kanda
Xiaofei Wang
Junkun Chen
Peidong Wang
Jian Xue
Jinyu Li
Takuya Yoshioka
32
6
0
14 Sep 2023
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems
Guangke Chen
Yedi Zhang
Fu Song
36
3
0
14 Sep 2023
Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification
Anith Selvakumar
H. Fashandi
VLM
29
0
0
13 Sep 2023
MASTERKEY: Practical Backdoor Attack Against Speaker Verification Systems
Hanqing Guo
Xun Chen
Junfeng Guo
Li Xiao
Qiben Yan
18
11
0
13 Sep 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
20
13
0
12 Sep 2023
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Xiaoxiao Miao
Xin Eric Wang
Erica Cooper
Junichi Yamagishi
Nicholas W. D. Evans
Massimiliano Todisco
J. Bonastre
Mickael Rouvier
17
5
0
12 Sep 2023
Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?
Xin Wang
Junichi Yamagishi
SyDa
58
23
0
12 Sep 2023
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus
Haoxu Wang
Fan Yu
Xian Shi
Yuezhang Wang
Shiliang Zhang
Ming Li
29
11
0
11 Sep 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
Yuan Gan
Zongxin Yang
Xihang Yue
Lingyun Sun
Yezhou Yang
25
57
0
10 Sep 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Xiuzhe Wu
Pengfei Hu
Yang Wu
Xiaoyang Lyu
Yan-Pei Cao
Ying Shan
Wenming Yang
Zhongqian Sun
Xiaojuan Qi
23
14
0
09 Sep 2023
Voice Morphing: Two Identities in One Voice
Sushant Pani
Anurag Chowdhury
Morgan Sandler
Arun Ross
21
1
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGen
DiffM
42
1
0
05 Sep 2023
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications
Shreyank N. Gowda
Dheeraj Pandey
Shashank Narayana Gowda
52
3
0
30 Aug 2023
EEG-Derived Voice Signature for Attended Speaker Detection
Hongxu Zhu
Siqi Cai
Yidi Jiang
Qiquan Zhang
Haizhou Li
24
0
0
28 Aug 2023
Unified and Dynamic Graph for Temporal Character Grouping in Long Videos
Xiujun Shu
Wei Wen
Liangsheng Xu
Ruizhi Qiao
Taian Guo
Hanjun Li
Bei Gan
Tianlin Li
Xing Sun
42
0
0
27 Aug 2023
Fairness and Privacy in Voice Biometrics: A Study of Gender Influences Using wav2vec 2.0
Oubaïda Chouchane
Michele Panariello
Chiara Galdi
Massimiliano Todisco
Nicholas W. D. Evans
27
2
0
27 Aug 2023
UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Yu Zheng
Yajun Zhang
Chuanying Niu
Yibin Zhan
Yanhua Long
Dongxing Xu
42
4
0
24 Aug 2023
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park
Joanna Hong
Minsu Kim
Y. Ro
37
4
0
23 Aug 2023
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay
Saksham Suri
R. Gadde
Abhinav Shrivastava
DiffM
46
20
0
18 Aug 2023
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
Minsu Kim
Jeong Hun Yeo
J. Choi
Y. Ro
34
16
0
18 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
27
5
0
17 Aug 2023
Graph Neural Network Backend for Speaker Recognition
Liang He
Rui Li
Mengqi Niu
16
0
0
17 Aug 2023
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Ze Li
Yuke Lin
Xiaoyi Qin
Ning Jiang
Guoqing Zhao
Ming Li
52
6
0
17 Aug 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023
Mengjie Du
Xiang Fang
Jie Li
31
0
0
16 Aug 2023
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding
J. Choi
Joanna Hong
Y. Ro
DiffM
29
19
0
15 Aug 2023
VoxBlink: A Large Scale Speaker Verification Dataset on Camera
Yuke Lin
Xiaoyi Qin
Guoqing Zhao
Ming Cheng
Ning Jiang
Haiying Wu
Ming Li
41
13
0
14 Aug 2023
Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping
Y. A. D. Djilali
Sanath Narayan
Haithem Boussaid
Ebtesam Almazrouei
Merouane Debbah
34
10
0
11 Aug 2023
Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space
Haoyu Wang
Haozhe Wu
Junliang Xing
Jia Jia
3DH
25
4
0
11 Aug 2023
Speaker Recognition Using Isomorphic Graph Attention Network Based Pooling on Self-Supervised Representation
Zirui Ge
Xinzhou Xu
Haiyan Guo
Tingting Wang
Zhen Yang
SSL
19
1
0
09 Aug 2023
Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
Kaijian Liu
Shixiang Tang
Ziyue Li
Zhishuai Li
Lei Bai
Feng Zhu
Rui Zhao
3DH
16
3
0
01 Aug 2023
Audio-visual video-to-speech synthesis with synthesized input audio
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
DiffM
38
1
0
31 Jul 2023
On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer
Md. Asif Jalal
Pablo Peso Parada
Jisi Zhang
Karthikeyan P. Saravanan
Mete Ozay
Myoungji Han
Jung In Lee
Seokyeong Jung
28
1
0
25 Jul 2023
HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM
31
35
0
20 Jul 2023
Leveraging Visemes for Better Visual Speech Representation and Lip Reading
J. Peymanfard
Vahid Saeedi
Mohammad Reza Mohammadi
Hossein Zeinali
N. Mozayani
39
2
0
19 Jul 2023
Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Fa-Ting Hong
Dan Xu
CVBM
25
31
0
19 Jul 2023
Exploring Binary Classification Loss For Speaker Verification
Bing Han
Zhengyang Chen
Y. Qian
CVBM
24
10
0
17 Jul 2023
Facial Reenactment Through a Personalized Generator
Ariel Elazary
Yotam Nitzan
Daniel Cohen-Or
35
0
0
12 Jul 2023
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Adriana Fernandez-Lopez
Honglie Chen
Pingchuan Ma
A. Haliassos
Stavros Petridis
M. Pantic
VLM
33
7
0
10 Jul 2023
NOFA: NeRF-based One-shot Facial Avatar Reconstruction
Wang-Wang Yu
Yanbo Fan
Yong Zhang
Xuanxia Wang
Fei Yin
...
Yan-Pei Cao
Ying Shan
Yang Wu
Zhongqian Sun
Baoyuan Wu
3DH
37
33
0
07 Jul 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
37
17
0
05 Jul 2023
Interactive Conversational Head Generation
Mohan Zhou
Yalong Bai
Wei Zhang
Tingjun Yao
Tiejun Zhao
27
3
0
05 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
23
1
0
04 Jul 2023
Beyond Neural-on-Neural Approaches to Speaker Gender Protection
L. V. Bemmel
Zhuoran Liu
Nik Vaessen
Martha Larson
AAML
24
2
0
30 Jun 2023
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units
Junchen Lu
Berrak Sisman
Mingyang Zhang
Haizhou Li
24
4
0
29 Jun 2023
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data
Heeseung Kim
Sungwon Kim
Ji-Ran Yeom
Sung-Wan Yoon
DiffM
21
21
0
28 Jun 2023
Long-term Conversation Analysis: Exploring Utility and Privacy
F. Nespoli
Jule Pohlhausen
Patrick A. Naylor
Joerg Bitzer
21
0
0
28 Jun 2023
Two-Stage Voice Anonymization for Enhanced Privacy
F. Nespoli
Daniel Barreda
Joerg Bitzer
Patrick A. Naylor
24
3
0
28 Jun 2023
Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
J. Choi
Minsu Kim
Se Jin Park
Y. Ro
CVBM
16
3
0
28 Jun 2023