Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08612
Cited By
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,100 papers shown
Title
Experimenting with Additive Margins for Contrastive Self-Supervised Speaker Verification
Theo Lepage
Reda Dehak
SSL
21
3
0
06 Jun 2023
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks
Jianrong Wang
Yaxin Zhao
Li Liu
Tian-Shun Xu
Qi Li
Sen Li
24
9
0
06 Jun 2023
BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields
AKM SHAHARIAR AZAD RABBY
Chengcui Zhang
36
27
0
05 Jun 2023
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information
Jianrong Wang
Yuchen Huo
Li Liu
Tianyi Xu
Qi Li
Sen Li
25
3
0
04 Jun 2023
ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Bo Wang
Damien Ronssin
Milos Cernak
BDL
38
3
0
01 Jun 2023
Exploration on HuBERT with Multiple Resolutions
Jiatong Shi
Yun Tang
Hirofumi Inaguma
Hongyu Gong
J. Pino
Shinji Watanabe
43
9
0
01 Jun 2023
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
19
0
0
01 Jun 2023
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Salah Zaiem
Youcef Kemiche
Titouan Parcollet
S. Essid
Mirco Ravanelli
SSL
19
23
0
01 Jun 2023
MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
Yu-Hsiang Wang
Huan Chen
Kai-Wei Chang
Winston H. Hsu
Hung-yi Lee
27
6
0
30 May 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
52
9
0
29 May 2023
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) Representations
Calum Heggan
Timothy M. Hospedales
S. Budgett
Mehrdad Yaghoobi
SSL
35
5
0
29 May 2023
One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Ju-Sung Heo
Chan-yeong Lim
Ju-ho Kim
Hyun-Seo Shin
Ha-Jin Yu
29
2
0
27 May 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Wangyou Zhang
Y. Qian
38
10
0
25 May 2023
Visualizing data augmentation in deep speaker recognition
Pengqi Li
Lantian Li
A. Hamdulla
D. Wang
28
3
0
25 May 2023
CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition
Lantian Li
Xiaolou Li
Haoyu Jiang
Cheng Chen
Ruihai Hou
Dong Wang
SLR
19
5
0
25 May 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
24
0
0
25 May 2023
P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification
Xiyuan Wang
Fangyuan Wang
Bo Xu
Liang Xu
Jing Xiao
21
6
0
24 May 2023
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications
Vamsikrishna Chemudupati
Marzieh S. Tahaei
Heitor R. Guimarães
Arthur Pimentel
Anderson R. Avila
Mehdi Rezagholizadeh
Boxing Chen
Tiago H. Falk
SSL
69
7
0
23 May 2023
QFA2SR: Query-Free Adversarial Transfer Attacks to Speaker Recognition Systems
Guangke Chen
Yedi Zhang
Zhe Zhao
Fu Song
AAML
46
11
0
23 May 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
54
4
0
23 May 2023
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification
Yafeng Chen
Siqi Zheng
Haibo Wang
Luyao Cheng
Qian Chen
Jiajun Qi
29
38
0
22 May 2023
Progressive Sub-Graph Clustering Algorithm for Semi-Supervised Domain Adaptation Speaker Verification
Zhuo Li
Jingze Lu
Z. Zhao
Wenchao Wang
Pengyuan Zhang
32
1
0
22 May 2023
The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Zhenduo Zhao
Zhuo Li
Wenchao Wang
Pengyuan Zhang
25
4
0
22 May 2023
LPMM: Intuitive Pose Control for Neural Talking-Head Model via Landmark-Parameter Morphable Model
K. Lee
Patrick Kwon
Myung Ki Lee
Namhyuk Ahn
Junsoo Lee
9
1
0
17 May 2023
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations
Wei-wei Lin
Chenhang He
Man-Wai Mak
Youzhi Tu
28
5
0
14 May 2023
WEIRD FAccTs: How Western, Educated, Industrialized, Rich, and Democratic is FAccT?
Ali Akbar Septiandri
Marios Constantinides
Mohammad Tahaei
Daniele Quercia
27
34
0
10 May 2023
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Fa-Ting Hong
Li Shen
Dan Xu
3DH
CVBM
26
15
0
10 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
44
57
0
09 May 2023
Multi-object Video Generation from Single Frame Layouts
Yang Wu
Zhi-Bin Liu
Hefeng Wu
Liang Lin
27
3
0
06 May 2023
Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization
Connor Z. Lin
Koki Nagano
Jan Kautz
E. R. Chan
Umar Iqbal
Leonidas J. Guibas
Gordon Wetzstein
S. Khamis
3DH
18
14
0
04 May 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
62
14
0
04 May 2023
Controllable One-Shot Face Video Synthesis With Semantic Aware Prior
Kangning Liu
Yu-Chuan Su
Wei
Weiheng Hong
Ruijin Cang
Xuhui Jia
CVBM
46
2
0
27 Apr 2023
Self-Supervised Learning with Cluster-Aware-DINO for High-Performance Robust Speaker Verification
Bing Han
Zhengyang Chen
Y. Qian
22
20
0
12 Apr 2023
One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field
Weichuang Li
Longhao Zhang
Dong Wang
Bingyan Zhao
Zhigang Wang
Mulin. Chen
Bangze Zhang
Zhongjian Wang
Liefeng Bo
Xuelong Li
3DH
CVBM
32
53
0
11 Apr 2023
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence
Hanbin Hong
Xinyu Zhang
Binghui Wang
Zhongjie Ba
Yuan Hong
AAML
30
2
0
10 Apr 2023
Unsupervised Speech Representation Pooling Using Vector Quantization
J. Park
Kwanghee Choi
Hyunjun Heo
Hyung-Min Park
SSL
33
0
0
08 Apr 2023
Benchmark Dataset Dynamics, Bias and Privacy Challenges in Voice Biometrics Research
Casandra Rusti
Anna Leschanowsky
Carolyn Quinlan
Michaela Pnacekova
Lauriane Gorce
W. Hutiri
30
2
0
07 Apr 2023
Margin-Mixup: A Method for Robust Speaker Verification in Multi-Speaker Audio
Jenthe Thienpondt
N. Madhu
Kris Demuynck
32
4
0
07 Apr 2023
Face Animation with an Attribute-Guided Diffusion Model
Bo-Wen Zeng
Xuhui Liu
Sicheng Gao
Boyu Liu
Hong Li
Jianzhuang Liu
Baochang Zhang
47
31
0
06 Apr 2023
StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer
Sasikarn Khwanmuang
Pakkapon Phongthawee
Patsorn Sangkloy
Supasorn Suwajanakorn
3DH
29
7
0
05 Apr 2023
AutoAD: Movie Description in Context
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGen
29
34
0
29 Mar 2023
VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs
Anna Frühstück
N. Sarafianos
Yuanlu Xu
Peter Wonka
Tony Tung
58
20
0
28 Mar 2023
RobustSwap: A Simple yet Robust Face Swapping Model against Attribute Leakage
Jaeseong Lee
Taewoo Kim
S. Park
Younggun Lee
Jaegul Choo
CVBM
53
2
0
28 Mar 2023
A Universal Identity Backdoor Attack against Speaker Verification based on Siamese Network
Haodong Zhao
Wei Du
Junjie Guo
Gongshen Liu
AAML
18
0
0
28 Mar 2023
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
30
58
0
26 Mar 2023
DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker Verification
Yangfu Li
Jiapan Gan
Xiaodan Lin
24
6
0
20 Mar 2023
Right the docs: Characterising voice dataset documentation practices used in machine learning
Kathy Reid
Elizabeth T. Williams
27
2
0
19 Mar 2023
The Graph feature fusion technique for speaker recognition based on wav2vec2.0 framework
Zirui Ge
Haiyan Guo
Zhen Yang
34
1
0
19 Mar 2023
MMFace4D: A Large-Scale Multi-Modal 4D Face Dataset for Audio-Driven 3D Face Animation
Haozhe Wu
Jia Jia
Junliang Xing
Hongwei Xu
Xiangyuan Wang
Jelo Wang
CVBM
32
7
0
17 Mar 2023
Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation
Yulin Pan
Xiangteng He
Biao Gong
Yuxin Peng
Yiliang Lv
SSL
24
0
0
15 Mar 2023
Previous
1
2
3
...
6
7
8
...
20
21
22
Next