Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08612
Cited By
VoxCeleb: a large-scale speaker identification dataset
26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VoxCeleb: a large-scale speaker identification dataset"
50 / 1,100 papers shown
Title
Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks
Run Wang
Jixing Ren
Boheng Li
Tianyi She
Wenhui Zhang
Liming Fang
Jing Chen
Chao Shen
Lina Wang
WIGM
34
16
0
14 Oct 2022
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Sarina Meyer
Pascal Tilli
Pavel Denisov
Florian Lux
Julia Koch
Ngoc Thang Vu
30
31
0
13 Oct 2022
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar
Aolan Sun
Xulong Zhang
Tiandong Ling
Jianzong Wang
Ning Cheng
Jing Xiao
35
4
0
13 Oct 2022
Revisiting Self-Supervised Contrastive Learning for Facial Expression Recognition
Yuxuan Shu
Xiao Gu
Guangyao Yang
Benny Lo
SSL
54
17
0
08 Oct 2022
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Yichen Han
Ya Li
Yingming Gao
Jinlong Xue
Songpo Wang
Lei Yang
21
2
0
07 Oct 2022
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
27
22
0
06 Oct 2022
PSVRF: Learning to restore Pitch-Shifted Voice without reference
Yangfu Li
Xiaodan Lin
Jiaxin Yang
19
0
0
06 Oct 2022
Geometry Driven Progressive Warping for One-Shot Face Animation
Yatao Zhong
F. Amjadi
Ilya Zharkov
3DH
CVBM
21
1
0
05 Oct 2022
Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward
Awais Khan
K. Malik
James Ryan
Mikul Saravanan
AAML
53
11
0
02 Oct 2022
An empirical study of weakly supervised audio tagging embeddings for general audio representations
Heinrich Dinkel
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
43
1
0
30 Sep 2022
Motion and Appearance Adaptation for Cross-Domain Motion Transfer
Borun Xu
Biao Wang
Jinhong Deng
Jiale Tao
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
54
9
0
29 Sep 2022
MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Andrés Vasco-Carofilis
Laura Fernández-Robles
Enrique Alegre
Eduardo FIDALGO
47
2
0
28 Sep 2022
Motion Transformer for Unsupervised Image Animation
Jiale Tao
Biao Wang
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
ViT
24
9
0
28 Sep 2022
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM
29
17
0
27 Sep 2022
NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Jixun Yao
Qing Wang
Li Zhang
Pengcheng Guo
Yuhao Liang
Linfu Xie
PICV
31
17
0
24 Sep 2022
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed
Mei-Shuo Chen
Z. Duan
30
10
0
23 Sep 2022
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022
Qutang Cai
Guoqiang Hong
Zhijian Ye
Ximin Li
Haizhou Li
46
7
0
23 Sep 2022
Gemino: Practical and Robust Neural Compression for Video Conferencing
Vibhaalakshmi Sivaraman
Pantea Karimi
Vedantha Venkatapathy
Mehrdad Khani Shirkoohi
Sadjad Fouladi
M. Alizadeh
F. Durand
Vivienne Sze
3DH
49
18
0
21 Sep 2022
FNeVR: Neural Volume Rendering for Face Animation
Bo-Wen Zeng
Bo-Ye Liu
Hong Li
Xuhui Liu
Jianzhuang Liu
Dapeng Chen
Wei Peng
Baochang Zhang
CVBM
3DH
55
26
0
21 Sep 2022
Pay Attention to Hard Trials
Lantian Li
Di Wang
Dong Wang
56
1
0
10 Sep 2022
Defend Data Poisoning Attacks on Voice Authentication
Ke Li
Cameron Baird
D. Lin
AAML
49
9
0
09 Sep 2022
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances
Chang Zeng
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
34
6
0
01 Sep 2022
Computing with Hypervectors for Efficient Speaker Identification
Ping-Chen Huang
Denis Kleyko
J. Rabaey
Bruno A. Olshausen
P. Kanerva
40
2
0
28 Aug 2022
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
36
26
0
27 Aug 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
Tahir Javed
Kaushal Bhogale
A. Raman
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
ELM
30
20
0
24 Aug 2022
Learning Branched Fusion and Orthogonal Projection for Face-Voice Association
M. S. Saeed
Shah Nawaz
M. H. Khan
S. Javed
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
27
4
0
22 Aug 2022
Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Yake Wei
Di Hu
Yapeng Tian
Xuelong Li
46
55
0
20 Aug 2022
Disentangled Speaker Representation Learning via Mutual Information Minimization
Sung Hwan Mun
Mingrui Han
Minchan Kim
Dongjune Lee
N. Kim
DRL
41
9
0
17 Aug 2022
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment
Taewoo Kim
Chaeyeon Chung
Yoonseong Kim
S. Park
Kangyeol Kim
Jaegul Choo
3DH
39
20
0
16 Aug 2022
FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing
Jingbo Zhang
Xiaoyu Li
Bo Liu
Can Wang
Jing Liao
3DH
CVBM
39
41
0
11 Aug 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
Jaejin Cho
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
SSL
41
18
0
10 Aug 2022
Robust Acoustic Domain Identification with its Application to Speaker Diarization
Kishore Kumar A
Shefali Waldekar
Md. Sahidullah
G. Saha
26
0
0
05 Aug 2022
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition
Wei Xia
John H. L. Hansen
32
4
0
04 Aug 2022
Free-HeadGAN: Neural Talking Head Synthesis with Explicit Gaze Control
M. Doukas
Evangelos Ververas
V. Sharmanska
S. Zafeiriou
CVBM
25
15
0
03 Aug 2022
The SJTU System for Short-duration Speaker Verification Challenge 2021
Bing Han
Zhengyang Chen
Zhikai Zhou
Y. Qian
12
6
0
03 Aug 2022
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction
Bing Han
Zhengyang Chen
Y. Qian
22
32
0
03 Aug 2022
End-To-End Audiovisual Feature Fusion for Active Speaker Detection
Fiseha B. Tesema
Zheyuan Lin
Shiqiang Zhu
Wei Song
J. Gu
Hong-Chuan Wu
17
4
0
27 Jul 2022
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
Haoning Zhu
Wayne Wu
Wentao Zhu
Liming Jiang
Siwei Tang
Li Zhang
Ziwei Liu
Chen Change Loy
65
155
0
25 Jul 2022
Fine-grained Early Frequency Attention for Deep Speaker Recognition
Amirhossein Hajavi
Ali Etemad
30
4
0
20 Jul 2022
Adversarial Reweighting for Speaker Verification Fairness
Minho Jin
Chelsea J.-T. Ju
Zeya Chen
Yi-Chieh Liu
J. Droppo
A. Stolcke
24
4
0
15 Jul 2022
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge
Xingming Wang
Xiaoyi Qin
Yikang Wang
Yunfei Xu
Ming Li
68
14
0
15 Jul 2022
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality
Wei-Ning Hsu
Bowen Shi
SSL
VLM
29
42
0
14 Jul 2022
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings
Xiaoyi Qin
Na Li
Chao Weng
Dan Su
Ming Li
66
16
0
13 Jul 2022
Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning
Théo Lepage
Réda Dehak
SSL
29
12
0
12 Jul 2022
PoeticTTS -- Controllable Poetry Reading for Literary Studies
Julia Koch
Florian Lux
Nadja Schauffler
T. Bernhart
Felix Dieterle
Jonas Kuhn
Sandra Richter
Gabriel Viehhauser
Ngoc Thang Vu
24
5
0
11 Jul 2022
Speaker Anonymization with Phonetic Intermediate Representations
Sarina Meyer
Florian Lux
Pavel Denisov
Julia Koch
Pascal Tilli
Ngoc Thang Vu
34
27
0
11 Jul 2022
The HCCL System for the NIST SRE21
Zhuo Li
Runqiu Xiao
Hangting Chen
Zhenduo Zhao
Zi-qiang Zhang
Wenchao Wang
27
0
0
11 Jul 2022
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning
Mufan Sang
John H. L. Hansen
22
13
0
10 Jul 2022
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification
Long Chen
Yi Meng
Venkatesh Ravichandran
A. Stolcke
19
1
0
08 Jul 2022
Previous
1
2
3
...
9
10
11
...
20
21
22
Next