ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08612
  4. Cited By
VoxCeleb: a large-scale speaker identification dataset

VoxCeleb: a large-scale speaker identification dataset

26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
ArXivPDFHTML

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,100 papers shown
Title
Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural
  Networks
Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks
Run Wang
Jixing Ren
Boheng Li
Tianyi She
Wenhui Zhang
Liming Fang
Jing Chen
Chao Shen
Lina Wang
WIGM
34
16
0
14 Oct 2022
Anonymizing Speech with Generative Adversarial Networks to Preserve
  Speaker Privacy
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Sarina Meyer
Pascal Tilli
Pavel Denisov
Florian Lux
Julia Koch
Ngoc Thang Vu
30
31
0
13 Oct 2022
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging
  Talking Avatar
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar
Aolan Sun
Xulong Zhang
Tiandong Ling
Jianzong Wang
Ning Cheng
Jing Xiao
35
4
0
13 Oct 2022
Revisiting Self-Supervised Contrastive Learning for Facial Expression
  Recognition
Revisiting Self-Supervised Contrastive Learning for Facial Expression Recognition
Yuxuan Shu
Xiao Gu
Guangyao Yang
Benny Lo
SSL
54
17
0
08 Oct 2022
Compressing Video Calls using Synthetic Talking Heads
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
A Keypoint Based Enhancement Method for Audio Driven Free View Talking
  Head Synthesis
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Yichen Han
Ya Li
Yingming Gao
Jinlong Xue
Songpo Wang
Lei Yang
21
2
0
07 Oct 2022
Audio-Visual Face Reenactment
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
27
22
0
06 Oct 2022
PSVRF: Learning to restore Pitch-Shifted Voice without reference
Yangfu Li
Xiaodan Lin
Jiaxin Yang
19
0
0
06 Oct 2022
Geometry Driven Progressive Warping for One-Shot Face Animation
Geometry Driven Progressive Warping for One-Shot Face Animation
Yatao Zhong
F. Amjadi
Ilya Zharkov
3DH
CVBM
21
1
0
05 Oct 2022
Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental
  analysis of generalizability, open challenges, and the way forward
Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward
Awais Khan
K. Malik
James Ryan
Mikul Saravanan
AAML
53
11
0
02 Oct 2022
An empirical study of weakly supervised audio tagging embeddings for
  general audio representations
An empirical study of weakly supervised audio tagging embeddings for general audio representations
Heinrich Dinkel
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
43
1
0
30 Sep 2022
Motion and Appearance Adaptation for Cross-Domain Motion Transfer
Motion and Appearance Adaptation for Cross-Domain Motion Transfer
Borun Xu
Biao Wang
Jinhong Deng
Jiale Tao
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
54
9
0
29 Sep 2022
MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Andrés Vasco-Carofilis
Laura Fernández-Robles
Enrique Alegre
Eduardo FIDALGO
47
2
0
28 Sep 2022
Motion Transformer for Unsupervised Image Animation
Motion Transformer for Unsupervised Image Animation
Jiale Tao
Biao Wang
T. Ge
Yuning Jiang
Wen Li
Lixin Duan
ViT
24
9
0
28 Sep 2022
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face
  Reenactment
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM
29
17
0
27 Sep 2022
NWPU-ASLP System for the VoicePrivacy 2022 Challenge
NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Jixun Yao
Qing Wang
Li Zhang
Pengcheng Guo
Yuhao Liang
Linfu Xie
PICV
31
17
0
24 Sep 2022
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on
  Pitch and Speed
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed
Mei-Shuo Chen
Z. Duan
30
10
0
23 Sep 2022
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge
  2022
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022
Qutang Cai
Guoqiang Hong
Zhijian Ye
Ximin Li
Haizhou Li
46
7
0
23 Sep 2022
Gemino: Practical and Robust Neural Compression for Video Conferencing
Gemino: Practical and Robust Neural Compression for Video Conferencing
Vibhaalakshmi Sivaraman
Pantea Karimi
Vedantha Venkatapathy
Mehrdad Khani Shirkoohi
Sadjad Fouladi
M. Alizadeh
F. Durand
Vivienne Sze
3DH
49
18
0
21 Sep 2022
FNeVR: Neural Volume Rendering for Face Animation
FNeVR: Neural Volume Rendering for Face Animation
Bo-Wen Zeng
Bo-Ye Liu
Hong Li
Xuhui Liu
Jianzhuang Liu
Dapeng Chen
Wei Peng
Baochang Zhang
CVBM
3DH
55
26
0
21 Sep 2022
Pay Attention to Hard Trials
Pay Attention to Hard Trials
Lantian Li
Di Wang
Dong Wang
56
1
0
10 Sep 2022
Defend Data Poisoning Attacks on Voice Authentication
Defend Data Poisoning Attacks on Voice Authentication
Ke Li
Cameron Baird
D. Lin
AAML
49
9
0
09 Sep 2022
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End
  Automatic Speaker Verification with Multiple Enrollment Utterances
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances
Chang Zeng
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
34
6
0
01 Sep 2022
Computing with Hypervectors for Efficient Speaker Identification
Computing with Hypervectors for Efficient Speaker Identification
Ping-Chen Huang
Denis Kleyko
J. Rabaey
Bruno A. Olshausen
P. Kanerva
40
2
0
28 Aug 2022
Target Speaker Voice Activity Detection with Transformers and Its
  Integration with End-to-End Neural Diarization
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
36
26
0
27 Aug 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for
  Indian languages
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
Tahir Javed
Kaushal Bhogale
A. Raman
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
ELM
30
20
0
24 Aug 2022
Learning Branched Fusion and Orthogonal Projection for Face-Voice
  Association
Learning Branched Fusion and Orthogonal Projection for Face-Voice Association
M. S. Saeed
Shah Nawaz
M. H. Khan
S. Javed
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
27
4
0
22 Aug 2022
Learning in Audio-visual Context: A Review, Analysis, and New
  Perspective
Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Yake Wei
Di Hu
Yapeng Tian
Xuelong Li
46
55
0
20 Aug 2022
Disentangled Speaker Representation Learning via Mutual Information
  Minimization
Disentangled Speaker Representation Learning via Mutual Information Minimization
Sung Hwan Mun
Mingrui Han
Minchan Kim
Dongjune Lee
N. Kim
DRL
41
9
0
17 Aug 2022
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle
  Transfer via Local-Style-Aware Hair Alignment
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment
Taewoo Kim
Chaeyeon Chung
Yoonseong Kim
S. Park
Kangyeol Kim
Jaegul Choo
3DH
39
20
0
16 Aug 2022
FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction
  and Expression Editing
FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing
Jingbo Zhang
Xiaoyu Li
Bo Liu
Can Wang
Jing Liao
3DH
CVBM
39
41
0
11 Aug 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information
  Extraction from Speech
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
Jaejin Cho
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
SSL
41
18
0
10 Aug 2022
Robust Acoustic Domain Identification with its Application to Speaker
  Diarization
Robust Acoustic Domain Identification with its Application to Speaker Diarization
Kishore Kumar A
Shefali Waldekar
Md. Sahidullah
G. Saha
26
0
0
05 Aug 2022
Attention and DCT based Global Context Modeling for Text-independent
  Speaker Recognition
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition
Wei Xia
John H. L. Hansen
32
4
0
04 Aug 2022
Free-HeadGAN: Neural Talking Head Synthesis with Explicit Gaze Control
Free-HeadGAN: Neural Talking Head Synthesis with Explicit Gaze Control
M. Doukas
Evangelos Ververas
V. Sharmanska
S. Zafeiriou
CVBM
25
15
0
03 Aug 2022
The SJTU System for Short-duration Speaker Verification Challenge 2021
The SJTU System for Short-duration Speaker Verification Challenge 2021
Bing Han
Zhengyang Chen
Zhikai Zhou
Y. Qian
12
6
0
03 Aug 2022
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label
  Correction
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction
Bing Han
Zhengyang Chen
Y. Qian
22
32
0
03 Aug 2022
End-To-End Audiovisual Feature Fusion for Active Speaker Detection
End-To-End Audiovisual Feature Fusion for Active Speaker Detection
Fiseha B. Tesema
Zheyuan Lin
Shiqiang Zhu
Wei Song
J. Gu
Hong-Chuan Wu
17
4
0
27 Jul 2022
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
Haoning Zhu
Wayne Wu
Wentao Zhu
Liming Jiang
Siwei Tang
Li Zhang
Ziwei Liu
Chen Change Loy
65
155
0
25 Jul 2022
Fine-grained Early Frequency Attention for Deep Speaker Recognition
Fine-grained Early Frequency Attention for Deep Speaker Recognition
Amirhossein Hajavi
Ali Etemad
30
4
0
20 Jul 2022
Adversarial Reweighting for Speaker Verification Fairness
Adversarial Reweighting for Speaker Verification Fairness
Minho Jin
Chelsea J.-T. Ju
Zeya Chen
Yi-Chieh Liu
J. Droppo
A. Stolcke
24
4
0
15 Jul 2022
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification
  Challenge
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge
Xingming Wang
Xiaoyi Qin
Yikang Wang
Yunfei Xu
Ming Li
68
14
0
15 Jul 2022
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer
  to Unlabeled Modality
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality
Wei-Ning Hsu
Bowen Shi
SSL
VLM
29
42
0
14 Jul 2022
Cross-Age Speaker Verification: Learning Age-Invariant Speaker
  Embeddings
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings
Xiaoyi Qin
Na Li
Chao Weng
Dan Su
Ming Li
66
16
0
13 Jul 2022
Label-Efficient Self-Supervised Speaker Verification With Information
  Maximization and Contrastive Learning
Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning
Théo Lepage
Réda Dehak
SSL
29
12
0
12 Jul 2022
PoeticTTS -- Controllable Poetry Reading for Literary Studies
PoeticTTS -- Controllable Poetry Reading for Literary Studies
Julia Koch
Florian Lux
Nadja Schauffler
T. Bernhart
Felix Dieterle
Jonas Kuhn
Sandra Richter
Gabriel Viehhauser
Ngoc Thang Vu
24
5
0
11 Jul 2022
Speaker Anonymization with Phonetic Intermediate Representations
Speaker Anonymization with Phonetic Intermediate Representations
Sarina Meyer
Florian Lux
Pavel Denisov
Julia Koch
Pascal Tilli
Ngoc Thang Vu
34
27
0
11 Jul 2022
The HCCL System for the NIST SRE21
The HCCL System for the NIST SRE21
Zhuo Li
Runqiu Xiao
Hangting Chen
Zhenduo Zhao
Zi-qiang Zhang
Wenchao Wang
27
0
0
11 Jul 2022
Multi-Frequency Information Enhanced Channel Attention Module for
  Speaker Representation Learning
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning
Mufan Sang
John H. L. Hansen
22
13
0
10 Jul 2022
Graph-based Multi-View Fusion and Local Adaptation: Mitigating
  Within-Household Confusability for Speaker Identification
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification
Long Chen
Yi Meng
Venkatesh Ravichandran
A. Stolcke
19
1
0
08 Jul 2022
Previous
123...91011...202122
Next