ResearchTrend.AI
VoxCeleb: a large-scale speaker identification dataset


26 June 2017
Arsha Nagrani
Joon Son Chung
Andrew Zisserman

Papers citing "VoxCeleb: a large-scale speaker identification dataset"

50 / 1,111 papers shown
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
72
62
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
39
38
0
19 May 2020
Defending Your Voice: Adversarial Attack on Voice Conversion
Chien-yu Huang
Yist Y. Lin
Hung-yi Lee
Lin-Shan Lee
AAML
87
52
0
18 May 2020
Metric Learning for Keyword Spotting
Jaesung Huh
Minjae Lee
Hee-Soo Heo
Seongkyu Mun
Joon Son Chung
64
23
0
18 May 2020
End-to-End Lip Synchronisation Based on Pattern Classification
You Jin Kim
Hee-Soo Heo
Soo-Whan Chung
Bong-Jin Lee
CVBM
40
0
0
18 May 2020
Design Choices for X-vector Based Speaker Anonymization
B. M. L. Srivastava
N. Tomashenko
Xin Wang
Emmanuel Vincent
Junichi Yamagishi
Mohamed Maouche
A. Bellet
Marc Tommasi
60
63
0
18 May 2020
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild
P. S. Nidadavolu
Saurabh Kataria
Leibny Paola García-Perera
Jesús Villalba
Najim Dehak
28
3
0
17 May 2020
AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
Afroz Ahamad
Ankit Anand
Pranesh Bhargava
36
23
0
16 May 2020
Speaker Re-identification with Speaker Dependent Speech Enhancement
Yanpei Shi
Qiang Huang
Thomas Hain
43
4
0
15 May 2020
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification
Yanpei Shi
Qiang Huang
Thomas Hain
51
2
0
15 May 2020
ConVoice: Real-Time Zero-Shot Voice Style Transfer with Convolutional Network
Yurii Rebryk
Stanislav Beliaev
62
8
0
15 May 2020
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques
Jenthe Thienpondt
Kris Demuynck
90
1,349
0
14 May 2020
FaR-GAN for One-Shot Face Reenactment
Hanxiang Hao
Sriram Baireddy
A. Reibman
Edward J. Delp
3DH CVBM
62
10
0
13 May 2020
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint
Zexin Cai
Chuxiong Zhang
Ming Li
73
42
0
10 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
34
5
0
07 May 2020
AutoSpeech: Neural Architecture Search for Speaker Recognition
Shaojin Ding
Tianlong Chen
Xinyu Gong
Weiwei Zha
Zhangyang Wang
72
57
0
07 May 2020
What comprises a good talking-head video generation?: A Survey and Benchmark
Lele Chen
Guofeng Cui
Ziyi Kou
Haitian Zheng
Chenliang Xu
EGVM
54
59
0
07 May 2020
Introducing the VoicePrivacy Initiative
N. Tomashenko
B. M. L. Srivastava
Xin Wang
Emmanuel Vincent
A. Nautsch
...
Nicholas W. D. Evans
J. Patino
J. Bonastre
Paul-Gauthier Noé
Massimiliano Todisco
123
132
0
04 May 2020
VGGSound: A Large-scale Audio-Visual Dataset
Honglie Chen
Weidi Xie
Andrea Vedaldi
Andrew Zisserman
110
583
0
29 Apr 2020
Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision
Soo-Whan Chung
Hong-Goo Kang
Joon Son Chung
SSL
48
39
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
84
26
0
28 Apr 2020
Neural Head Reenactment with Latent Pose Descriptors
Egor Burkov
I. Pasechnik
Artur Grigorev
Victor Lempitsky
3DH
122
131
0
24 Apr 2020
Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release
Yaowei Han
Sheng Li
Yang Cao
Qiang Ma
Masatoshi Yoshikawa
58
45
0
16 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Hyeong-Seok Choi
Changdae Park
Kyogu Lee
CVBM
48
29
0
13 Apr 2020
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification
Xu Li
Jinghua Zhong
Jianwei Yu
Shoukang Hu
Xixin Wu
Xunying Liu
Helen Meng
BDL
67
12
0
08 Apr 2020
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech
N. Wilkinson
A. Biswas
Emre Yilmaz
Febe de Wet
Ewald van der Westhuizen
T. Niesler
75
11
0
08 Apr 2020
Motion-supervised Co-Part Segmentation
Aliaksandr Siarohin
Subhankar Roy
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
SSL
48
35
0
07 Apr 2020
Deep Normalization for Speaker Vectors
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
87
25
0
07 Apr 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Youngmoon Jung
Seong Min Kye
Yeunju Choi
Myunghun Jung
Hoirin Kim
77
37
0
07 Apr 2020
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye
Youngmoon Jung
Haebeom Lee
Sung Ju Hwang
Hoirin Kim
124
51
0
06 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion
Mayank Tripathi
Divyanshu Singh
Seba Susan
28
8
0
05 Apr 2020
Neural i-vectors
Ville Vestman
Kong Aik Lee
Tomi Kinnunen
DRL
59
4
0
03 Apr 2020
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection
Tharindu Fernando
Sridha Sridharan
Mitchell McLaren
Darshana Priyasad
Simon Denman
Clinton Fookes
41
5
0
02 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
77
60
0
01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition
João Antônio Chagas Nunes
David Macêdo
Cleber Zanchettin
64
23
0
31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification
Juan Manuel Coria
H. Bredin
Sahar Ghannay
S. Rosset
50
15
0
31 Mar 2020
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
Xianfang Zeng
Yusu Pan
Mengmeng Wang
Jiangning Zhang
Yong Liu
CVBM
147
42
0
29 Mar 2020
Learning Inverse Rendering of Faces from Real-world Videos
Yuda Qiu
Zhangyang Xiong
Kai Han
Zhongyuan Wang
Zixiang Xiong
Xiaoguang Han
CVBM 3DH
32
2
0
26 Mar 2020
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
77
437
0
26 Mar 2020
Improving Embedding Extraction for Speaker Verification with Ladder Network
Fei Tao
Gokhan Tur
24
3
0
20 Mar 2020
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data
Vincent Roger
Jérôme Farinas
J. Pinquier
57
24
0
09 Mar 2020
Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments
I. Vélez
C. Rascón
Gibran Fuentes Pineda
84
10
0
06 Mar 2020
First Order Motion Model for Image Animation
Aliaksandr Siarohin
Stéphane Lathuilière
Sergey Tulyakov
Elisa Ricci
N. Sebe
VGen DiffM
184
942
0
29 Feb 2020
Bio-Inspired Modality Fusion for Active Speaker Detection
Gustavo Assunção
Nuno Gonçalves
Paulo Menezes
19
3
0
28 Feb 2020
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models
Edresson Casanova
Arnaldo Cândido Júnior
C. Shulby
F. S. Oliveira
L. Gris
Hamilton Pereira da Silva
S. Aluísio
M. Ponti
16
2
0
25 Feb 2020
Towards Learning a Universal Non-Semantic Representation of Speech
Joel Shor
A. Jansen
Ronnie Maor
Oran Lang
Omry Tuval
Félix de Chaumont Quitry
Marco Tagliasacchi
Ira Shavitt
Dotan Emanuel
Yinnon A. Haviv
SSL
158
160
0
25 Feb 2020
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong Liu
CVBM
113
123
0
24 Feb 2020
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team
Qingjian Lin
Weicheng Cai
Lin Yang
Junjie Wang
J. Zhang
Ming Li
VLM
59
18
0
23 Feb 2020
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
45
6
0
21 Feb 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
92
88
0
20 Feb 2020