ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.12233
  4. Cited By
Automatic speaker verification spoofing and deepfake detection using
  wav2vec 2.0 and data augmentation
v1v2 (latest)

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

24 February 2022
Hemlata Tak
Massimiliano Todisco
Xin Wang
Jee-weon Jung
Junichi Yamagishi
Nicholas W. D. Evans
ArXiv (abs)PDFHTML

Papers citing "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation"

49 / 49 papers shown
Title
Weakly-supervised Audio Temporal Forgery Localization via Progressive Audio-language Co-learning Network
Weakly-supervised Audio Temporal Forgery Localization via Progressive Audio-language Co-learning Network
Junyan Wu
Wenbo Xu
Wei Lu
Xiangyang Luo
Rui Yang
Shize Guo
104
0
0
03 May 2025
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
Canh Vu
99
5
0
23 Sep 2024
SONICS: Synthetic Or Not -- Identifying Counterfeit Songs
SONICS: Synthetic Or Not -- Identifying Counterfeit Songs
Md Awsafur Rahman
Zaber Ibn Abdul Hakim
Najibul Haque Sarker
Bishmoy Paul
S. Fattah
130
9
0
26 Aug 2024
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
48
16
0
24 Jan 2022
Graph attentive feature aggregation for text-independent speaker
  verification
Graph attentive feature aggregation for text-independent speaker verification
Hye-jin Shim
Ju-Sung Heo
Jae-han Park
Gareth Lee
Ha-Jin Yu
96
16
0
23 Dec 2021
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at
  Scale
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
110
708
0
17 Nov 2021
Investigating self-supervised front ends for speech spoofing
  countermeasures
Investigating self-supervised front ends for speech spoofing countermeasures
Xin Wang
Junichi Yamagishi
AAML
66
125
0
15 Nov 2021
RawBoost: A Raw Data Boosting and Augmentation Method applied to
  Automatic Speaker Verification Anti-Spoofing
RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing
Hemlata Tak
Madhu R. Kamble
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
113
112
0
08 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
265
1,898
0
26 Oct 2021
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph
  Attention Networks
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks
Jee-weon Jung
Hee-Soo Heo
Hemlata Tak
Hye-jin Shim
Joon Son Chung
Bong-Jin Lee
Ha-Jin Yu
Nicholas W. D. Evans
207
308
0
04 Oct 2021
Fine-tuning wav2vec2 for speaker recognition
Fine-tuning wav2vec2 for speaker recognition
Nik Vaessen
David A. van Leeuwen
101
108
0
30 Sep 2021
ASVspoof 2021: accelerating progress in spoofed and deepfake speech
  detection
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Junichi Yamagishi
Xin Wang
Massimiliano Todisco
Md. Sahidullah
J. Patino
...
Xuechen Liu
Kong Aik Lee
Tomi Kinnunen
Nicholas W. D. Evans
Héctor Delgado
65
351
0
01 Sep 2021
End-to-End Spectro-Temporal Graph Attention Networks for Speaker
  Verification Anti-Spoofing and Speech Deepfake Detection
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection
Hemlata Tak
Jee-weon Jung
J. Patino
Madhu R. Kamble
Massimiliano Todisco
Nicholas W. D. Evans
82
171
0
27 Jul 2021
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
Xinhui Chen
You Zhang
Ge Zhu
Z. Duan
72
49
0
26 Jul 2021
Improved Language Identification Through Cross-Lingual Self-Supervised
  Learning
Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Andros Tjandra
Diptanu Gon Choudhury
Frank Zhang
Kritika Singh
Alexis Conneau
Alexei Baevski
Assaf Sela
Yatharth Saraf
Michael Auli
VLMSSL
62
36
0
08 Jul 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
184
2,993
0
14 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark
SUPERB: Speech processing Universal PERformance Benchmark
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
108
941
0
03 May 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised
  Representation Learning from Speech
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
76
70
0
23 Apr 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
73
365
0
08 Apr 2021
An Empirical Study on Channel Effects for Synthetic Voice Spoofing
  Countermeasure Systems
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems
You Zhang
Ge Zhu
Fei Jiang
Z. Duan
77
29
0
03 Apr 2021
ASVspoof 2019: spoofing countermeasures for the detection of
  synthesized, converted and replayed speech
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A. Nautsch
Xin Wang
Nicholas W. D. Evans
Tomi Kinnunen
Ville Vestman
Massimiliano Todisco
Héctor Delgado
Md. Sahidullah
Junichi Yamagishi
Kong Aik Lee
168
152
0
11 Feb 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
  Learning, Semi-Supervised Learning and Interpretation
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
102
492
0
02 Jan 2021
Exploring wav2vec 2.0 on speaker verification and language
  identification
Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan
Meng Li
Shiyu Zhou
Bo Xu
141
203
0
11 Dec 2020
MLS: A Large-Scale Multilingual Dataset for Speech Research
MLS: A Large-Scale Multilingual Dataset for Speech Research
Vineel Pratap
Qiantong Xu
Anuroop Sriram
Gabriel Synnaeve
R. Collobert
AuLLM
99
509
0
07 Dec 2020
One-class Learning Towards Synthetic Voice Spoofing Detection
One-class Learning Towards Synthetic Voice Spoofing Detection
You Zhang
Fei Jiang
Z. Duan
68
219
0
27 Oct 2020
An iterative framework for self-supervised deep speaker representation
  learning
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
47
37
0
25 Oct 2020
Graph Attention Networks for Speaker Verification
Graph Attention Networks for Speaker Verification
Jee-weon Jung
Hee-Soo Heo
Ha-Jin Yu
Joon Son Chung
65
26
0
22 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations
Similarity Analysis of Self-Supervised Speech Representations
Yu-An Chung
Yonatan Belinkov
James R. Glass
SSL
97
37
0
22 Oct 2020
Self-training and Pre-training are Complementary for Speech Recognition
Self-training and Pre-training are Complementary for Speech Recognition
Qiantong Xu
Alexei Baevski
Tatiana Likhomanenko
Paden Tomasello
Alexis Conneau
R. Collobert
Gabriel Synnaeve
Michael Auli
SSLVLM
135
173
0
22 Oct 2020
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker
  Verification: Fundamentals
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals
Tomi Kinnunen
Héctor Delgado
Nicholas W. D. Evans
Kong Aik Lee
Ville Vestman
...
Massimiliano Todisco
Xin Wang
Md. Sahidullah
Junichi Yamagishi
D. Reynolds
36
113
0
12 Jul 2020
Unsupervised Cross-lingual Representation Learning for Speech
  Recognition
Unsupervised Cross-lingual Representation Learning for Speech Recognition
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
154
782
0
24 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
297
5,837
0
20 Jun 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for
  Robust Speaker Verification of Variable-Duration Utterances
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Youngmoon Jung
Seong Min Kye
Yeunju Choi
Myunghun Jung
Hoirin Kim
58
37
0
07 Apr 2020
Learning Robust and Multilingual Speech Representations
Learning Robust and Multilingual Speech Representations
Kazuya Kawakami
Luyu Wang
Chris Dyer
Phil Blunsom
Aaron van den Oord
SSL
81
100
0
29 Jan 2020
Common Voice: A Massively-Multilingual Speech Corpus
Common Voice: A Massively-Multilingual Speech Corpus
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
91
1,614
0
13 Dec 2019
Graph U-Nets
Graph U-Nets
Hongyang Gao
Shuiwang Ji
AI4CESSLSSegGNN
132
1,092
0
11 May 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
74
616
0
09 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLMFaML
116
3,156
0
01 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
62
344
0
26 Feb 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,175
0
11 Oct 2018
Speaker Recognition from Raw Waveform with SincNet
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
175
717
0
29 Jul 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRLSSL
351
10,356
0
10 Jul 2018
Attentive Statistics Pooling for Deep Speaker Embedding
Attentive Statistics Pooling for Deep Speaker Embedding
K. Okabe
Takafumi Koshinaka
Koichi Shinoda
98
530
0
29 Mar 2018
Graph Attention Networks
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
481
20,233
0
30 Oct 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
Self-Normalizing Neural Networks
Self-Normalizing Neural Networks
Günter Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
470
2,519
0
08 Jun 2017
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
465
43,341
0
11 Feb 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,312
0
22 Dec 2014
The BOSARIS Toolkit: Theory, Algorithms and Code for Surviving the New
  DCF
The BOSARIS Toolkit: Theory, Algorithms and Code for Surviving the New DCF
Niko Brummer
E. D. Villiers
77
204
0
10 Apr 2013
1