Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.17222
Cited By
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection
31 October 2022
L. Attorresi
Davide Salvi
Clara Borrelli
Paolo Bestagini
Stefano Tubaro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection"
26 / 26 papers shown
Title
What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
Binh Nguyen
Shuji Shi
Ryan Ofman
Thai Le
AAML
154
0
0
23 May 2025
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Junichi Yamagishi
Xin Wang
Massimiliano Todisco
Md. Sahidullah
J. Patino
...
Xuechen Liu
Kong Aik Lee
Tomi Kinnunen
Nicholas W. D. Evans
Héctor Delgado
65
345
0
01 Sep 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
75
764
0
08 Jun 2021
Generalized Spoofing Detection Inspired from Audio Generation Artifacts
Yang Gao
Tyler Vuong
Mahsa Elyasi
Gaurav Bharaj
Rita Singh
43
20
0
08 Apr 2021
Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward
Momina Masood
M. Nawaz
K. Malik
A. Javed
Aun Irtaza
AAML
149
311
0
25 Feb 2021
ID-Reveal: Identity-aware DeepFake Video Detection
D. Cozzolino
Andreas Rossler
Justus Thies
Matthias Nießner
L. Verdoliva
AAML
72
166
0
04 Dec 2020
Not made for each other- Audio-Visual Dissonance-based Deepfake Detection and Localization
Komal Chugh
Parul Gupta
Abhinav Dhall
Ramanathan Subramanian
63
170
0
29 May 2020
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques
Jenthe Thienpondt
Kris Demuynck
74
1,331
0
14 May 2020
Detecting Deep-Fake Videos from Appearance and Behavior
S. Agarwal
Tarek El-Gaaly
Hany Farid
Ser-Nam Lim
PICV
40
167
0
29 Apr 2020
Video Face Manipulation Detection Through Ensemble of CNNs
Nicolo Bonettini
E. D. Cannas
S. Mandelli
Luca Bondi
Paolo Bestagini
Stefano Tubaro
PICV
CVBM
AAML
47
218
0
16 Apr 2020
Media Forensics and DeepFakes: an overview
L. Verdoliva
91
548
0
18 Jan 2020
BUT System Description to VoxCeleb Speaker Recognition Challenge 2019
Hossein Zeinali
Shuai Wang
Anna Silnova
P. Matejka
Oldrich Plchot
DRL
67
247
0
16 Oct 2019
Detecting and Simulating Artifacts in GAN Fake Images
Xu-Yao Zhang
Svebor Karaman
Shih-Fu Chang
90
486
0
15 Jul 2019
Deep Residual Neural Networks for Audio Spoofing Detection
M. Alzantot
Ziqi Wang
Mani B. Srivastava
55
167
0
30 Jun 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
61
608
0
09 Apr 2019
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks
Hafiz Malik
39
26
0
18 Feb 2019
Exposing Deep Fakes Using Inconsistent Head Poses
Xin Yang
Yuezun Li
Siwei Lyu
CVBM
70
882
0
01 Nov 2018
Exposing DeepFake Videos By Detecting Face Warping Artifacts
Yuezun Li
Siwei Lyu
AAML
CVBM
60
911
0
01 Nov 2018
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
350
2,276
0
14 Jun 2018
Attentive Statistics Pooling for Deep Speaker Embedding
K. Okabe
Takafumi Koshinaka
Koichi Shinoda
92
530
0
29 Mar 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
RJ Skerry-Ryan
Eric Battenberg
Y. Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
R. Clark
Rif A. Saurous
54
554
0
24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Y. Xiao
Fei Ren
Ye Jia
Rif A. Saurous
64
826
0
23 Mar 2018
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
424
26,481
0
05 Sep 2017
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
122
2,273
0
26 Jun 2017
Tacotron: Towards End-to-End Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
...
Samy Bengio
Quoc V. Le
Yannis Agiomyrgiannakis
R. Clark
Rif A. Saurous
155
1,823
0
29 Mar 2017
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
193,878
0
10 Dec 2015
1