ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.02813
  4. Cited By
WaveFake: A Data Set to Facilitate Audio Deepfake Detection

WaveFake: A Data Set to Facilitate Audio Deepfake Detection

4 November 2021
Joel Frank
Lea Schonherr
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveFake: A Data Set to Facilitate Audio Deepfake Detection"

50 / 67 papers shown
Title
GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection
GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection
Xiaocan Chen
Qilin Yin
Jiarui Liu
Wei Lu
Xiangyang Luo
Jiantao Zhou
CVBM
141
1
0
18 Dec 2024
Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey
Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey
Hong-Hanh Nguyen-Le
Van-Tuan Tran
Dinh-Thuc Nguyen
Nhien-An Le-Khac
AAML
166
2
0
26 Nov 2024
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
Canh Vu
99
5
0
23 Sep 2024
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild
Jee-weon Jung
Yihan Wu
Xin Wang
Ji-Hoon Kim
Soumi Maiti
...
Joon Son Chung
Wangyou Zhang
Seyun Um
Shinnosuke Takamichi
Shinji Watanabe
128
4
0
18 Sep 2024
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
298
30,149
0
01 Mar 2022
ASVspoof 2019: spoofing countermeasures for the detection of
  synthesized, converted and replayed speech
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A. Nautsch
Xin Wang
Nicholas W. D. Evans
Tomi Kinnunen
Ville Vestman
Massimiliano Todisco
Héctor Delgado
Md. Sahidullah
Junichi Yamagishi
Kong Aik Lee
168
152
0
11 Feb 2021
ESPnet-se: end-to-end speech enhancement and separation toolkit designed
  for asr integration
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
86
82
0
07 Nov 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High
  Fidelity Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,947
0
12 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffMBDL
158
1,468
0
21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffMBDL
109
793
0
02 Sep 2020
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware
  Clues
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues
Yuyang Qian
Guojun Yin
Lu Sheng
Zixuan Chen
Jing Shao
CVBM
131
695
0
18 Jul 2020
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic
  Speech Recognition and Speaker Identification Systems
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
H. Abdullah
Kevin Warren
Vincent Bindschaedler
Nicolas Papernot
Patrick Traynor
AAML
69
129
0
13 Jul 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
105
1,406
0
08 Jun 2020
End-to-End Adversarial Text-to-Speech
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
72
187
0
05 Jun 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,155
0
16 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality
  Text-to-Speech
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
Geng Yang
Shan Yang
Kai-Chun Liu
Peng Fang
Wei Chen
Lei Xie
129
199
0
11 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
95
165
0
21 Apr 2020
Attribution in Scale and Space
Attribution in Scale and Space
Shawn Xu
Subhashini Venugopalan
Mukund Sundararajan
FAttBDL
50
72
0
03 Apr 2020
Evading Deepfake-Image Detectors with White- and Black-Box Attacks
Evading Deepfake-Image Detectors with White- and Black-Box Attacks
Nicholas Carlini
Hany Farid
AAML
63
149
0
01 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker
  Verification using Raw Waveforms
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
58
60
0
01 Apr 2020
Leveraging Frequency Analysis for Deep Fake Image Recognition
Leveraging Frequency Analysis for Deep Fake Image Recognition
Joel Frank
Thorsten Eisenhofer
Lea Schonherr
Asja Fischer
D. Kolossa
Thorsten Holz
76
559
0
19 Mar 2020
Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are
  Failing to Reproduce Spectral Distributions
Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions
Ricard Durall
Margret Keuper
J. Keuper
71
337
0
03 Mar 2020
CNN-generated images are surprisingly easy to spot... for now
CNN-generated images are surprisingly easy to spot... for now
Sheng-Yu Wang
Oliver Wang
Richard Y. Zhang
Andrew Owens
Alexei A. Efros
OOD
154
987
0
23 Dec 2019
Common Voice: A Massively-Multilingual Speech Corpus
Common Voice: A Massively-Multilingual Speech Corpus
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
91
1,614
0
13 Dec 2019
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
541
42,591
0
03 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio
WaveFlow: A Compact Flow-based Model for Raw Audio
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
75
117
0
03 Dec 2019
Parallel WaveGAN: A fast waveform generation model based on generative
  adversarial networks with multi-resolution spectrogram
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
60
818
0
25 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source
  End-to-End Text-to-Speech Toolkit
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Tomoki Hayashi
Ryuichi Yamamoto
Katsuki Inoue
Takenori Yoshimura
Shinji Watanabe
Tomoki Toda
K. Takeda
Yu Zhang
Xu Tan
VLM
90
205
0
24 Oct 2019
MelGAN: Generative Adversarial Networks for Conditional Waveform
  Synthesis
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
Kundan Kumar
Rithesh Kumar
T. Boissière
L. Gestin
Wei Zhen Teoh
Jose M. R. Sotelo
A. D. Brébisson
Yoshua Bengio
Aaron Courville
GAN
165
956
0
08 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
286
240
0
25 Sep 2019
Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech
  Recognition Systems
Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Lea Schonherr
Thorsten Eisenhofer
Steffen Zeiler
Thorsten Holz
D. Kolossa
AAML
77
65
0
05 Aug 2019
Detecting and Simulating Artifacts in GAN Fake Images
Detecting and Simulating Artifacts in GAN Fake Images
Xu-Yao Zhang
Svebor Karaman
Shih-Fu Chang
96
488
0
15 Jul 2019
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase
  Spectra for Statistical Parametric Speech Synthesis
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis
Yang Ai
Zhenhua Ling
112
29
0
23 Jun 2019
Neural source-filter waveform models for statistical parametric speech
  synthesis
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
79
118
0
27 Apr 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco
Xin Wang
Ville Vestman
Md. Sahidullah
Héctor Delgado
A. Nautsch
Junichi Yamagishi
Nicholas W. D. Evans
Tomi Kinnunen
Kong Aik Lee
74
616
0
09 Apr 2019
Detecting GAN generated Fake Images using Co-occurrence Matrices
Detecting GAN generated Fake Images using Co-occurrence Matrices
L. Nataraj
Tajuddin Manhar Mohammed
S. Chandrasekaran
A. Flenner
Jawadul H. Bappy
Amit K. Roy-Chowdhury
B. S. Manjunath
GAN
81
270
0
15 Mar 2019
On Evaluating Adversarial Robustness
On Evaluating Adversarial Robustness
Nicholas Carlini
Anish Athalye
Nicolas Papernot
Wieland Brendel
Jonas Rauber
Dimitris Tsipras
Ian Goodfellow
Aleksander Madry
Alexey Kurakin
ELMAAML
98
905
0
18 Feb 2019
FaceForensics++: Learning to Detect Manipulated Facial Images
FaceForensics++: Learning to Detect Manipulated Facial Images
Andreas Rossler
D. Cozzolino
L. Verdoliva
Christian Riess
Justus Thies
Matthias Nießner
CVBM
113
2,095
0
25 Jan 2019
Do GANs leave artificial fingerprints?
Do GANs leave artificial fingerprints?
Francesco Marra
Diego Gragnaniello
L. Verdoliva
Giovanni Poggi
GAN
62
324
0
31 Dec 2018
Detecting GAN-generated Imagery using Color Cues
Detecting GAN-generated Imagery using Color Cues
Scott McCloskey
Michael Albright
GAN
44
157
0
19 Dec 2018
FloWaveNet : A Generative Flow for Raw Audio
FloWaveNet : A Generative Flow for Raw Audio
Sungwon Kim
Sang-gil Lee
Jongyoon Song
Jaehyeon Kim
Sungroh Yoon
72
169
0
06 Nov 2018
Exposing DeepFake Videos By Detecting Face Warping Artifacts
Exposing DeepFake Videos By Detecting Face Warping Artifacts
Yuezun Li
Siwei Lyu
AAMLCVBM
62
915
0
01 Nov 2018
WaveGlow: A Flow-based Generative Network for Speech Synthesis
WaveGlow: A Flow-based Generative Network for Speech Synthesis
R. Prenger
Rafael Valle
Bryan Catanzaro
155
1,036
0
31 Oct 2018
Attentive Filtering Networks for Audio Replay Attack Detection
Attentive Filtering Networks for Audio Replay Attack Detection
Cheng-I Jeff Lai
A. Abad
Korin Richmond
Junichi Yamagishi
Najim Dehak
Simon King
AAML
83
80
0
31 Oct 2018
Adversarial Attacks Against Automatic Speech Recognition Systems via
  Psychoacoustic Hiding
Adversarial Attacks Against Automatic Speech Recognition Systems via Psychoacoustic Hiding
Lea Schonherr
Katharina Kohls
Steffen Zeiler
Thorsten Holz
D. Kolossa
AAML
77
291
0
16 Aug 2018
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Ming-Yu Liu
Kainan Peng
Jitong Chen
58
347
0
19 Jul 2018
TequilaGAN: How to easily identify GAN samples
TequilaGAN: How to easily identify GAN samples
Rafael Valle
Wilson Cai
Anish Doshi
GAN
45
12
0
13 Jul 2018
Glow: Generative Flow with Invertible 1x1 Convolutions
Glow: Generative Flow with Invertible 1x1 Convolutions
Diederik P. Kingma
Prafulla Dhariwal
BDLDRL
300
3,141
0
09 Jul 2018
ESPnet: End-to-End Speech Processing Toolkit
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
114
1,513
0
30 Mar 2018
GAN-based Synthetic Medical Image Augmentation for increased CNN
  Performance in Liver Lesion Classification
GAN-based Synthetic Medical Image Augmentation for increased CNN Performance in Liver Lesion Classification
Maayan Frid-Adar
I. Diamant
Eyal Klang
Michal Amitai
Jacob Goldberger
H. Greenspan
GANMedIm
91
1,563
0
03 Mar 2018
12
Next