ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.10819
  4. Cited By
Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy

Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy

15 April 2025
Botao Zhao
Zuheng Kang
Yayun He
Xiaoyang Qu
Junqing Peng
Jing Xiao
Jianzong Wang
ArXiv (abs)PDFHTML

Papers citing "Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy"

17 / 17 papers shown
Title
DIRE for Diffusion-Generated Image Detection
DIRE for Diffusion-Generated Image Detection
Zhendong Wang
Jianmin Bao
Wen-gang Zhou
Weilun Wang
Hezhen Hu
Hong Chen
Houqiang Li
74
217
0
16 Mar 2023
Does Audio Deepfake Detection Generalize?
Does Audio Deepfake Detection Generalize?
Nicolas Müller
Pavel Czempin
Franziska Dieckmann
Adam Froghyar
Konstantin Böttinger
79
152
0
30 Mar 2022
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for
  Zero-shot Multi-speaker Text-to-Speech
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech
Bo Zhao
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DiffM
80
22
0
22 Feb 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
Pseudo Numerical Methods for Diffusion Models on Manifolds
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
103
652
0
20 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
474
15,734
0
20 Dec 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
254
1,896
0
26 Oct 2021
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph
  Attention Networks
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks
Jee-weon Jung
Hee-Soo Heo
Hemlata Tak
Hye-jin Shim
Joon Son Chung
Bong-Jin Lee
Ha-Jin Yu
Nicholas W. D. Evans
202
308
0
04 Oct 2021
Conditional Variational Autoencoder with Adversarial Learning for
  End-to-End Text-to-Speech
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
128
894
0
11 Jun 2021
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Vadim Popov
Ivan Vovk
Vladimir Gogoryan
Tasnima Sadekova
Mikhail Kudinov
DiffM
104
537
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
250
7,933
0
11 May 2021
Graph Attention Networks for Anti-Spoofing
Graph Attention Networks for Anti-Spoofing
Hemlata Tak
Jee-weon Jung
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
95
68
0
08 Apr 2021
Improved Denoising Diffusion Probabilistic Models
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
352
3,715
0
18 Feb 2021
HiFi-GAN: Generative Adversarial Networks for Efficient and High
  Fidelity Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,944
0
12 Oct 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
288
5,837
0
20 Jun 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
672
18,276
0
19 Jun 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
105
1,406
0
08 Jun 2020
Attentive Filtering Networks for Audio Replay Attack Detection
Attentive Filtering Networks for Audio Replay Attack Detection
Cheng-I Jeff Lai
A. Abad
Korin Richmond
Junichi Yamagishi
Najim Dehak
Simon King
AAML
83
80
0
31 Oct 2018
1