ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.14176
  4. Cited By
A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual
  Deepfake Detection

A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection

20 June 2024
Kyungbok Lee
You Zhang
Zhiyao Duan
ArXivPDFHTML

Papers citing "A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection"

13 / 13 papers shown
Title
AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting
  Multiple Experts for Video Deepfake Detection
AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection
Ammarah Hashmi
Sahibzada Adil Shahzad
Chia-Wen Lin
Yu Tsao
Hsin-Min Wang
ViT
86
6
0
19 Oct 2023
Integrating Audio-Visual Features for Multimodal Deepfake Detection
Integrating Audio-Visual Features for Multimodal Deepfake Detection
Sneha Muppalla
Shan Jia
Siwei Lyu
37
20
0
05 Oct 2023
SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice
  Anti-Spoofing
SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing
Sivan Ding
You Zhang
Z. Duan
66
28
0
04 Nov 2022
Rethinking Audio-visual Synchronization for Active Speaker Detection
Rethinking Audio-visual Synchronization for Active Speaker Detection
Abudukelimu Wuerkaixi
You Zhang
Z. Duan
Changshui Zhang
36
10
0
21 Jun 2022
ADD 2022: the First Audio Deep Synthesis Detection Challenge
ADD 2022: the First Audio Deep Synthesis Detection Challenge
Jiangyan Yi
Ruibo Fu
J. Tao
Shuai Nie
Haoxin Ma
...
Le Xu
Zhengqi Wen
Haizhou Li
Zheng Lian
Bin Liu
43
182
0
17 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
347
15,373
0
20 Dec 2021
Spatiotemporal Inconsistency Learning for DeepFake Video Detection
Spatiotemporal Inconsistency Learning for DeepFake Video Detection
Zhihao Gu
Yang Chen
Taiping Yao
Shouhong Ding
Jilin Li
Feiyue Huang
Lizhuang Ma
55
152
0
04 Sep 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
67
358
0
29 Jun 2021
One-class Learning Towards Synthetic Voice Spoofing Detection
One-class Learning Towards Synthetic Voice Spoofing Detection
You Zhang
Fei Jiang
Z. Duan
54
215
0
27 Oct 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The
  Wild
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
96
777
0
23 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
102
321
0
09 Aug 2020
Not made for each other- Audio-Visual Dissonance-based Deepfake
  Detection and Localization
Not made for each other- Audio-Visual Dissonance-based Deepfake Detection and Localization
Komal Chugh
Parul Gupta
Abhinav Dhall
Ramanathan Subramanian
61
170
0
29 May 2020
Perfect match: Improved cross-modal embeddings for audio-visual
  synchronisation
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
Soo-Whan Chung
Joon Son Chung
Hong-Goo Kang
41
117
0
21 Sep 2018
1