ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.14395
  4. Cited By
UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for
  Temporal Forgery Localization

UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization

28 August 2023
Rui Zhang
Hongxia Wang
Ming-han Du
Hanqing Liu
Yangqiaoyu Zhou
Q. Zeng
ArXivPDFHTML

Papers citing "UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization"

35 / 35 papers shown
Title
Weakly-supervised Audio Temporal Forgery Localization via Progressive Audio-language Co-learning Network
Weakly-supervised Audio Temporal Forgery Localization via Progressive Audio-language Co-learning Network
Junyan Wu
Wenbo Xu
Wei Lu
Xiangyang Luo
Rui Yang
Shize Guo
81
0
0
03 May 2025
DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization
DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization
C. Koutlis
Symeon Papadopoulos
95
4
0
15 Nov 2024
DeViT: Deformed Vision Transformers in Video Inpainting
DeViT: Deformed Vision Transformers in Video Inpainting
Jiayin Cai
Changlin Li
Xin Tao
Chun Yuan
Yu-Wing Tai
ViT
51
13
0
28 Sep 2022
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
78
11
0
01 Sep 2022
Flow-Guided Transformer for Video Inpainting
Flow-Guided Transformer for Video Inpainting
Kaiwen Zhang
Jingjing Fu
Dong Liu
ViT
64
72
0
14 Aug 2022
Video Manipulations Beyond Faces: A Dataset with Human-Machine Analysis
Video Manipulations Beyond Faces: A Dataset with Human-Machine Analysis
Trisha Mittal
Ritwik Sinha
Viswanathan Swaminathan
John Collomosse
Tianyi Zhou
63
9
0
26 Jul 2022
Deepfake Video Detection with Spatiotemporal Dropout Transformer
Deepfake Video Detection with Spatiotemporal Dropout Transformer
Daichi Zhang
Fanzhao Lin
Yingying Hua
Pengju Wang
Dan Zeng
Shiming Ge
ViT
85
38
0
14 Jul 2022
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech
  Synthesis
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis
Yongqiang Wang
Zhou Zhao
57
10
0
08 Jul 2022
Towards An End-to-End Framework for Flow-Guided Video Inpainting
Towards An End-to-End Framework for Flow-Guided Video Inpainting
Zerui Li
Cheng Lu
Jia Qin
Chunle Guo
Mingg-Ming Cheng
89
153
0
06 Apr 2022
ActionFormer: Localizing Moments of Actions with Transformers
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
63
342
0
16 Feb 2022
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
114
73
0
07 Dec 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schonherr
DiffM
175
128
0
04 Nov 2021
FuseFormer: Fusing Fine-Grained Information in Transformers for Video
  Inpainting
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
R. Liu
Hanming Deng
Yangyi Huang
Xiaoyu Shi
Lewei Lu
Wenxiu Sun
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
65
127
0
07 Sep 2021
Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series Signals
Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series Signals
Yu-xin Zhang
Yiqiang Chen
Jindong Wang
Zhiwen Pan
AI4TS
45
192
0
27 Jul 2021
Combining EfficientNet and Vision Transformers for Video Deepfake
  Detection
Combining EfficientNet and Vision Transformers for Video Deepfake Detection
D. Coccomini
Nicola Messina
Claudio Gennaro
Fabrizio Falchi
ViT
76
172
0
06 Jul 2021
Learning Salient Boundary Feature for Anchor-free Temporal Action
  Localization
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
Chuming Lin
C. Xu
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Yanwei Fu
74
255
0
24 Mar 2021
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
Yinan He
Bei Gan
Siyu Chen
Yichun Zhou
Guojun Yin
Luchuan Song
Lu Sheng
Jing Shao
Ziwei Liu
AAML
69
137
0
09 Mar 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
76
300
0
22 Feb 2021
Relaxed Transformer Decoders for Direct Action Proposal Generation
Relaxed Transformer Decoders for Direct Action Proposal Generation
Jing Tan
Jiaqi Tang
Limin Wang
Gangshan Wu
ViT
106
182
0
03 Feb 2021
Activity Graph Transformer for Temporal Action Localization
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal
Greg Mori
83
71
0
21 Jan 2021
WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection
WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection
Bojia Zi
Minghao Chang
Jingjing Chen
Xingjun Ma
Yu-Gang Jiang
CVBM
127
386
0
05 Jan 2021
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware
  Clues
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues
Yuyang Qian
Guojun Yin
Lu Sheng
Zixuan Chen
Jing Shao
CVBM
131
688
0
18 Jul 2020
Not made for each other- Audio-Visual Dissonance-based Deepfake
  Detection and Localization
Not made for each other- Audio-Visual Dissonance-based Deepfake Detection and Localization
Komal Chugh
Parul Gupta
Abhinav Dhall
Ramanathan Subramanian
63
170
0
29 May 2020
DeepFaceLab: Integrated, flexible and extensible face-swapping framework
DeepFaceLab: Integrated, flexible and extensible face-swapping framework
Ivan Perov
Daiheng Gao
Nikolay Chervoniy
Kunlin Liu
Sugasa Marangonda
...
Jian Jiang
Sheng Zhang
Pingyu Wu
Wenbo Zhou
Weiming Zhang
CVBM
48
225
0
12 May 2020
Face X-ray for More General Face Forgery Detection
Face X-ray for More General Face Forgery Detection
Lingzhi Li
Jianmin Bao
Ting Zhang
Hao Yang
Dong Chen
Fang Wen
B. Guo
PICV
CVBM
61
838
0
31 Dec 2019
The Deepfake Detection Challenge (DFDC) Preview Dataset
The Deepfake Detection Challenge (DFDC) Preview Dataset
Brian Dolhansky
Russ Howes
Ben Pflaum
Nicole Baram
Cristian Canton Ferrer
59
498
0
19 Oct 2019
Deep High-Resolution Representation Learning for Visual Recognition
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
390
3,613
0
20 Aug 2019
FaceForensics++: Learning to Detect Manipulated Facial Images
FaceForensics++: Learning to Detect Manipulated Facial Images
Andreas Rossler
D. Cozzolino
L. Verdoliva
Christian Riess
Justus Thies
Matthias Nießner
CVBM
111
2,076
0
25 Jan 2019
SlowFast Networks for Video Recognition
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
164
3,273
0
10 Dec 2018
Two-Stream Neural Networks for Tampered Face Detection
Two-Stream Neural Networks for Tampered Face Detection
Peng Zhou
Xintong Han
Vlad I. Morariu
L. Davis
PICV
CVBM
54
543
0
29 Mar 2018
Soft-NMS -- Improving Object Detection With One Line of Code
Soft-NMS -- Improving Object Detection With One Line of Code
Navaneeth Bodla
Bharat Singh
Rama Chellappa
L. Davis
ObjD
85
1,788
0
14 Apr 2017
Feature Pyramid Networks for Object Detection
Feature Pyramid Networks for Object Detection
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
468
22,102
0
09 Dec 2016
CNN Architectures for Large-Scale Audio Classification
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
120
2,498
0
29 Sep 2016
Temporal Segment Networks: Towards Good Practices for Deep Action
  Recognition
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
102
3,833
0
02 Aug 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
410
10,482
0
21 Jul 2016
1