Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.09710
Cited By
SimDA: Simple Diffusion Adapter for Efficient Video Generation
18 August 2023
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SimDA: Simple Diffusion Adapter for Efficient Video Generation"
50 / 84 papers shown
Title
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffM
VGen
31
0
0
11 May 2025
Visual Prompting for One-shot Controllable Video Editing without Inversion
Zhengbo Zhang
Yuxi Zhou
Duo Peng
Joo-Hwee Lim
Zhigang Tu
De Wen Soh
Lin Geng Foo
DiffM
45
1
0
19 Apr 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Y. Wang
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVM
VGen
85
3
0
27 Mar 2025
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
Qiang Qu
Ming Li
Xiaoming Chen
Tongliang Liu
DiffM
VGen
63
0
0
24 Mar 2025
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
Zhiqiang Yuan
Ting Zhang
Ying Deng
Jiapei Zhang
Yeshuang Zhu
Zexi Jia
Jie Zhou
Jinchao Zhang
VGen
39
0
0
22 Mar 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Hui Zhang
Tingwei Gao
Jie Shao
Zuxuan Wu
69
0
0
20 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
66
0
0
20 Mar 2025
TPDiff: Temporal Pyramid Video Diffusion Model
L. Ran
Mike Zheng Shou
77
0
0
12 Mar 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang
Yiwei Guo
Yanbo Ding
Kunchang Li
Xinyuan Chen
Yaohui Wang
Fangyikang Wang
Ying Zhang
Chen Li
Y. Wang
45
0
0
10 Mar 2025
Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Sicheng Xie
Haidong Cao
Zejia Weng
Zhen Xing
Shiwei Shen
Jiaqi Leng
Xipeng Qiu
Yanwei Fu
Zuxuan Wu
Yu Jiang
53
0
0
23 Feb 2025
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
Junxian Ma
Shiwen Wang
Jian Yang
Junyi Hu
Jian Liang
Guosheng Lin
Jingbo Chen
Kai Li
Yu Meng
DiffM
VGen
61
3
0
17 Feb 2025
IPO: Iterative Preference Optimization for Text-to-Video Generation
Xiaomeng Yang
Zhiyu Tan
Xuecheng Nie
VGen
103
1
0
04 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
61
1
0
02 Feb 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
44
7
0
23 Jan 2025
Grid Diffusion Models for Text-to-Video Generation
Taegyeong Lee
Soyeong Kwon
Taehwan Kim
54
5
0
31 Dec 2024
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
51
9
0
31 Dec 2024
StableAnimator: High-Quality Identity-Preserving Human Image Animation
Shuyuan Tu
Zhen Xing
Xintong Han
Zhi-Qi Cheng
Qi Dai
Chong Luo
Zuxuan Wu
VGen
107
14
0
26 Nov 2024
Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction
J. Vice
Naveed Akhtar
Richard I. Hartley
Ajmal Saeed Mian
Ajmal Mian
DiffM
86
0
0
21 Nov 2024
Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge
Qinglong Cao
Ding Wang
Xirui Li
Yuntian Chen
Chao Ma
Xiaokang Yang
DiffM
VGen
113
2
0
18 Nov 2024
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
Zhenwei Zhang
VGen
MedIm
57
1
0
03 Nov 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
Zheng Zhan
Yushu Wu
Yifan Gong
Zichong Meng
Zhenglun Kong
Changdi Yang
Geng Yuan
Pu Zhao
Wei Niu
Yanzhi Wang
VGen
42
4
0
02 Nov 2024
Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images
Vishal Dubey
DiffM
MedIm
33
0
0
29 Oct 2024
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
Qiuheng Wang
Yukai Shi
Jiarong Ou
R. J. Chen
Ke Lin
...
Mingwu Zheng
Xin Tao
Fei Yang
Pengfei Wan
Di Zhang
VGen
86
18
0
10 Oct 2024
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
Onkar Susladkar
Jishu Sen Gupta
Chirag Sehgal
Sparsh Mittal
Rekha Singhal
DiffM
VGen
40
0
0
10 Oct 2024
Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation
Chenyu Wang
Shuo Yan
Yixuan Chen
Yujiang Wang
Mingzhi Dong
...
Qin Lv
Fan Yang
Tun Lu
Ning Gu
Li Shang
DiffM
VGen
33
0
0
19 Sep 2024
GenRec: Unifying Video Generation and Recognition with Diffusion Models
Zejia Weng
Xitong Yang
Zhen Xing
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
42
5
0
27 Aug 2024
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Yitong Yang
Yinglin Wang
Jing Wang
Tian Zhang
DiffM
34
1
0
24 Aug 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
60
22
0
26 Jun 2024
VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Jing Gu
Yuwei Fang
Ivan Skorokhodov
Peter Wonka
Xinya Du
Sergey Tulyakov
Xin Eric Wang
39
1
0
18 Jun 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang
Yi-Xin Jiang
Zehuan Yuan
Binyue Peng
Zuxuan Wu
Yu-Gang Jiang
ViT
VGen
78
36
0
13 Jun 2024
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Miaosen Zhang
Yixuan Wei
Zhen Xing
Yifei Ma
Zuxuan Wu
...
Zheng-Wei Zhang
Qi Dai
Chong Luo
Xin Geng
Baining Guo
VLM
46
1
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
69
2
0
13 Jun 2024
AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Kai Wang
Shijian Deng
Jing Shi
Dimitrios Hatzinakos
Yapeng Tian
VGen
72
8
0
11 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Qi Dai
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
43
14
0
10 Jun 2024
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
Xingrui Wang
Xin Li
Zhibo Chen
DiffM
47
1
0
07 Jun 2024
CLoG: Benchmarking Continual Learning of Image Generation Models
Haotian Zhang
Junting Zhou
Haowei Lin
Hang Ye
Jianhua Zhu
Zihao Wang
Liangcai Gao
Yizhou Wang
Yitao Liang
DiffM
VLM
29
0
0
07 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
62
30
0
03 Jun 2024
Enhanced Creativity and Ideation through Stable Video Synthesis
Elijah Miller
Thomas Dupont
Mingming Wang
VGen
30
0
0
22 May 2024
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices
Nathaniel Cohen
Vladimir Kulikov
Matan Kleiner
Inbar Huberman-Spiegelglas
T. Michaeli
VGen
DiffM
30
15
0
20 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
20
0
17 May 2024
TAVGBench: Benchmarking Text to Audible-Video Generation
Yuxin Mao
Xuyang Shen
Jing Zhang
Zhen Qin
Jinxing Zhou
Mochu Xiang
Yiran Zhong
Yuchao Dai
40
11
0
22 Apr 2024
Zero-shot High-fidelity and Pose-controllable Character Animation
Bingwen Zhu
Fanyi Wang
Tianyi Lu
Peng Liu
Jingwen Su
Jinxiu Liu
Yanhao Zhang
Zuxuan Wu
Guo-Jun Qi
Yu-Gang Jiang
DiffM
VGen
52
6
0
21 Apr 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
50
15
0
18 Apr 2024
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Zhongwei Zhang
Fuchen Long
Yingwei Pan
Zhaofan Qiu
Ting Yao
Yang Cao
Tao Mei
VGen
43
22
0
25 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
144
308
0
21 Mar 2024
S2DM: Sector-Shaped Diffusion Models for Video Generation
Haoran Lang
Yuxuan Ge
Zheng Tian
DiffM
VGen
35
0
0
20 Mar 2024
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
Zhenghao Zhang
Zuozhuo Dai
Long Qin
Weizhi Wang
DiffM
VGen
36
2
0
18 Mar 2024
FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Qijun Feng
Zhen Xing
Zuxuan Wu
Yu-Gang Jiang
3DGS
37
4
0
15 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
36
40
0
08 Mar 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Chao Xu
Yang Liu
Jiazheng Xing
Weida Wang
Mingze Sun
...
Tianxin Huang
Siyuan Li
Zhi-Qi Cheng
Ying Tai
Baigui Sun
CVBM
46
11
0
04 Mar 2024
1
2
Next