Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.17592
Cited By
v1
v2
v3 (latest)
VideoDirector: Precise Video Editing via Text-to-Video Models
26 November 2024
Yukun Wang
Longguang Wang
Zhiyuan Ma
Qibin Hu
Kai Xu
Yulan Guo
VGen
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VideoDirector: Precise Video Editing via Text-to-Video Models"
30 / 30 papers shown
Title
TurboEdit: Instant text-based image editing
Zongze Wu
Nicholas I. Kolkin
Jonathan Brandt
Richard Zhang
Eli Shechtman
DiffM
78
13
0
14 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
162
933
0
01 Aug 2024
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Pengyang Ling
Jiazi Bu
Pan Zhang
Xiaoyi Dong
Yuhang Zang
Tong Wu
H. Chen
Jiaqi Wang
Yi Jin
VGen
DiffM
81
41
0
08 Jun 2024
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices
Nathaniel Cohen
Vladimir Kulikov
Matan Kleiner
Inbar Huberman-Spiegelglas
T. Michaeli
VGen
DiffM
49
17
0
20 May 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
246
322
0
17 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
274
278
0
05 Jan 2024
VidToMe: Video Token Merging for Zero-Shot Video Editing
Xirui Li
Chao Ma
Xiaokang Yang
Ming-Hsuan Yang
DiffM
VGen
77
48
0
17 Dec 2023
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
Zhiyuan Ma
Guoli Jia
Bowen Zhou
DiffM
85
11
0
13 Dec 2023
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models
Ozgur Kara
Barışcan Kurtkaya
Hidir Yesiltepe
James M. Rehg
Pinar Yanardag
VGen
DiffM
90
54
0
07 Dec 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
198
451
0
29 Nov 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
106
87
0
09 Oct 2023
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai
Xun Guo
Gaoang Wang
Yang Lu
VGen
DiffM
75
157
0
18 Aug 2023
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer
Omer Bar-Tal
Shai Bagon
Tali Dekel
VGen
DiffM
122
271
0
19 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
136
877
0
10 Jul 2023
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
Yujun Shi
Chuhui Xue
Jun Hao Liew
Jiachun Pan
Hanshu Yan
Wenqing Zhang
Vincent Y. F. Tan
Song Bai
115
220
0
26 Jun 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
231
417
0
02 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
224
1,104
0
18 Apr 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
136
286
0
20 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
196
218
0
08 Mar 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
131
744
0
22 Dec 2022
Null-text Inversion for Editing Real Images using Guided Diffusion Models
Ron Mokady
Amir Hertz
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
114
537
0
17 Nov 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
282
2,895
0
25 Aug 2022
Understanding Diffusion Models: A Unified Perspective
Calvin Luo
DiffM
102
347
0
25 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
209
1,790
0
02 Aug 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
196
3,971
0
26 Jul 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
469
6,083
0
23 May 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
514
15,788
0
20 Dec 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
310
7,971
0
11 May 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
304
7,500
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
776
18,408
0
19 Jun 2020
1