Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.03458
Cited By
v1
v2 (latest)
Video Diffusion Models
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Video Diffusion Models"
50 / 1,256 papers shown
Title
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts
Yue Ma
Yin-Yin He
Hongfa Wang
Andong Wang
Chenyang Qi
...
Xiu Li
Zhifeng Li
H. Shum
Wei Liu
Qifeng Chen
VGen
DiffM
161
43
0
13 Mar 2024
DragAnything: Motion Control for Anything using Entity Representation
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Yan Li
Di Zhang
VGen
145
62
0
12 Mar 2024
Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions
Lan Wang
Vishnu Boddeti
Sernam Lim
VGen
DiffM
56
0
0
11 Mar 2024
An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Yudong Yang
Rongfeng Su
Xiaokang Liu
Nan Yan
Lan Wang
MedIm
DiffM
44
1
0
09 Mar 2024
Large Generative Model Assisted 3D Semantic Communication
Feibo Jiang
Yubo Peng
Li Dong
Kezhi Wang
Kun Yang
Cunhua Pan
Xiaohu You
61
9
0
09 Mar 2024
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation
Weijie Li
Litong Gong
Yiran Zhu
Fanda Fan
Biao Wang
Tiezheng Ge
Bo Zheng
VGen
DiffM
60
3
0
05 Mar 2024
Time Weaver: A Conditional Time Series Generation Model
Sai Shankar Narasimhan
Shubhankar Agarwal
Oguzhan Akcin
Sujay Sanghavi
Sandeep Chinchali
DiffM
MedIm
118
21
0
05 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGen
DiffM
105
8
0
04 Mar 2024
Context-aware Talking Face Video Generation
Meidai Xuanyuan
Yuwang Wang
Honglei Guo
Qionghai Dai
DiffM
75
0
0
28 Feb 2024
PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis
Jason J. Yu
Tristan Aumentado-Armstrong
Fereshteh Forghani
Konstantinos G. Derpanis
Marcus A. Brubaker
77
5
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
290
22
0
28 Feb 2024
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Yazhou Xing
Yin-Yin He
Zeyue Tian
Xintao Wang
Qifeng Chen
116
57
0
27 Feb 2024
Label-Noise Robust Diffusion Models
Byeonghu Na
Yeongmin Kim
Heesun Bae
Jung Hyun Lee
Seho Kwon
Wanmo Kang
Il-Chul Moon
NoLa
DiffM
112
8
0
27 Feb 2024
Sora Generates Videos with Stunning Geometrical Consistency
Xuanyi Li
Daquan Zhou
Chenxu Zhang
Shaodong Wei
Qibin Hou
Ming-Ming Cheng
EGVM
63
16
0
27 Feb 2024
Accelerating Diffusion Sampling with Optimized Time Steps
Shuchen Xue
Zhaoqiang Liu
Fei Chen
Shifeng Zhang
Tianyang Hu
Enze Xie
Zhenguo Li
DiffM
145
29
0
27 Feb 2024
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
121
56
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Cinematographic Camera Diffusion Model
Hongda Jiang
Xi Wang
Marc Christie
Libin Liu
Baoquan Chen
DiffM
VGen
85
9
0
25 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen
DiffM
69
1
0
24 Feb 2024
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktaschel
VGen
VLM
156
188
0
23 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
134
62
0
22 Feb 2024
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren
Yang Zhou
Jimei Yang
Jing Shi
Difan Liu
Feng Liu
Mingi Kwon
Abhinav Shrivastava
DiffM
VGen
140
37
0
22 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
100
30
0
20 Feb 2024
Human Video Translation via Query Warping
Haiming Zhu
Yangyang Xu
Shengfeng He
DiffM
92
0
0
19 Feb 2024
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
Sungjun Ahn
Hyun-Jeong Yim
Youngwan Lee
Sung-Ik Park
VGen
92
4
0
19 Feb 2024
Magic-Me: Identity-Specific Video Customized Diffusion
Ze Ma
Daquan Zhou
Chun-Hsiao Yeh
Xue-She Wang
Xiuyu Li
Huanrui Yang
Zhen Dong
Kurt Keutzer
Jiashi Feng
VGen
DiffM
86
32
0
14 Feb 2024
World Model on Million-Length Video And Language With Blockwise RingAttention
Hao Liu
Wilson Yan
Matei A. Zaharia
Pieter Abbeel
VGen
140
85
0
13 Feb 2024
Rolling Diffusion Models
David Ruhe
Jonathan Heek
Tim Salimans
Emiel Hoogeboom
DiffM
100
41
0
12 Feb 2024
Towards Fast Stochastic Sampling in Diffusion Generative Models
Kushagra Pandey
Maja R. Rudolph
Stephan Mandt
DiffM
66
0
0
11 Feb 2024
Sequential Flow Straightening for Generative Modeling
Jongmin Yoon
Juho Lee
60
0
0
09 Feb 2024
Controllable seismic velocity synthesis using generative diffusion models
Fu Wang
Xinquan Huang
T. Alkhalifah
DiffM
59
7
0
09 Feb 2024
Stable Autonomous Flow Matching
Christopher Iliffe Sprague
Arne Elofsson
Hossein Azizpour
94
1
0
08 Feb 2024
Blue noise for diffusion models
Xingchang Huang
Corentin Salaün
C. N. Vasconcelos
Christian Theobalt
Cengiz Öztireli
Gurprit Singh
DiffM
89
11
0
07 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffM
VGen
126
66
0
06 Feb 2024
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
Xixi Hu
Bo Liu
Xingchao Liu
Qiang Liu
90
15
0
06 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGen
DiffM
155
86
0
05 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Di Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
113
51
0
05 Feb 2024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Yiyuan Zhang
Yuhao Kang
Zhixin Zhang
Xiaohan Ding
Sanyuan Zhao
Xiangyu Yue
VGen
93
4
0
05 Feb 2024
Robust Inverse Graphics via Probabilistic Inference
Tuan Anh Le
Pavel Sountsov
Matthew D. Hoffman
Ben Lee
Brian Patton
Rif A. Saurous
73
0
0
02 Feb 2024
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Jiawei Wang
Yuchen Zhang
Jiaxin Zou
Yan Zeng
Guoqiang Wei
Liping Yuan
Hang Li
DiffM
VGen
104
51
0
02 Feb 2024
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning
Fu-Yun Wang
Zhaoyang Huang
Xiaoyu Shi
Weikang Bian
Guanglu Song
Yu Liu
Hongsheng Li
62
24
0
01 Feb 2024
Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators
Daniel Geng
Andrew Owens
DiffM
83
34
0
31 Jan 2024
Advances in 3D Generation: A Survey
Xiaoyu Li
Qi Zhang
Di Kang
Weihao Cheng
Yiming Gao
Jingbo Zhang
Zhihao Liang
Jing Liao
Yan-Pei Cao
Ying Shan
160
43
0
31 Jan 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
116
30
0
30 Jan 2024
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
Cheng-i Wang
Julian McAuley
Taylor Berg-Kirkpatrick
Nicholas J. Bryan
DiffM
119
41
0
22 Jan 2024
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
83
39
0
17 Jan 2024
UniVG: Towards UNIfied-modal Video Generation
Ludan Ruan
Lei Tian
Chuanwei Huang
Xu Zhang
Xinyan Xiao
VGen
DiffM
81
3
0
17 Jan 2024
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior
Zike Wu
Pan Zhou
Xuanyu Yi
Xiaoding Yuan
Hanwang Zhang
DiffM
89
41
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
259
323
0
17 Jan 2024
Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation
Mathis Petrovich
Or Litany
Umar Iqbal
Michael J. Black
Gül Varol
Xue Bin Peng
Davis Rempe
DiffM
VGen
106
44
0
16 Jan 2024
Previous
1
2
3
...
14
15
16
...
24
25
26
Next