Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.02303
Cited By
Imagen Video: High Definition Video Generation with Diffusion Models
5 October 2022
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
A. Gritsenko
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Imagen Video: High Definition Video Generation with Diffusion Models"
50 / 1,164 papers shown
Title
Video Diffusion Models with Local-Global Context Guidance
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen
DiffM
17
13
0
05 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
31
9
0
04 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
33
316
0
03 Jun 2023
DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting
Salva Rühling Cachay
Bo Zhao
Hailey James
Rose Yu
DiffM
AI4TS
26
57
0
03 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGen
DiffM
43
24
0
02 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
24
152
0
01 Jun 2023
Extracting Reward Functions from Diffusion Models
Felipe Nuti
Tim Franzmeyer
João F. Henriques
19
14
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
28
39
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
39
84
0
01 Jun 2023
DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
Xiaoliang Ju
Zhaoyang Huang
Yijin Li
Guofeng Zhang
Yu Qiao
Hongsheng Li
19
7
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
32
24
0
01 Jun 2023
Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao
Jingxiang Sun
Cheng Peng
Zerong Zheng
Boyao Zhou
Hongwen Zhang
Yebin Liu
DiffM
24
23
0
31 May 2023
Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Giannis Daras
Kulin Shah
Y. Dagan
Aravind Gollakota
A. Dimakis
Adam R. Klivans
DiffM
45
67
0
30 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
19
5
0
30 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
48
88
0
29 May 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
30
45
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
30
31
0
29 May 2023
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Hyunsoo Lee
Minsoo Kang
Bohyung Han
DiffM
18
14
0
29 May 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo
Tianyang Hu
Shifeng Zhang
Jiacheng Sun
Zhenguo Li
Zhihua Zhang
35
106
0
29 May 2023
Alteration-free and Model-agnostic Origin Attribution of Generated Images
Zhenting Wang
Chen Chen
Yi Zeng
Lingjuan Lyu
Shiqing Ma
25
5
0
29 May 2023
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Zicheng Zhang
Bonan Li
Xuecheng Nie
Congying Han
Tiande Guo
Luoqi Liu
DiffM
20
24
0
27 May 2023
Functional Flow Matching
Gavin Kerrigan
Giosue Migliorini
Padhraic Smyth
39
13
0
26 May 2023
High-Fidelity Image Compression with Score-based Generative Models
Emiel Hoogeboom
E. Agustsson
Fabian Mentzer
Luca Versari
G. Toderici
Lucas Theis
DiffM
21
38
0
26 May 2023
ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond
Min Zhao
Rongzheng Wang
Fan Bao
Chongxuan Li
Jun Zhu
VGen
DiffM
21
4
0
26 May 2023
Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability
Haotian Xue
Alexandre Araujo
Bin Hu
Yongxin Chen
DiffM
35
41
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
30
165
0
25 May 2023
Trans-Dimensional Generative Modeling via Jump Diffusion Models
Andrew Campbell
William Harvey
Christian Weilbach
Valentin De Bortoli
Tom Rainforth
Arnaud Doucet
DiffM
33
10
0
25 May 2023
Unifying GANs and Score-Based Diffusion as Generative Particle Models
Jean-Yves Franceschi
Mike Gartrell
Ludovic Dos Santos
Thibaut Issenhuth
Emmanuel de Bezenac
Mickaël Chen
A. Rakotomamonjy
DiffM
18
21
0
25 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern H. Menze
DiffM
MedIm
42
16
0
25 May 2023
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
Tsu-Ching Hsiao
Haoming Chen
Hsuan-Kung Yang
Chun-Yi Lee
DiffM
23
7
0
25 May 2023
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Y. Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
DiffM
10
7
0
24 May 2023
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities
Kangfu Mei
Mo Zhou
Vishal M. Patel
DiffM
26
1
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
30
6
0
24 May 2023
Video Prediction Models as Rewards for Reinforcement Learning
Alejandro Escontrela
Ademi Adeniji
Wilson Yan
Ajay Jain
Xue Bin Peng
Ken Goldberg
Youngwoon Lee
Danijar Hafner
Pieter Abbeel
42
52
0
23 May 2023
DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation
Susung Hong
Junyoung Seo
Heeseong Shin
Sung‐Jin Hong
Seung Wook Kim
DiffM
VGen
28
34
0
23 May 2023
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
121
6
0
23 May 2023
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
Yufan Zhou
Ruiyi Zhang
Tongfei Sun
Jinhui Xu
DiffM
109
37
0
23 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
28
56
0
22 May 2023
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
316
0
22 May 2023
GSURE-Based Diffusion Model Training with Corrupted Data
Bahjat Kawar
Noam Elata
T. Michaeli
Michael Elad
DiffM
37
30
0
22 May 2023
ControlVideo: Training-free Controllable Text-to-Video Generation
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGen
DiffM
36
236
0
22 May 2023
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Shentong Mo
Jing Shi
Yapeng Tian
20
17
0
22 May 2023
Guided Motion Diffusion for Controllable Human Motion Synthesis
Korrawe Karunratanakul
Konpat Preechakul
Supasorn Suwajanakorn
Siyu Tang
DiffM
34
122
0
21 May 2023
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions
Bosheng Qin
Juncheng Li
Siliang Tang
Tat-Seng Chua
Yueting Zhuang
VGen
DiffM
31
16
0
21 May 2023
Any-to-Any Generation via Composable Diffusion
Zineng Tang
Ziyi Yang
Chenguang Zhu
Michael Zeng
Joey Tianyi Zhou
VGen
DiffM
33
171
0
19 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
36
44
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
DiffM
VGen
45
114
0
18 May 2023
LDM3D: Latent Diffusion Model for 3D
Gabriela Ben-Melech Stan
Diana Wofk
Scottie Fox
Alex Redden
Will Saxton
...
Estelle Aflalo
Shao-Yen Tseng
Fabio Nonato
Matthias Muller
Vasudev Lal
24
44
0
18 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yu Liu
Yogesh Balaji
DiffM
VGen
45
252
0
17 May 2023
Controllable Mind Visual Diffusion Model
Bo-Wen Zeng
Shanglin Li
Xuhui Liu
Sicheng Gao
Xiaolong Jiang
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
25
24
0
17 May 2023
Previous
1
2
3
...
19
20
21
22
23
24
Next