Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.02303
Cited By
Imagen Video: High Definition Video Generation with Diffusion Models
5 October 2022
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
A. Gritsenko
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Imagen Video: High Definition Video Generation with Diffusion Models"
50 / 1,162 papers shown
Title
SceneScape: Text-Driven Consistent Scene Generation
Rafail Fridman
Amit Abecasis
Yoni Kasten
Tali Dekel
VGen
38
110
0
02 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
19
231
0
31 Jan 2023
Shape-aware Text-driven Layered Video Editing
Yao-Chih Lee
Ji-Ze Jang
Yi-Ting Chen
Elizabeth Qiu
Jia-Bin Huang
VGen
DiffM
39
53
0
30 Jan 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu
Zehua Chen
Yiitan Yuan
Xinhao Mei
Xubo Liu
Danilo P. Mandic
Wenwu Wang
Mark D. Plumbley
DiffM
35
467
0
29 Jan 2023
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Flavio Schneider
Ojasv Kamal
Zhijing Jin
Bernhard Schölkopf
MGen
27
83
0
27 Jan 2023
PLay: Parametrically Conditioned Layout Generation using Latent Diffusion
Ching-Yi Cheng
Forrest Huang
Gang Li
Yang Li
DiffM
19
29
0
27 Jan 2023
Text-To-4D Dynamic Scene Generation
Uriel Singer
Shelly Sheynin
Adam Polyak
Oron Ashual
Iurii Makarov
...
Naman Goyal
Andrea Vedaldi
Devi Parikh
Justin Johnson
Yaniv Taigman
DiffM
30
147
0
26 Jan 2023
Imitating Human Behaviour with Diffusion Models
Tim Pearce
Tabish Rashid
Anssi Kanervisto
David Bignell
Mingfei Sun
...
Sergio Valcarcel Macua
Shan Zheng Tan
Ida Momennejad
Katja Hofmann
Sam Devlin
DiffM
32
203
0
25 Jan 2023
Bipartite Graph Diffusion Model for Human Interaction Generation
Baptiste Chopin
Hao Tang
Mohamed Daoudi
DiffM
21
12
0
24 Jan 2023
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang
Zan Wang
Puhao Li
Baoxiong Jia
Tengyu Liu
Yixin Zhu
Wei Liang
Song-Chun Zhu
DiffM
64
201
0
15 Jan 2023
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Michal Stypulkowski
Konstantinos Vougioukas
Sen He
Maciej Ziȩba
Stavros Petridis
M. Pantic
DiffM
18
116
0
06 Jan 2023
TeViS:Translating Text Synopses to Video Storyboards
Xu Gu
Yuchong Sun
Feiyue Ni
Shizhe Chen
Xihua Wang
Ruihua Song
Yangqiu Song
Xiang Cao
DiffM
23
4
0
31 Dec 2022
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
L. Zhang
44
10
0
28 Dec 2022
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
W. Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
29
691
0
22 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan
Y. Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM
VGen
33
174
0
19 Dec 2022
Latent Diffusion for Language Generation
Justin Lovelace
Varsha Kishore
Chao-gang Wan
Eliot Shekhtman
Kilian Q. Weinberger
DiffM
24
71
0
19 Dec 2022
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Alex Nichol
Heewoo Jun
Prafulla Dhariwal
Pamela Mishkin
Mark Chen
DiffM
20
584
0
16 Dec 2022
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
Qiucheng Wu
Yujian Liu
Handong Zhao
Ajinkya Kale
T. Bui
Tong Yu
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
CoGe
24
97
0
16 Dec 2022
SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation
Jee Seok Yoon
Chenghao Zhang
Heung-Il Suk
Jia Guo
Xiaoxia Li
DiffM
MedIm
22
38
0
16 Dec 2022
CLIPPO: Image-and-Language Understanding from Pixels Only
Michael Tschannen
Basil Mustafa
N. Houlsby
CLIP
VLM
32
47
0
15 Dec 2022
The Infinite Index: Information Retrieval on Generative Text-To-Image Models
Niklas Deckers
Maik Frobe
Johannes Kiesel
G. Pandolfo
Christopher Schröder
Benno Stein
Martin Potthast
DiffM
42
16
0
14 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLM
CLIP
59
739
0
14 Dec 2022
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Su Wang
Chitwan Saharia
Ceslee Montgomery
Jordi Pont-Tuset
Shai Noy
...
Radu Soricut
Jason Baldridge
Mohammad Norouzi
Peter Anderson
William Chan
35
173
0
13 Dec 2022
Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Tengfei Wang
Bo Zhang
Ting Zhang
Shuyang Gu
Jianmin Bao
...
Jingjing Shen
Dong Chen
Fang Wen
Qifeng Chen
B. Guo
35
279
0
12 Dec 2022
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go
Yunsung Lee
Jin-Young Kim
Seunghyun Lee
Myeongho Jeong
Hyun Seung Lee
Seungtaek Choi
DiffM
32
16
0
12 Dec 2022
Economic Systems in Metaverse: Basics, State of the Art, and Challenges
Huawei Huang
Qinnan Zhang
Taotao Li
Qinglin Yang
Zhaokang Yin
Junhao Wu
Zehui Xiong
Jianming Zhu
Jiajing Wu
Zibin Zheng
AILaw
34
27
0
12 Dec 2022
MAGVIT: Masked Generative Video Transformer
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffM
VGen
32
224
0
10 Dec 2022
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Shaoan Xie
Zhifei Zhang
Zhe-nan Lin
Tobias Hinz
Anton van den Hengel
DiffM
16
231
0
09 Dec 2022
Executing your Commands via Motion Diffusion in Latent Space
Xin Chen
Biao Jiang
Wen Liu
Zilong Huang
Bin-Bin Fu
Tao Chen
Jingyi Yu
Gang Yu
VGen
DiffM
25
337
0
08 Dec 2022
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models
Gimin Nam
Mariem Khlifi
Andrew Rodriguez
Alberto Tono
Linqi Zhou
Paul Guerrero
DiffM
29
68
0
01 Dec 2022
SinDDM: A Single Image Denoising Diffusion Model
Vladimir Kulikov
Shahar Yadin
Matan Kleiner
T. Michaeli
DiffM
18
77
0
29 Nov 2022
Continuous diffusion for categorical data
Sander Dieleman
Laurent Sartran
Arman Roshannai
Nikolay Savinov
Yaroslav Ganin
...
Conor Durkan
Curtis Hawthorne
Rémi Leblond
Will Grathwohl
J. Adler
DiffM
26
98
0
28 Nov 2022
Diffusion Probabilistic Model Made Slim
Xingyi Yang
Daquan Zhou
Jiashi Feng
Xinchao Wang
DiffM
27
103
0
27 Nov 2022
Efficient Video Prediction via Sparsely Conditioned Flow Matching
A. Davtyan
Sepehr Sameni
Paolo Favaro
VGen
DiffM
35
27
0
26 Nov 2022
3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models
Gang Li
Heliang Zheng
Chaoyue Wang
Chang Li
C. Zheng
Dacheng Tao
DiffM
26
59
0
25 Nov 2022
TPA-Net: Generate A Dataset for Text to Physics-based Animation
Yuxing Qiu
Feng Gao
Minchen Li
Govind Thattai
Yin Yang
Chenfanfu Jiang
PINN
DiffM
VGen
49
0
0
25 Nov 2022
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yin-Yin He
Tianyu Yang
Yong Zhang
Ying Shan
Qifeng Chen
DiffM
VGen
16
202
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
56
37
0
23 Nov 2022
EDICT: Exact Diffusion Inversion via Coupled Transformations
Bram Wallace
Akash Gokul
Nikhil Naik
DiffM
22
173
0
22 Nov 2022
SinFusion: Training Diffusion Models on a Single Image or Video
Yaniv Nikankin
Niv Haim
Michal Irani
VGen
24
69
0
21 Nov 2022
Beyond the Field-of-View: Enhancing Scene Visibility and Perception with Clip-Recurrent Transformer
Haowen Shi
Qi Jiang
Kailun Yang
Xiaoyue Yin
Ze Wang
Kaiwei Wang
ViT
43
5
0
21 Nov 2022
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM
VGen
39
372
0
20 Nov 2022
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
C. Karen Liu
28
222
0
19 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
24
165
0
17 Nov 2022
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Aditya Sanghi
Rao Fu
Vivian Liu
Karl Willis
Hooman Shayani
Amir Hosein Khasahmadi
Srinath Sridhar
Daniel E. Ritchie
19
52
0
02 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM
MoE
26
803
0
02 Nov 2022
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training
Junfan Lin
Jianlong Chang
Lingbo Liu
Guanbin Li
Liang Lin
Qi Tian
Changan Chen
VGen
48
40
0
28 Oct 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
29
16
0
27 Oct 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
21
288
0
26 Oct 2022
Categorical SDEs with Simplex Diffusion
Pierre Harvey Richemond
Sander Dieleman
Arnaud Doucet
DiffM
17
24
0
26 Oct 2022
Previous
1
2
3
...
22
23
24
Next