2304.08818
Cited By
v1
v2 (latest)
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
18 April 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
Papers citing "Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models"
50 / 273 papers shown
Title
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Dinesh Manocha
MoE
159
19
0
14 Oct 2024
VideoAgent: Self-Improving Video Generation
Achint Soni
Sreyas Venkataraman
Abhranil Chandra
Sebastian Fischmeister
Percy Liang
Bo Dai
Sherry Yang
LM&Ro
VGen
168
11
0
14 Oct 2024
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
Onkar Susladkar
Jishu Sen Gupta
Chirag Sehgal
Sparsh Mittal
Rekha Singhal
DiffM
VGen
109
0
0
10 Oct 2024
Progressive Autoregressive Video Diffusion Models
Desai Xie
Zhan Xu
Yicong Hong
Hao Tan
Difan Liu
Feng Liu
Arie E. Kaufman
Yang Zhou
DiffM
VGen
127
15
0
10 Oct 2024
Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion
Enrico Ventura
Beatrice Achilli
Gianluigi Silvestri
Carlo Lucibello
L. Ambrogioni
DiffM
135
10
0
08 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
128
26
0
08 Oct 2024
Pyramidal Flow Matching for Efficient Video Generative Modeling
Yang Jin
Zhicheng Sun
Ningyuan Li
Kun Xu
K. Xu
...
Nan Zhuang
Quzhe Huang
Yang Song
Yadong Mu
Zhouchen Lin
VGen
168
87
0
08 Oct 2024
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler
Serin Yang
Taesung Kwon
Jong Chul Ye
VGen
DiffM
111
7
0
08 Oct 2024
Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration
Zhiyu Zhu
Jinhui Hou
Hui Liu
H. Zeng
Junhui Hou
81
0
0
07 Oct 2024
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
T. Pham
Tri Ton
Chang D. Yoo
105
3
0
03 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
163
35
0
03 Oct 2024
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models
Seyedmorteza Sadat
Otmar Hilliges
Romann M. Weber
DiffM
64
13
0
03 Oct 2024
Text2PDE: Latent Diffusion Models for Accessible Physics Simulation
Anthony Zhou
Zijie Li
Michael Schneier
John R Buchanan Jr
Amir Barati Farimani
AI4CE
DiffM
178
8
0
02 Oct 2024
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Yuanxing Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen
DiffM
123
1
0
30 Sep 2024
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Yuki Mitsufuji
VGen
DiffM
165
4
0
26 Sep 2024
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
A. Popov
Alperen Degirmenci
David Wehr
Shashank Hegde
Ryan Oldja
...
David Nistér
Urs Muller
Ruchi Bhargava
Stan Birchfield
Nikolai Smolyanskiy
161
11
0
25 Sep 2024
Single Image, Any Face: Generalisable 3D Face Generation
Wenqing Wang
Haosen Yang
Josef Kittler
Xiatian Zhu
3DH
150
0
0
25 Sep 2024
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Xinrui Zhou
Yuhao Huang
Haoran Dou
Shijing Chen
Ao Chang
...
Jie Jessie Ren
Ruobing Huang
Jun Cheng
Wufeng Xue
Dong Ni
MedIm
389
0
0
25 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
138
30
0
24 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
156
9
0
23 Sep 2024
HSIGene: A Foundation Model For Hyperspectral Image Generation
Li Pang
Datao Tang
Shuang Xu
Deyu Meng
Xiangyong Cao
DiffM
84
16
0
19 Sep 2024
OSV: One Step is Enough for High-Quality Image to Video Generation
Xiaofeng Mao
Zhengkai Jiang
Fu-Yun Wang
Wenbing Zhu
Hao Chen
Mingmin Chi
Yabiao Wang
Wenhan Luo
DiffM
VGen
129
13
0
17 Sep 2024
LT3SD: Latent Trees for 3D Scene Diffusion
Quan Meng
Lei Li
Matthias Nießner
Angela Dai
174
14
0
12 Sep 2024
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models
Rohit Jena
Ali Taghibakhshi
Sahil Jain
Gerald Shen
Nima Tajbakhsh
Arash Vahdat
97
5
0
09 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen
DiffM
170
6
0
06 Sep 2024
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Qianlong Xiang
Miao Zhang
Yuzhang Shang
Jianlong Wu
Yan Yan
Liqiang Nie
DiffM
127
10
0
05 Sep 2024
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Gaojie Lin
Jianwen Jiang
Chao Liang
Tianyun Zhong
Jiaqi Yang
Yanbo Zheng
VGen
DiffM
144
19
0
03 Sep 2024
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
Fan Liu
Wenqiang Sun
Hanyang Wang
Yikai Wang
Haowen Sun
Junliang Ye
Jun Zhang
Yueqi Duan
VGen
123
41
0
29 Aug 2024
Constrained Diffusion Models via Dual Training
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
113
4
0
27 Aug 2024
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
Xiaojuan Wang
Boyang Zhou
Brian L. Curless
Ira Kemelmacher-Shlizerman
Aleksander Holynski
Steven M. Seitz
DiffM
122
17
0
27 Aug 2024
Diffusion Models Are Real-Time Game Engines
Dani Valevski
Yaniv Leviathan
Moab Arar
Shlomi Fruchter
DiffM
VGen
AI4CE
139
91
0
27 Aug 2024
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention
Mengkang Hu
DiffM
119
10
0
01 Aug 2024
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie
Chun-Han Yao
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
VGen
156
47
0
24 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
207
50
0
17 Jul 2024
Kinetic Typography Diffusion Model
Seonmi Park
Inhwan Bae
Seunghyun Shin
Hae-Gon Jeon
DiffM
116
2
0
15 Jul 2024
T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models
Yibo Miao
Yifan Zhu
Yinpeng Dong
Lijia Yu
Jun Zhu
Xiao-Shan Gao
EGVM
131
20
0
08 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
140
40
0
02 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
98
14
0
02 Jul 2024
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev
Nina Konovalova
Daniil Selikhanovych
Nikolay Patakin
...
Anton Konushin
Peter Wonka
Alexander Filippov
Evgeny Burnaev
DiffM
184
1
0
21 Jun 2024
Training-free Camera Control for Video Generation
Chen Hou
Guoqiang Wei
VGen
DiffM
203
40
0
14 Jun 2024
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Rishit Dagli
Shivesh Prakash
Robert Wu
H. Khosravani
143
6
0
06 Jun 2024
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
Tianchen Zhao
Tongcheng Fang
Haofeng Huang
Enshu Liu
Widyadewi Soedarmadji
...
Shengen Yan
Huazhong Yang
Xuefei Ning
Yu Wang
MQ
VGen
195
35
0
04 Jun 2024
On the Hardness of Sampling from Mixture Distributions via Langevin Dynamics
Xiwei Cheng
Kexin Fu
Farzan Farnia
124
0
0
04 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Vitor Campagnolo Guizilini
Yue Wang
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
123
43
0
03 Jun 2024
EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing
Hadrien Reynaud
Qingjie Meng
Mischa Dombrowski
Arijit Ghosh
Thomas Day
Alberto Gomez
Paul Leeson
Bernhard Kainz
MedIm
82
7
0
02 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
169
103
0
27 May 2024
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
152
9
0
23 May 2024
AdjointDEIS: Efficient Gradients for Diffusion Models
Zander W. Blasingame
Chen Liu
DiffM
160
5
0
23 May 2024
Enhanced Creativity and Ideation through Stable Video Synthesis
Elijah Miller
Thomas Dupont
Mingming Wang
VGen
61
1
0
22 May 2024
MagicPose4D: Crafting Articulated Models with Appearance and Motion Control
Hao Zhang
Di Chang
Fang Li
Mohammad Soleymani
Narendra Ahuja
110
8
0
22 May 2024