Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10704
Cited By
Latent Video Transformer
18 June 2020
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Latent Video Transformer"
47 / 47 papers shown
Title
Object-Centric Image to Video Generation with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
DiffM
VGen
OCL
153
1
0
17 Feb 2025
Real-Time Video Generation with Pyramid Attention Broadcast
Xuanlei Zhao
Xiaolong Jin
Kai Wang
Yang You
VGen
DiffM
86
37
0
22 Aug 2024
Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving
Bernard Lange
Masha Itkina
Jiachen Li
Mykel J. Kochenderfer
55
4
0
30 Jul 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
144
252
0
05 Jan 2024
Jukebox: A Generative Model for Music
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
VLM
82
731
0
30 Apr 2020
Feature Quantization Improves GAN Training
Yang Zhao
Chunyuan Li
Ping Yu
Jianfeng Gao
Changyou Chen
MQ
37
47
0
05 Apr 2020
Transformation-based Adversarial Video Prediction on Large-Scale Data
Pauline Luc
Aidan Clark
Sander Dieleman
Diego de Las Casas
Yotam Doron
Albin Cassirer
Karen Simonyan
VGen
253
86
0
09 Mar 2020
Axial Attention in Multidimensional Transformers
Jonathan Ho
Nal Kalchbrenner
Dirk Weissenborn
Tim Salimans
58
525
0
20 Dec 2019
Few-shot Video-to-Video Synthesis
Ting-Chun Wang
Ming-Yuan Liu
Andrew Tao
Guilin Liu
Jan Kautz
Bryan Catanzaro
DiffM
VGen
113
367
0
28 Oct 2019
Scaling Autoregressive Video Models
Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
DiffM
VGen
62
200
0
06 Jun 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
92
1,788
0
02 Jun 2019
SinGAN: Learning a Generative Model from a Single Natural Image
Tamar Rott Shaham
Tali Dekel
T. Michaeli
GAN
VLM
83
839
0
02 May 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
Xizhou Zhu
Dazhi Cheng
Zheng Zhang
Stephen Lin
Jifeng Dai
65
409
0
11 Apr 2019
End-to-End Time-Lapse Video Synthesis from a Single Outdoor Image
Seonghyeon Nam
Chongyang Ma
Menglei Chai
William Brendel
N. Xu
Seon Joo Kim
GAN
26
32
0
01 Apr 2019
Video Generation from Single Semantic Label Map
Junting Pan
Chengyu Wang
Xu Jia
Jing Shao
Lu Sheng
Junjie Yan
Xiaogang Wang
VGen
24
104
0
11 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
38
131
0
04 Mar 2019
Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
Jacob Menick
Nal Kalchbrenner
43
150
0
04 Dec 2018
Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs
Dinesh Acharya
Zhiwu Huang
D. Paudel
Luc Van Gool
GAN
24
68
0
04 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
196
5,363
0
28 Sep 2018
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
79
389
0
28 Sep 2018
Video-to-Video Synthesis
Ting-Chun Wang
Ming-Yuan Liu
Jun-Yan Zhu
Guilin Liu
Andrew Tao
Jan Kautz
Bryan Catanzaro
GAN
VGen
57
987
0
20 Aug 2018
A Short Note about Kinetics-600
João Carreira
Eric Noland
Andras Banki-Horvath
Chloe Hillier
Andrew Zisserman
56
520
0
03 Aug 2018
Learning to Decompose and Disentangle Representations for Video Prediction
Jun-Ting Hsieh
Bingbin Liu
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
DRL
161
305
0
11 Jun 2018
PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning
Yunbo Wang
Zhifeng Gao
Mingsheng Long
Jianmin Wang
Philip S. Yu
102
472
0
17 Apr 2018
Stochastic Adversarial Video Prediction
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL
VGen
GAN
41
450
0
04 Apr 2018
Fast Decoding in Sequence Models using Discrete Latent Variables
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
35
231
0
09 Mar 2018
Stochastic Video Generation with a Learned Prior
Emily L. Denton
Rob Fergus
VGen
63
525
0
21 Feb 2018
PixelSNAIL: An Improved Autoregressive Generative Model
Xi Chen
Nikhil Mishra
Mostafa Rohaninejad
Pieter Abbeel
DRL
DiffM
BDL
GAN
48
270
0
28 Dec 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
Katsunori Ohnishi
Shohei Yamamoto
Yoshitaka Ushiku
Tatsuya Harada
VGen
GAN
47
60
0
27 Nov 2017
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
139
4,928
0
02 Nov 2017
Self-Supervised Visual Planning with Temporal Skip Connections
F. Ebert
Chelsea Finn
Alex X. Lee
Sergey Levine
SSL
57
318
0
15 Oct 2017
MoCoGAN: Decomposing Motion and Content for Video Generation
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
104
1,138
0
17 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
321
129,831
0
12 Jun 2017
Unsupervised Learning of Disentangled Representations from Video
Emily L. Denton
Vighnesh Birodkar
DRL
CoGe
OOD
54
552
0
31 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
186
3,771
0
19 May 2017
PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications
Tim Salimans
A. Karpathy
Xi Chen
Diederik P. Kingma
41
933
0
19 Jan 2017
Temporal Generative Adversarial Nets with Singular Value Clipping
Masaki Saito
Eiichi Matsumoto
Shunta Saito
GAN
48
445
0
21 Nov 2016
Video Pixel Networks
Nal Kalchbrenner
Aaron van den Oord
Karen Simonyan
Ivo Danihelka
Oriol Vinyals
Alex Graves
Koray Kavukcuoglu
38
423
0
03 Oct 2016
Efficient softmax approximation for GPUs
Edouard Grave
Armand Joulin
Moustapha Cissé
David Grangier
Hervé Jégou
55
271
0
14 Sep 2016
Generating Videos with Scene Dynamics
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
GAN
VGen
135
1,464
0
08 Sep 2016
Conditional Image Generation with PixelCNN Decoders
Aaron van den Oord
Nal Kalchbrenner
Oriol Vinyals
L. Espeholt
Alex Graves
Koray Kavukcuoglu
VLM
110
2,490
0
16 Jun 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
43
1,042
0
23 May 2016
Spatio-temporal video autoencoder with differentiable memory
Viorica Patraucean
Ankur Handa
R. Cipolla
56
307
0
19 Nov 2015
Deep multi-scale video prediction beyond mean square error
Michaël Mathieu
Camille Couprie
Yann LeCun
GAN
92
1,881
0
17 Nov 2015
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
437
7,952
0
13 Jun 2015
Unsupervised Learning of Video Representations using LSTMs
Nitish Srivastava
Elman Mansimov
Ruslan Salakhutdinov
SSL
107
2,586
0
16 Feb 2015
Video (language) modeling: a baseline for generative models of natural videos
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
62
471
0
20 Dec 2014
1