Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1412.6604
Cited By
Video (language) modeling: a baseline for generative models of natural videos
20 December 2014
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video (language) modeling: a baseline for generative models of natural videos"
50 / 127 papers shown
Title
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
42
24
0
24 Jun 2024
SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation
Liangyu Xu
Wanxuan Lu
Hongfeng Yu
Fanglong Yao
Xian Sun
Kun Fu
50
5
0
28 Feb 2024
USTEP: Spatio-Temporal Predictive Learning under A Unified View
Cheng Tan
Jue Wang
Zhangyang Gao
Siyuan Li
Stan Z. Li
40
1
0
09 Oct 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
27
150
0
18 Sep 2023
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis
Zihan Zhang
Richard Liu
Kfir Aberman
Rana Hanocka
DiffM
43
26
0
27 Jul 2023
Length of Stay prediction for Hospital Management using Domain Adaptation
Lyse Naomi Wamba Momo
Nyalleng Moorosi
E. Nsoesie
F. Rademakers
B. De Moor
OOD
12
1
0
29 Jun 2023
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM
VGen
41
373
0
20 Nov 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
29
7
0
07 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
68
375
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
64
1,480
0
05 Oct 2022
Image Classification using Sequence of Pixels
Gajraj Kuldeep
21
0
0
23 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
29
24
0
19 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
29
28
0
15 Sep 2022
Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning
A. Syed
Eman A. Aldhahri
M. Iqbal
Abid Ali
Ammar Muthanna
Harun Jamil
F. Jamil
3DH
20
2
0
23 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Lirong Wu
Yongjie Xu
Jun Xia
Siyuan Li
Stan Z. Li
48
107
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
112
112
0
23 Jun 2022
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
54
212
0
09 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
37
0
0
01 Jun 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
30
1
0
20 Apr 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
59
215
0
07 Apr 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
29
50
0
30 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
119
0
25 Mar 2022
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Tomávs Souvcek
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
21
32
0
22 Mar 2022
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Ivan Skorokhodov
Sergey Tulyakov
Mohamed Elhoseiny
VGen
40
279
0
29 Dec 2021
Wide and Narrow: Video Prediction from Context and Motion
Jaehoon Cho
Jiyoung Lee
Changjae Oh
Wonil Song
Kwanghoon Sohn
22
1
0
22 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning
Zhiyu Yao
Yunbo Wang
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
29
8
0
08 Oct 2021
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
UQCV
VGen
BDL
44
17
0
06 Oct 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
25
82
0
29 Sep 2021
A Framework for Multisensory Foresight for Embodied Agents
Xiaohui Chen
Ramtin Hosseini
K. Panetta
Jivko Sinapov
26
3
0
15 Sep 2021
Hierarchical Video Generation for Complex Data
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
22
4
0
04 Jun 2021
FDNet: A Deep Learning Approach with Two Parallel Cross Encoding Pathways for Precipitation Nowcasting
Bi Yan
Chao Yang
F. Chen
Kohei Takeda
Changjun Wang
29
13
0
06 May 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
29
109
0
30 Apr 2021
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
Yunbo Wang
Haixu Wu
Jianjin Zhang
Zhifeng Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
28
380
0
17 Mar 2021
Self-Supervision by Prediction for Object Discovery in Videos
Beril Besbinar
P. Frossard
SSL
31
7
0
09 Mar 2021
MotionRNN: A Flexible Model for Video Prediction with Spacetime-Varying Motions
Haixu Wu
Zhiyu Yao
J. Wan
Mingsheng Long
33
126
0
03 Mar 2021
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
24
67
0
02 Mar 2021
Deep Video Prediction for Time Series Forecasting
Zhen Zeng
T. Balch
Manuela Veloso
AI4TS
16
13
0
24 Feb 2021
Learning Temporal Dynamics from Cycles in Narrated Video
Dave Epstein
Jiajun Wu
Cordelia Schmid
Chen Sun
AI4TS
38
14
0
07 Jan 2021
Learning the Predictability of the Future
Dídac Surís
Ruoshi Liu
Carl Vondrick
24
71
0
01 Jan 2021
Mutual Information Based Method for Unsupervised Disentanglement of Video Representation
Aditya Sreekar
Ujjwal Tiwari
A. Namboodiri
DRL
26
4
0
17 Nov 2020
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
30
3
0
29 Jul 2020
Latent Video Transformer
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
33
119
0
18 Jun 2020
Going in circles is the way forward: the role of recurrence in visual inference
R. S. V. Bergen
N. Kriegeskorte
17
82
0
26 Mar 2020
Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames
O. Shouno
GAN
43
21
0
19 Mar 2020
Stochastic Latent Residual Video Prediction
Jean-Yves Franceschi
E. Delasalles
Mickaël Chen
Sylvain Lamprier
Patrick Gallinari
VGen
28
159
0
21 Feb 2020
Learning Predictive Models From Observation and Interaction
Karl Schmeckpeper
Annie Xie
Oleh Rybkin
Stephen Tian
Kostas Daniilidis
Sergey Levine
Chelsea Finn
DRL
33
60
0
30 Dec 2019
Action Anticipation with RBF Kernelized Feature Mapping RNN
Yuge Shi
Basura Fernando
Richard I. Hartley
34
82
0
18 Nov 2019
High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks
Ruben Villegas
Arkanath Pathak
Harini Kannan
D. Erhan
Quoc V. Le
Honglak Lee
VGen
22
136
0
05 Nov 2019
Markov Decision Process for Video Generation
V. Yushchenko
Nikita Araslanov
Stefan Roth
VGen
23
20
0
26 Sep 2019
Adversarial Video Generation on Complex Datasets
Aidan Clark
Jeff Donahue
Karen Simonyan
VGen
GAN
27
74
0
15 Jul 2019
1
2
3
Next