Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.04174
Cited By
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction
6 March 2021
Bohan Wu
Suraj Nair
Roberto Martin-Martin
Li Fei-Fei
Chelsea Finn
DRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction"
24 / 24 papers shown
Title
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
40
23
0
24 May 2024
Block-local learning with probabilistic latent representations
David Kappel
Khaleelulla Khan Nazeer
Cabrel Teguemne Fokam
Christian Mayr
Anand Subramoney
24
4
0
24 May 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
36
17
0
19 May 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue
Antonio Torralba
J. Tenenbaum
Daniel L. K. Yamins
Yunzhu Li
H. Tung
PINN
VGen
AI4CE
55
8
0
22 Apr 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
29
3
0
20 Mar 2023
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
33
12
0
09 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
26
504
0
07 Mar 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models
Austin Stone
Ted Xiao
Yao Lu
K. Gopalakrishnan
Kuang-Huei Lee
...
Sean Kirmani
Brianna Zitkovich
F. Xia
Chelsea Finn
Karol Hausman
LM&Ro
142
144
0
02 Mar 2023
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
19
4
0
29 Dec 2022
Exploiting Spatial-temporal Correlations for Video Anomaly Detection
Mengyang Zhao
Yang Liu
Jing Li
Xinhua Zeng
22
15
0
02 Nov 2022
Rethinking Learning Approaches for Long-Term Action Anticipation
Megha Nawhal
Akash Abdu Jyothi
Greg Mori
AI4TS
34
26
0
20 Oct 2022
A unified model for continuous conditional video prediction
Xi Ye
Guillaume-Alexandre Bilodeau
AI4TS
34
7
0
11 Oct 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Vikram S. Voleti
Alexia Jolicoeur-Martineau
Christopher Pal
DiffM
VGen
13
290
0
19 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
30
1
0
20 Apr 2022
Transframer: Arbitrary Frame Prediction with Generative Models
C. Nash
João Carreira
Jacob Walker
Iain Barr
Andrew Jaegle
Mateusz Malinowski
Peter W. Battaglia
ViT
19
37
0
17 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks
Angel Villar-Corrales
Ani J. Karapetyan
Andreas Boltres
Sven Behnke
19
11
0
17 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
36
255
0
16 Mar 2022
Variational Predictive Routing with Nested Subjective Timescales
Alexey Zakharov
Qinghai Guo
Z. Fountas
BDL
AI4TS
35
9
0
21 Oct 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation
Suraj Nair
E. Mitchell
Kevin Chen
Brian Ichter
Silvio Savarese
Chelsea Finn
LM&Ro
OffRL
23
154
0
02 Sep 2021
Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network
Zhibin Duan
Dongsheng Wang
Bo Chen
Chaojie Wang
Wenchao Chen
Yewen Li
J. Ren
Mingyuan Zhou
BDL
28
38
0
30 Jun 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines
Daniel M. Bear
E. Wang
Damian Mrowca
Felix Binder
Hsiau-Yu Fish Tung
...
Li Fei-Fei
Nancy Kanwisher
J. Tenenbaum
Daniel L. K. Yamins
Judith E. Fan
OOD
55
86
0
15 Jun 2021
GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement
Martin Engelcke
Oiwi Parker Jones
Ingmar Posner
OCL
29
115
0
20 Apr 2021
1