Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction

6 March 2021

Bohan Wu

Suraj Nair

Roberto Martin-Martin

Li Fei-Fei

Chelsea Finn

DRL

ArXiv PDF HTML

Papers citing "Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction"

24 / 24 papers shown

Title
iVideoGPT: Interactive VideoGPTs are Scalable World Models Jialong Wu Shaofeng Yin Ningya Feng Xu He Dong Li Jianye Hao Mingsheng Long VGen 40 23 0 24 May 2024
Block-local learning with probabilistic latent representations David Kappel Khaleelulla Khan Nazeer Cabrel Teguemne Fokam Christian Mayr Anand Subramoney 24 4 0 24 May 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes Aran Nayebi R. Rajalingham M. Jazayeri G. R. Yang 36 17 0 19 May 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes Haotian Xue Antonio Torralba J. Tenenbaum Daniel L. K. Yamins Yunzhu Li H. Tung PINN VGen AI4CE 55 8 0 22 Apr 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers Jaehoon Yoo Semin Kim Doyup Lee Chiheon Kim Seunghoon Hong 29 3 0 20 Mar 2023
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective Xintong Yang Ze Ji Jing Wu Yunyu Lai 33 12 0 09 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT Yihan Cao Siyu Li Yixin Liu Zhiling Yan Yutong Dai Philip S. Yu Lichao Sun 26 504 0 07 Mar 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models Austin Stone Ted Xiao Yao Lu K. Gopalakrishnan Kuang-Huei Lee ... Sean Kirmani Brianna Zitkovich F. Xia Chelsea Finn Karol Hausman LM&Ro 142 144 0 02 Mar 2023
Long-horizon video prediction using a dynamic latent hierarchy Alexey Zakharov Qinghai Guo Z. Fountas 19 4 0 29 Dec 2022
Exploiting Spatial-temporal Correlations for Video Anomaly Detection Mengyang Zhao Yang Liu Jing Li Xinhua Zeng 22 15 0 02 Nov 2022
Rethinking Learning Approaches for Long-Term Action Anticipation Megha Nawhal Akash Abdu Jyothi Greg Mori AI4TS 34 26 0 20 Oct 2022
A unified model for continuous conditional video prediction Xi Ye Guillaume-Alexandre Bilodeau AI4TS 34 7 0 11 Oct 2022
MaskViT: Masked Visual Pre-Training for Video Prediction Agrim Gupta Stephen Tian Yunzhi Zhang Jiajun Wu Roberto Martín-Martín Li Fei-Fei 100 110 0 23 Jun 2022
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation Vikram S. Voleti Alexia Jolicoeur-Martineau Christopher Pal DiffM VGen 13 290 0 19 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond Zheng Chang Xinfeng Zhang Shanshe Wang Siwei Ma Wen Gao 30 1 0 20 Apr 2022
Transframer: Arbitrary Frame Prediction with Generative Models C. Nash João Carreira Jacob Walker Iain Barr Andrew Jaegle Mateusz Malinowski Peter W. Battaglia ViT 19 37 0 17 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks Angel Villar-Corrales Ani J. Karapetyan Andreas Boltres Sven Behnke 19 11 0 17 Mar 2022
Diffusion Probabilistic Modeling for Video Generation Ruihan Yang Prakhar Srivastava Stephan Mandt DiffM VGen 36 255 0 16 Mar 2022
Variational Predictive Routing with Nested Subjective Timescales Alexey Zakharov Qinghai Guo Z. Fountas BDL AI4TS 35 9 0 21 Oct 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks Bohan Wu Suraj Nair Li Fei-Fei Chelsea Finn OffRL LM&Ro 38 24 0 21 Sep 2021
Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation Suraj Nair E. Mitchell Kevin Chen Brian Ichter Silvio Savarese Chelsea Finn LM&Ro OffRL 23 154 0 02 Sep 2021
Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network Zhibin Duan Dongsheng Wang Bo Chen Chaojie Wang Wenchao Chen Yewen Li J. Ren Mingyuan Zhou BDL 28 38 0 30 Jun 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines Daniel M. Bear E. Wang Damian Mrowca Felix Binder Hsiau-Yu Fish Tung ... Li Fei-Fei Nancy Kanwisher J. Tenenbaum Daniel L. K. Yamins Judith E. Fan OOD 55 86 0 15 Jun 2021
GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement Martin Engelcke Oiwi Parker Jones Ingmar Posner OCL 29 115 0 20 Apr 2021