v1v2 (latest)

Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos

14 December 2024

Haoran Li

Papers citing "Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos"

50 / 53 papers shown

Title
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning Xin Liu Yaran Chen Dong Zhao 77 2 0 20 May 2024
Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang Guangqi Jiang Yanjie Ze Huazhe Xu VGen 115 26 0 21 Dec 2023
Learning to Act without Actions Dominik Schmidt Minqi Jiang OffRL 137 38 0 17 Dec 2023
Adversarial Imitation Learning from Visual Observations using Latent Information Vittorio Giammarino Tomas Landelius I. Paschalidis 97 7 0 29 Sep 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges Maryam Zare P. Kebria Abbas Khosravi Saeid Nahavandi 95 102 0 05 Sep 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions Seohong Park Dibya Ghosh Benjamin Eysenbach Sergey Levine OffRL 128 61 0 22 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization Jinxin Liu Hongyin Zhang Zifeng Zhuang Yachen Kang Donglin Wang Bin Wang OffRL 115 8 0 26 Jun 2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning Ruijie Zheng Xiyao Wang Yanchao Sun Shuang Ma Jieyu Zhao Huazhe Xu Hal Daumé Furong Huang 90 40 0 22 Jun 2023
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer Bo-fan Zhou Ke Li Jiechuan Jiang Zongqing Lu ViT OffRL 53 10 0 22 Jun 2023
Video Prediction Models as Rewards for Reinforcement Learning Alejandro Escontrela Ademi Adeniji Wilson Yan Ajay Jain Xue Bin Peng Ken Goldberg Youngwoon Lee Danijar Hafner Pieter Abbeel 108 59 0 23 May 2023
Reinforcement Learning from Passive Data via Latent Intentions Dibya Ghosh Chethan Bhateja Sergey Levine OffRL 97 48 0 10 Apr 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning Xin Liu Yaran Chen Haoran Li Boyu Li Dong Zhao SSL 129 10 0 11 Feb 2023
Multi-View Masked World Models for Visual Robotic Manipulation Younggyo Seo Junsup Kim Stephen James Kimin Lee Jinwoo Shin Pieter Abbeel VGen 105 60 0 05 Feb 2023
STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation Yupeng Zheng Chengliang Zhong Pengfei Li Huan-ang Gao Yuhang Zheng ... Ling Wang Hao Zhao Guyue Zhou Qichao Zhang Dong Zhao 85 38 0 02 Feb 2023
Visual Imitation Learning with Patch Rewards Minghuan Liu Tairan He Weinan Zhang Shuicheng Yan Zhongwen Xu SSL 99 14 0 02 Feb 2023
Mastering Diverse Domains through World Models Danijar Hafner J. Pašukonis Jimmy Ba Timothy Lillicrap 94 617 0 10 Jan 2023
Masked World Models for Visual Control Younggyo Seo Danijar Hafner Hao Liu Fangchen Liu Stephen James Kimin Lee Pieter Abbeel OffRL 177 149 0 28 Jun 2022
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning Yang Yue Bingyi Kang Zhongwen Xu Gao Huang Shuicheng Yan OffRL 99 13 0 25 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos Bowen Baker Ilge Akkaya Peter Zhokhov Joost Huizinga Jie Tang Adrien Ecoffet Brandon Houghton Raul Sampedro Jeff Clune OffRL 159 304 0 23 Jun 2022
A Survey on Model-based Reinforcement Learning Fan Luo Tian Xu Hang Lai Xiong-Hui Chen Weinan Zhang Yang Yu OffRL LRM 121 110 0 19 Jun 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 107 123 0 25 Mar 2022
Masked Visual Pre-training for Motor Control Tete Xiao Ilija Radosavovic Trevor Darrell Jitendra Malik SSL 119 250 0 11 Mar 2022
Human-Level Control through Directly-Trained Deep Spiking Q-Networks Guisong Liu Wenjie Deng Xiurui Xie Li Huang Huajin Tang OffRL 63 46 0 13 Dec 2021
URLB: Unsupervised Reinforcement Learning Benchmark Michael Laskin Denis Yarats Hao Liu Kimin Lee Albert Zhan Kevin Lu Catherine Cang Lerrel Pinto Pieter Abbeel SSL OffRL 86 140 0 28 Oct 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations Fei Deng Ingook Jang Sungjin Ahn VLM 79 62 0 27 Oct 2021
Intrinsically Motivated Self-supervised Learning in Reinforcement Learning Yue Zhao Chenzhuang Du Hang Zhao Tiejun Li SSL 60 5 0 26 Jun 2021
Reinforcement Learning with Prototypical Representations Denis Yarats Rob Fergus A. Lazaric Lerrel Pinto SSL 83 226 0 22 Feb 2021
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 192 875 0 05 Oct 2020
Decoupling Representation Learning from Reinforcement Learning Adam Stooke Kimin Lee Pieter Abbeel Michael Laskin SSL DRL 380 346 0 14 Sep 2020
Deep Reinforcement Learning based Automatic Exploration for Navigation in Unknown Environment Haoran Li Qichao Zhang Dongbin Zhao 60 198 0 23 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations Max Schwarzer Ankesh Anand Rishab Goel R. Devon Hjelm Aaron Courville Philip Bachman 118 321 0 12 Jul 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning A. Srinivas Michael Laskin Pieter Abbeel SSL DRL OffRL 145 1,097 0 08 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations Ting-Li Chen Simon Kornblith Mohammad Norouzi Geoffrey E. Hinton SSL 434 18,988 0 13 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey B. R. Kiran Ibrahim Sobh V. Talpaert Patrick Mannion A. A. Sallab S. Yogamani P. Pérez 367 1,710 0 02 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner Timothy Lillicrap Jimmy Ba Mohammad Norouzi VLM 190 1,378 0 03 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning K. Cobbe Christopher Hesse Jacob Hilton John Schulman 129 557 0 03 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning Kaiming He Haoqi Fan Yuxin Wu Saining Xie Ross B. Girshick SSL 275 12,174 0 13 Nov 2019
On Mutual Information Maximization for Representation Learning Michael Tschannen Josip Djolonga Paul Kishan Rubenstein Sylvain Gelly Mario Lucic SSL 196 502 0 31 Jul 2019
Model-Based Reinforcement Learning for Atari Lukasz Kaiser Mohammad Babaeizadeh Piotr Milos B. Osinski R. Campbell ... Sergey Levine Afroz Mohiuddin Ryan Sepassi George Tucker Henryk Michalewski OffRL 207 870 0 01 Mar 2019
Generative Adversarial Imitation from Observation F. Torabi Garrett A. Warnell Peter Stone GAN 98 245 0 17 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation Dmitry Kalashnikov A. Irpan P. Pastor Julian Ibarz Alexander Herzog ... Deirdre Quillen E. Holly Mrinal Kalakrishnan Vincent Vanhoucke Sergey Levine 184 1,473 0 27 Jun 2018
Imitating Latent Policies from Observation Ashley D. Edwards Himanshu Sahni Yannick Schroecker Charles Isbell 108 139 0 21 May 2018
Behavioral Cloning from Observation F. Torabi Garrett A. Warnell Peter Stone OffRL 145 732 0 04 May 2018
Zero-Shot Visual Imitation Deepak Pathak Parsa Mahmoudieh Guanghao Luo Pulkit Agrawal Dian Chen Yide Shentu Evan Shelhamer Jitendra Malik Alexei A. Efros Trevor Darrell LM&Ro 126 301 0 23 Apr 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 276 1,609 0 05 Feb 2018
Neural Discrete Representation Learning Aaron van den Oord Oriol Vinyals Koray Kavukcuoglu BDL SSL OCL 259 5,093 0 02 Nov 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 698 19,363 0 20 Jul 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks Max Jaderberg Volodymyr Mnih Wojciech M. Czarnecki Tom Schaul Joel Z Leibo David Silver Koray Kavukcuoglu SSL 121 1,229 0 16 Nov 2016
Generative Adversarial Imitation Learning Jonathan Ho Stefano Ermon GAN 203 3,132 0 10 Jun 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 415 13,333 0 09 Sep 2015