PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning

23 May 2024

Hang Su

Jun Zhu

Papers citing "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning"


Task Aware Dreamer for Task Generalization in Reinforcement Learning Chengyang Ying Zhongkai Hao Xinning Zhou Hang Su Songming Liu Dong Yan Jun Zhu 187 3 0 17 Feb 2025
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies Michael Beukman Devon Jarvis Richard Klein Steven D. James Benjamin Rosman 67 12 0 25 Oct 2023
ManyQuadrupeds: Learning a Single Locomotion Policy for Diverse Quadruped Robots M. Shafiee Guillaume Bellegarda A. Ijspeert 54 29 0 16 Oct 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL Fei Ni Jianye Hao Yao Mu Yifu Yuan Yan Zheng Bin Wang Zhixuan Liang DiffM OffRL 84 46 0 31 May 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning Xin Liu Yaran Chen Haoran Li Boyu Li Dong Zhao SSL 97 10 0 11 Feb 2023
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem Chenyi Yu Weinan Zhang H. Lai Zheng Tian L. Kneip Jun Wang 90 15 0 18 Dec 2022
Choreographer: Learning and Adapting Skills in Imagination Pietro Mazzaglia Tim Verbelen Bart Dhoedt Alexandre Lacoste Sai Rajeswar 86 24 0 23 Nov 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning Andrew Zhao Matthieu Lin Yangguang Li Yang Liu Gao Huang 38 13 0 13 Oct 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels Sai Rajeswar Pietro Mazzaglia Tim Verbelen Alexandre Piché Bart Dhoedt Rameswar Panda Alexandre Lacoste SSL 62 21 0 24 Sep 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos Bowen Baker Ilge Akkaya Peter Zhokhov Joost Huizinga Jie Tang Adrien Ecoffet Brandon Houghton Raul Sampedro Jeff Clune OffRL 106 298 0 23 Jun 2022
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk Chengyang Ying Xinning Zhou Hang Su Dong Yan Ning Chen Jun Zhu 47 41 0 09 Jun 2022
A Generalist Agent Scott E. Reed Konrad Zolna Emilio Parisotto Sergio Gomez Colmenarejo Alexander Novikov ... Yutian Chen R. Hadsell Oriol Vinyals Mahyar Bordbar Nando de Freitas LM&Ro LLMAG AI4CE 186 810 0 12 May 2022
One After Another: Learning Incremental Skills for a Changing World Nur Muhammad (Mahi) Shafiullah Lerrel Pinto CLL 38 13 0 21 Mar 2022
Masked Autoencoders Are Scalable Vision Learners Kaiming He Xinlei Chen Saining Xie Yanghao Li Piotr Dollár Ross B. Girshick ViT TPM 427 7,705 0 11 Nov 2021
URLB: Unsupervised Reinforcement Learning Benchmark Michael Laskin Denis Yarats Hao Liu Kimin Lee Albert Zhan Kevin Lu Catherine Cang Lerrel Pinto Pieter Abbeel SSL OffRL 67 137 0 28 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine SSL OffRL 90 35 0 06 Oct 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 84 120 0 31 Aug 2021
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning Viktor Makoviychuk Lukasz Wawrzyniak Yunrong Guo Michelle Lu Kier Storey ... David Hoeller Nikita Rudin Arthur Allshire Ankur Handa Gavriel State 148 1,065 0 24 Aug 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability Dibya Ghosh Jad Rahme Aviral Kumar Amy Zhang Ryan P. Adams Sergey Levine OffRL 344 116 0 13 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning Jaekyeom Kim Seohong Park Gunhee Kim 50 33 0 27 Jun 2021
Behavior From the Void: Unsupervised Active Pre-Training Hao Liu Pieter Abbeel VLM SSL 75 200 0 08 Mar 2021
Embodied Intelligence via Learning and Evolution Agrim Gupta Silvio Savarese Surya Ganguli Li Fei-Fei AI4CE 79 247 0 03 Feb 2021
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 93 849 0 05 Oct 2020
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning Yuke Zhu J. Wong Ajay Mandlekar Roberto Martín-Martín Abhishek Joshi Soroush Nasiriany Yifeng Zhu 160 442 0 25 Sep 2020
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels Ilya Kostrikov Denis Yarats Rob Fergus OffRL 92 789 0 28 Apr 2020
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner Timothy Lillicrap Jimmy Ba Mohammad Norouzi VLM 108 1,349 0 03 Dec 2019
Self-Supervised Exploration via Disagreement Deepak Pathak Dhiraj Gandhi Abhinav Gupta SSL 73 380 0 10 Jun 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning Vitchyr H. Pong Murtaza Dalal Steven Lin Ashvin Nair Shikhar Bahl Sergey Levine OffRL SSL 69 276 0 08 Mar 2019
DeepMind Control Suite Yuval Tassa Yotam Doron Alistair Muldal Tom Erez Yazhe Li ... A. Abdolmaleki J. Merel Andrew Lefrancq Timothy Lillicrap Martin Riedmiller ELM LM&Ro BDL 127 1,133 0 02 Jan 2018
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 444 18,931 0 20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 106 2,433 0 15 May 2017
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 300 13,214 0 09 Sep 2015