Title
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL Charles Packer Pieter Abbeel Joseph E. Gonzalez OffRL 71 17 0 02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation Todor Davchev Oleg O. Sushkov Jean-Baptiste Regli S. Schaal Y. Aytar Markus Wulfmeier Jonathan Scholz 44 18 0 01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition Zhizhou Ren Ruihan Guo Yuanshuo Zhou Jian-wei Peng 127 38 0 26 Nov 2021
Adaptive Multi-Goal Exploration Jean Tarbouriech O. D. Domingues Pierre Ménard Matteo Pirotta Michal Valko A. Lazaric 130 3 0 23 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information Matching Hiroki Furuta Y. Matsuo S. Gu OffRL 116 104 0 19 Nov 2021
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning Christopher Hoang Sungryull Sohn Jongwook Choi Wilka Carvalho Honglak Lee 74 32 0 18 Nov 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics Ingmar Schubert Danny Driess Ozgur S. Oguz Marc Toussaint OffRL 40 1 0 15 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets Daniel Eugênio Neves João Pedro Oliveira Batisteli Eduardo Felipe Lopes Lucila Ishitani Zenilton K. G. Patrocínio OffRL 25 1 0 12 Nov 2021
One model Packs Thousands of Items with Recurrent Conditional Query Learning Dongda Li Zhaoquan Gu Yuexuan Wang Changwei Ren F. Lau 81 17 0 12 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation Isabella Liu Shagun Uppal Gaurav Sukhatme Joseph J. Lim Péter Englert Youngwoon Lee 53 13 0 11 Nov 2021
Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments Eivind Bøhn E. M. Coates D. Reinhardt T. Johansen 43 29 0 07 Nov 2021
Automatic Goal Generation using Dynamical Distance Learning Bharat Prakash Nicholas R. Waytowich T. Mohsenin Tim Oates 41 2 0 07 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Dhruv Shah Peng Xu Yao Lu Ted Xiao Alexander Toshev Sergey Levine Brian Ichter OffRL 81 43 0 04 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning Wenlong Huang Igor Mordatch Pieter Abbeel Deepak Pathak 136 64 0 04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning Sindre Benjamin Remman Inga Strümke A. Lekkas CML 48 7 0 04 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems John Mern Kyle Hatch Ryan Silva Cameron Hickert Tamim I. Sookoor Mykel J. Kochenderfer AAML 68 7 0 03 Nov 2021
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space Giuseppe Paolo Alexandre Coninx Alban Laflaquière Stéphane Doncieux 34 3 0 02 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay Dogan C. Cicek Enes Duran Baturay Saglam Furkan B. Mutlu Suleyman S. Kozat OffRL 35 11 0 02 Nov 2021
Robot Learning from Randomized Simulations: A Review Fabio Muratore Fabio Ramos Greg Turk Wenhao Yu Michael Gienger Jan Peters AI4CE 119 83 0 01 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning Tianren Zhang Shangqi Guo Tian Tan Xiao M Hu Feng Chen 87 17 0 30 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment Tung M. Luu Chang D. Yoo 80 8 0 28 Oct 2021
Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration Brendan Hertel S. Ahmadzadeh 53 8 0 28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling Jesús Bujalance Martín Raphael Chekroun Fabien Moutarde OffRL 62 5 0 27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 109 18 0 27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs Beining Han Chongyi Zheng Harris Chan Keiran Paster Michael Ruogu Zhang Jimmy Ba OOD AI4CE 106 14 0 27 Oct 2021
Learning Diverse Policies in MOBA Games via Macro-Goals Yiming Gao Bei Shi Xueying Du Liang Wang Guangwei Chen ... Weixuan Wang Deheng Ye Qiang Fu Wei Yang Lanxiao Huang 76 11 0 27 Oct 2021
Multitask Adaptation by Retrospective Exploration with Learned World Models Artem Zholus Aleksandr I. Panov CLL 29 0 0 25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning Kibeom Kim Min Whoo Lee Yoonsung Kim Je-hwan Ryu Minsu Lee Byoung-Tak Zhang 71 8 0 25 Oct 2021
Learning Stochastic Shortest Path with Linear Function Approximation Steffen Czolbe Jiafan He Adrian Dalca Quanquan Gu 86 30 0 25 Oct 2021
Mixture-of-Variational-Experts for Continual Learning Y. Yin Yu Wang CLL FedML 59 6 0 25 Oct 2021
Contrastive Active Inference Pietro Mazzaglia Tim Verbelen Bart Dhoedt 80 26 0 19 Oct 2021
Discovering and Achieving Goals via World Models Russell Mendonca Oleh Rybkin Kostas Daniilidis Danijar Hafner Deepak Pathak 99 127 0 18 Oct 2021
Learn Proportional Derivative Controllable Latent Space from Pixels Weiyao Wang Marin Kobilarov Gregory Hager 75 1 0 15 Oct 2021
Wasserstein Unsupervised Reinforcement Learning Shuncheng He Yuhang Jiang Hongchang Zhang Jianzhun Shao Xiangyang Ji OffRL 93 23 0 15 Oct 2021
Improving the sample-efficiency of neural architecture search with reinforcement learning A. Nagy Ábel Boros 118 3 0 13 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning Jinghuan Shang Kumara Kahatapitiya Xiang Li Michael S. Ryoo OffRL 100 36 0 12 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces Marco Bagatella Miroslav Olsák Michal Rolínek Georg Martius OffRL 63 7 0 12 Oct 2021
Auditing Robot Learning for Safety and Compliance during Deployment Homanga Bharadhwaj 37 4 0 12 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization S. Gu Manfred Diaz Daniel Freeman Hiroki Furuta Seyed Kamyar Seyed Ghasemipour Anton Raichuk Byron David Erik Frey Erwin Coumans Olivier Bachem 80 14 0 10 Oct 2021
Learning Visual Shape Control of Novel 3D Deformable Objects from Partial-View Point Clouds Bao Thach Brian Y. Cho Alan Kuntz Tucker Hermans 3DPC 84 30 0 10 Oct 2021
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning Keyu Li Ye Lu Max Meng 24 9 0 09 Oct 2021
Improving Kinodynamic Planners for Vehicular Navigation with Learned Goal-Reaching Controllers Aravind Sivaramakrishnan Edgar Granados Seth Karten T. McMahon Kostas E. Bekris 49 7 0 08 Oct 2021
Learning to Centralize Dual-Arm Assembly Marvin Alles Elie Aljalbout 67 18 0 08 Oct 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations Sindre Benjamin Remman A. Lekkas 55 14 0 07 Oct 2021
Designing Composites with Target Effective Young's Modulus using Reinforcement Learning Aldair E. Gongora Siddharth Mysore Beichen Li Wan Shou Wojciech Matusik E. Morgan Keith A. Brown Emily Whiting AI4CE 62 9 0 07 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks Robert McCarthy Qiang Wang S. Redmond OffRL 72 15 0 05 Oct 2021
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning Jing Bi Jiebo Luo Chenliang Xu 124 49 0 05 Oct 2021
Large Batch Experience Replay Thibault Lahire Matthieu Geist Emmanuel Rachelson OffRL 100 13 0 04 Oct 2021
Sim and Real: Better Together Shirli Di-Castro Shashua Dotan DiCastro Shie Mannor 130 11 0 01 Oct 2021
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines Jueming Hu Zhe Xu Weichang Wang Guannan Qu Yutian Pang Yongming Liu 100 12 0 30 Sep 2021