Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control

4 October 2022

Jorge de Heuvel

Maren Bennewitz

Papers citing "Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control"

Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control Jielong Yang Daoyuan Huang 90 0 0 21 Feb 2025
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration Desik Rengarajan G. Vaidya Akshay Sarvesh D. Kalathil S. Shakkottai OffRL 60 57 0 09 Feb 2022
Infusing model predictive control into meta-reinforcement learning for mobile robots in dynamic environments Jaeuk Shin A. Hakobyan Mingyu Park Yeoneung Kim Gihun Kim Insoon Yang 67 11 0 15 Sep 2021
End-Effector Stabilization of a 10-DOF Mobile Manipulator using Nonlinear Model Predictive Control Mostafa Osman M. Mehrez Shiyi Yang Soo Jeon W. Melek AILaw 23 9 0 24 Mar 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving Pranav Agarwal Pierre de Beaucorps Raoul de Charette 49 3 0 16 Mar 2021
Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians B. Brito Michael Everett Jonathan P. How Javier Alonso-Mora 116 62 0 25 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving Haochen Liu Zhiyu Huang Jingda Wu Chen Lv 74 73 0 18 Feb 2021
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments Vinicius G. Goecks Gregory M. Gremillion Vernon J. Lawhern J. Valasek Nicholas R. Waytowich OffRL 55 31 0 09 Oct 2019
Soft Actor-Critic Algorithms and Applications Tuomas Haarnoja Aurick Zhou Kristian Hartikainen George Tucker Sehoon Ha ... Vikash Kumar Henry Zhu Abhishek Gupta Pieter Abbeel Sergey Levine 141 2,445 0 13 Dec 2018
Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning Linhai Xie Sen Wang Stefano Rosa Andrew Markham A. Trigoni OffRL 72 80 0 12 Dec 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations Mark Pfeiffer Samarth Shukla M. Turchetta Cesar Cadena Andreas Krause Roland Siegwart Juan I. Nieto 61 158 0 18 May 2018
Safe end-to-end imitation learning for model predictive control Keuntaek Lee Kamil Saigol Evangelos A. Theodorou BDL 89 24 0 27 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 180 5,204 0 26 Feb 2018
Overcoming Exploration in Reinforcement Learning with Demonstrations Ashvin Nair Bob McGrew Marcin Andrychowicz Wojciech Zaremba Pieter Abbeel OffRL 94 788 0 28 Sep 2017
Hindsight Experience Replay Marcin Andrychowicz Dwight Crow Alex Ray Jonas Schneider Rachel Fong Peter Welinder Bob McGrew Joshua Tobin Pieter Abbeel Wojciech Zaremba OffRL 268 2,337 0 05 Jul 2017
Deep Q-learning from Demonstrations Todd Hester Matej Vecerík Olivier Pietquin Marc Lanctot Tom Schaul ... Gabriel Dulac-Arnold Ian Osband J. Agapiou Joel Z Leibo A. Gruslys OffRL 54 155 0 12 Apr 2017
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search Tianhao Zhang G. Kahn Sergey Levine Pieter Abbeel 76 427 0 22 Sep 2015