ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.01525
  4. Cited By
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive
  Control
v1v2 (latest)

Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control

4 October 2022
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
ArXiv (abs)PDFHTML

Papers citing "Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control"

17 / 17 papers shown
Title
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Jielong Yang
Daoyuan Huang
90
0
0
21 Feb 2025
Reinforcement Learning with Sparse Rewards using Guidance from Offline
  Demonstration
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
Desik Rengarajan
G. Vaidya
Akshay Sarvesh
D. Kalathil
S. Shakkottai
OffRL
60
57
0
09 Feb 2022
Infusing model predictive control into meta-reinforcement learning for
  mobile robots in dynamic environments
Infusing model predictive control into meta-reinforcement learning for mobile robots in dynamic environments
Jaeuk Shin
A. Hakobyan
Mingyu Park
Yeoneung Kim
Gihun Kim
Insoon Yang
67
11
0
15 Sep 2021
End-Effector Stabilization of a 10-DOF Mobile Manipulator using
  Nonlinear Model Predictive Control
End-Effector Stabilization of a 10-DOF Mobile Manipulator using Nonlinear Model Predictive Control
Mostafa Osman
M. Mehrez
Shiyi Yang
Soo Jeon
W. Melek
AILaw
23
9
0
24 Mar 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
49
3
0
16 Mar 2021
Where to go next: Learning a Subgoal Recommendation Policy for
  Navigation Among Pedestrians
Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians
B. Brito
Michael Everett
Jonathan P. How
Javier Alonso-Mora
116
62
0
25 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for
  Urban Autonomous Driving
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
74
73
0
18 Feb 2021
Integrating Behavior Cloning and Reinforcement Learning for Improved
  Performance in Dense and Sparse Reward Environments
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
55
31
0
09 Oct 2019
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
141
2,445
0
13 Dec 2018
Learning with Training Wheels: Speeding up Training with a Simple
  Controller for Deep Reinforcement Learning
Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning
Linhai Xie
Sen Wang
Stefano Rosa
Andrew Markham
A. Trigoni
OffRL
72
80
0
12 Dec 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for
  Map-less Navigation by Leveraging Prior Demonstrations
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations
Mark Pfeiffer
Samarth Shukla
M. Turchetta
Cesar Cadena
Andreas Krause
Roland Siegwart
Juan I. Nieto
61
158
0
18 May 2018
Safe end-to-end imitation learning for model predictive control
Safe end-to-end imitation learning for model predictive control
Keuntaek Lee
Kamil Saigol
Evangelos A. Theodorou
BDL
89
24
0
27 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
180
5,204
0
26 Feb 2018
Overcoming Exploration in Reinforcement Learning with Demonstrations
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
94
788
0
28 Sep 2017
Hindsight Experience Replay
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
268
2,337
0
05 Jul 2017
Deep Q-learning from Demonstrations
Deep Q-learning from Demonstrations
Todd Hester
Matej Vecerík
Olivier Pietquin
Marc Lanctot
Tom Schaul
...
Gabriel Dulac-Arnold
Ian Osband
J. Agapiou
Joel Z Leibo
A. Gruslys
OffRL
54
155
0
12 Apr 2017
Learning Deep Control Policies for Autonomous Aerial Vehicles with
  MPC-Guided Policy Search
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search
Tianhao Zhang
G. Kahn
Sergey Levine
Pieter Abbeel
76
427
0
22 Sep 2015
1