Rating-based Reinforcement Learning

Rating-based Reinforcement Learning

30 July 2023

Devin White

Ellen R. Novoseller

Vernon J. Lawhern

Nicholas R. Waytowich

Papers citing "Rating-based Reinforcement Learning"

12 / 12 papers shown

Title
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning Calarina Muslimani Matthew E. Taylor OffRL 103 2 0 30 Apr 2024
B-Pref: Benchmarking Preference-Based Reinforcement Learning Kimin Lee Laura M. Smith Anca Dragan Pieter Abbeel OffRL 94 97 0 04 Nov 2021
APReL: A Library for Active Preference-based Reward Learning Algorithms Erdem Biyik Aditi Talati Dorsa Sadigh 52 36 0 16 Aug 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee Laura M. Smith Pieter Abbeel OffRL 63 284 0 09 Jun 2021
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences Erdem Biyik Dylan P. Losey Malayandi Palan Nicholas C. Landolfi Gleb Shevchuk Dorsa Sadigh 57 118 0 24 Jun 2020
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning Erdem Biyik Malayandi Palan Nicholas C. Landolfi Dylan P. Losey Dorsa Sadigh 41 116 0 10 Oct 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations Daniel S. Brown Wonjoon Goo P. Nagarajan S. Niekum 73 357 0 12 Apr 2019
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 309 8,352 0 04 Jan 2018
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 44 271 0 28 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 499 19,065 0 20 Jul 2017
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 320 13,248 0 09 Sep 2015
Maximum Entropy Deep Inverse Reinforcement Learning Markus Wulfmeier Peter Ondruska Ingmar Posner OOD 72 406 0 17 Jul 2015