PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via
Relabeling Experience and Unsupervised Pre-training

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training

9 June 2021

Pieter Abbeel

Papers citing "PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training"

17 / 67 papers shown

Title
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning Mudit Verma Siddhant Bhambri Subbarao Kambhampati 37 4 0 17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences Mudit Verma Subbarao Kambhampati 33 2 0 17 Feb 2023
On The Fragility of Learned Reward Functions Lev McKinney Yawen Duan David M. Krueger Adam Gleave 33 20 0 09 Jan 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning Daniel Shin Anca Dragan Daniel S. Brown OffRL 17 53 0 03 Jan 2023
Few-Shot Preference Learning for Human-in-the-Loop RL Joey Hejna Dorsa Sadigh OffRL 32 92 0 06 Dec 2022
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning Katherine Metcalf Miguel Sarabia B. Theobald OffRL 38 4 0 12 Nov 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision Ashvin Nair Brian Zhu Gokul Narayanan Eugen Solowjow Sergey Levine OffRL OnRL 28 15 0 27 Oct 2022
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion Utkarsh Soni Nupur Thakur S. Sreedharan L. Guan Mudit Verma Matthew Marquez Subbarao Kambhampati 39 6 0 27 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences Mudit Verma Katherine Metcalf 35 8 0 17 Oct 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 21 58 0 24 May 2022
Learning Dense Reward with Temporal Variant Self-Supervision Yuning Wu Jieliang Luo Hui Li SSL 16 0 0 20 May 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 36 9 0 23 Feb 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning Kimin Lee Laura M. Smith Anca Dragan Pieter Abbeel OffRL 40 93 0 04 Nov 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans Khanh Nguyen Yonatan Bisk Hal Daumé 47 16 0 14 Oct 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback Xiaofei Wang Kimin Lee Kourosh Hakhamaneshi Pieter Abbeel Michael Laskin 34 42 0 11 Aug 2021
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning Adam Bignold Francisco Cruz Richard Dazeley Peter Vamplew Cameron Foale 22 18 0 21 Sep 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation L. Guan Mudit Verma Sihang Guo Ruohan Zhang Subbarao Kambhampati 43 42 0 26 Jun 2020