ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.05091
  4. Cited By
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via
  Relabeling Experience and Unsupervised Pre-training

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training

9 June 2021
Kimin Lee
Laura M. Smith
Pieter Abbeel
    OffRL
ArXivPDFHTML

Papers citing "PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training"

17 / 67 papers shown
Title
Exploiting Unlabeled Data for Feedback Efficient Human Preference based
  Reinforcement Learning
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
37
4
0
17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human
  Preferences
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
33
2
0
17 Feb 2023
On The Fragility of Learned Reward Functions
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
33
20
0
09 Jan 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Daniel Shin
Anca Dragan
Daniel S. Brown
OffRL
17
53
0
03 Jan 2023
Few-Shot Preference Learning for Human-in-the-Loop RL
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Rewards Encoding Environment Dynamics Improves Preference-based
  Reinforcement Learning
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
38
4
0
12 Nov 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for
  Industrial Insertion of Novel Connectors from Vision
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
28
15
0
27 Oct 2022
Towards customizable reinforcement learning agents: Enabling preference
  specification through online vocabulary expansion
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion
Utkarsh Soni
Nupur Thakur
S. Sreedharan
L. Guan
Mudit Verma
Matthew Marquez
Subbarao Kambhampati
39
6
0
27 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human
  Preferences
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
35
8
0
17 Oct 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement
  Learning
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
21
58
0
24 May 2022
Learning Dense Reward with Temporal Variant Self-Supervision
Learning Dense Reward with Temporal Variant Self-Supervision
Yuning Wu
Jieliang Luo
Hui Li
SSL
16
0
0
20 May 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
A Framework for Learning to Request Rich and Contextually Useful
  Information from Humans
A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Khanh Nguyen
Yonatan Bisk
Hal Daumé
47
16
0
14 Oct 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from
  Human Feedback
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
34
42
0
11 Aug 2021
Human Engagement Providing Evaluative and Informative Advice for
  Interactive Reinforcement Learning
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning
Adam Bignold
Francisco Cruz
Richard Dazeley
Peter Vamplew
Cameron Foale
22
18
0
21 Sep 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with
  Explanation and Context-Aware Data Augmentation
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation
L. Guan
Mudit Verma
Sihang Guo
Ruohan Zhang
Subbarao Kambhampati
43
42
0
26 Jun 2020
Previous
12