ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.12656
  4. Cited By
Replacing Rewards with Examples: Example-Based Policy Search via
  Recursive Classification

Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification

23 March 2021
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
    OffRL
ArXivPDFHTML

Papers citing "Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification"

16 / 16 papers shown
Title
Learning control strategy in soft robotics through a set of
  configuration spaces
Learning control strategy in soft robotics through a set of configuration spaces
Etienne Ménager
Christian Duriez
40
0
0
21 Feb 2024
RLIF: Interactive Imitation Learning as Reinforcement Learning
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi-An Ma
Sergey Levine
OffRL
25
14
0
21 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
35
4
0
06 Nov 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and
  Bidirectional Curriculum
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
22
3
0
17 May 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning
  from Observations
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
28
16
0
30 Mar 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Daniel Shin
Anca Dragan
Daniel S. Brown
OffRL
14
53
0
03 Jan 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
28
5
0
17 Dec 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without
  Explicit Goal Specification
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification
Min-Yu Wu
Fangwei Zhong
Yulong Xia
Hao Dong
OOD
32
17
0
02 Sep 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal
  Demonstrations
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
26
66
0
20 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
25
138
0
15 Jun 2022
Versatile Offline Imitation from Observations and Examples via
  Regularized State-Occupancy Matching
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Yecheng Jason Ma
Andrew Shen
Dinesh Jayaraman
Osbert Bastani
OffRL
23
32
0
04 Feb 2022
Combining Learning from Human Feedback and Knowledge Engineering to
  Solve Hierarchical Tasks in Minecraft
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
13
7
0
07 Dec 2021
Example-Driven Model-Based Reinforcement Learning for Solving
  Long-Horizon Visuomotor Tasks
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Visual Adversarial Imitation Learning using Variational Models
Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
SSL
28
49
0
16 Jul 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for
  General-Purpose Policies
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
58
14
0
23 Apr 2021
Semi-supervised reward learning for offline reinforcement learning
Semi-supervised reward learning for offline reinforcement learning
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
68
23
0
12 Dec 2020
1