ResearchTrend.AI
Learning from Suboptimal Demonstration via Self-Supervised Reward Regression

17 October 2020
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
    SSL
arXiv: 2010.11723

Papers citing "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression"

30 / 30 papers shown
Title
Reinforcement Learning from Multi-level and Episodic Human Feedback
Muhammad Qasim Elahi
Somtochukwu Oguchienti
Maheed H. Ahmed
Mahsa Ghasemi
OffRL
55
0
0
20 Apr 2025
Robotic Table Tennis: A Case Study into a High Speed Learning System
David B. D'Ambrosio
Jonathan Abelian
Saminda Abeyruwan
Michael Ahn
Alex Bewley
...
Vikas Sindhwani
Avi Singh
Vincent Vanhoucke
Grace Vesom
Peng Xu
60
13
0
20 Feb 2025
Learning Strategy Representation for Imitation Learning in Multi-Agent Games
Shiqi Lei
Kanghoon Lee
Linjing Li
Jinkyoo Park
OffRL
42
0
0
17 Feb 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
218
0
0
03 Jan 2025
Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance
Kin Man Lee
Sean Ye
Qingyu Xiao
Zixuan Wu
Z. Zaidi
David B. D'Ambrosio
Pannag R. Sanketi
Matthew Gombolay
80
2
0
23 Sep 2024
Achieving Human Level Competitive Robot Table Tennis
David B. D'Ambrosio
Saminda Abeyruwan
L. Graesser
Atil Iscen
H. B. Amor
...
Vikas Sindhwani
Vincent Vanhoucke
Grace Vesom
P. Xu
Pannag R. Sanketi
95
14
0
07 Aug 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
46
2
0
30 Apr 2024
DIDA: Denoised Imitation Learning based on Domain Adaptation
Kaichen Huang
Hai-Hang Sun
Shenghua Wan
Minghao Shao
Shuai Feng
Le Gan
De-Chuan Zhan
32
1
0
04 Apr 2024
ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games
Shiqi Lei
Kanghoon Lee
Linjing Li
Jinkyoo Park
Jiachen Li
OffRL
31
1
0
28 Feb 2024
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
47
2
0
20 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
41
2
0
02 Feb 2024
Aligning Human Intent from Imperfect Demonstrations with Confidence-based Inverse soft-Q Learning
Xizhou Bu
Wenjuan Li
Zhengxiong Liu
Zhiqiang Ma
Panfeng Huang
22
1
0
18 Dec 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
46
2
0
09 Nov 2023
Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning
Sachit Kuhar
Shuo Cheng
Shivang Chopra
Matthew Bronars
Danfei Xu
55
9
0
22 Oct 2023
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
36
6
0
21 Oct 2023
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations
Lu Li
Yuxin Pan
Ruobing Chen
Jie Liu
Zilin Wang
Yu Liu
Zhiheng Li
50
0
0
13 Oct 2023
The Effect of Robot Skill Level and Communication in Rapid, Proximate Human-Robot Collaboration
Kin Man Lee
Arjun Krishna
Z. Zaidi
Rohan R. Paleja
Letian Chen
Erin Hedlund-Botti
Mariah L. Schrum
Matthew C. Gombolay
27
14
0
07 Apr 2023
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Yunke Wang
Bo Du
Chang Xu
38
8
0
13 Feb 2023
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
26
4
0
03 Dec 2022
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning
Caroline Wang
Garrett A. Warnell
Peter Stone
43
3
0
26 Oct 2022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
45
2
0
19 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
Tianli Ding
L. Graesser
Saminda Abeyruwan
David B. D'Ambrosio
Anish Shankar
P. Sermanet
Pannag R. Sanketi
Corey Lynch
59
20
0
07 Oct 2022
Receding Horizon Inverse Reinforcement Learning
Yiqing Xu
Wei Gao
David Hsu
24
14
0
09 Jun 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
30
0
0
14 Feb 2022
A Ranking Game for Imitation Learning
Harshit S. Sikchi
Akanksha Saran
Wonjoon Goo
S. Niekum
OffRL
27
22
0
07 Feb 2022
Learning from Imperfect Demonstrations via Adversarial Confidence Transfer
Zhangjie Cao
Zihan Wang
Dorsa Sadigh
AAML
37
7
0
07 Feb 2022
Towards Sample-efficient Apprenticeship Learning from Suboptimal Demonstration
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
13
2
0
08 Oct 2021
Supervised Bayesian Specification Inference from Demonstrations
Ankit J. Shah
Pritish Kamath
Shen Li
Patrick L. Craven
Kevin J. Landers
Kevin B. Oden
J. Shah
27
3
0
06 Jul 2021
Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations
Rohan R. Paleja
Andrew Silva
Letian Chen
Matthew C. Gombolay
22
31
0
14 Jun 2019