1802.05313
Reinforcement Learning from Imperfect Demonstrations
14 February 2018
Yang Gao
Huazhe Xu
Ji Lin
Feng Yu
Sergey Levine
Trevor Darrell
Papers citing
"Reinforcement Learning from Imperfect Demonstrations"
50 / 51 papers shown
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Ning Yang
Stephen Xia
OffRL
53
0
0
08 Apr 2025
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
34
2
0
06 Apr 2025
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu
Devin White
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
OffRL
39
0
0
13 Jan 2025
Proximal Policy Distillation
Giacomo Spigler
OffRL
28
1
0
21 Jul 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
46
2
0
30 Apr 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
44
0
0
29 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
43
2
0
02 Feb 2024
Exploiting Symmetry and Heuristic Demonstrations in Off-policy Reinforcement Learning for Robotic Manipulation
Amir M. Soufi Enayati
Zengjie Zhang
Kashish Gupta
Homayoun Najjaran
OffRL
16
0
0
12 Apr 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
29
2
0
02 Feb 2023
On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
Tim G. J. Rudner
Cong Lu
Michael A. Osborne
Yarin Gal
Yee Whye Teh
OffRL
38
27
0
28 Dec 2022
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
26
4
0
03 Dec 2022
Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots
Akanksha Saran
K. Desai
M. L. Chang
Rudolf Lioutikov
A. Thomaz
S. Niekum
25
3
0
01 Nov 2022
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret
Sheelabhadra Dey
Sumedh Pendurkar
Guni Sharon
Josiah P. Hanna
16
10
0
20 Sep 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
Informed Learning by Wide Neural Networks: Convergence, Generalization and Sampling Complexity
Jianyi Yang
Shaolei Ren
35
3
0
02 Jul 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies
Yu Wang
Fang Liu
29
0
0
23 May 2022
Aligning to Social Norms and Values in Interactive Narratives
Prithviraj Ammanabrolu
Liwei Jiang
Maarten Sap
Hannaneh Hajishirzi
Yejin Choi
AI4CE
28
47
0
04 May 2022
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta
Corey Lynch
Brandon Kinman
Garrett Peake
Sergey Levine
Karol Hausman
OffRL
19
17
0
29 Mar 2022
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel
Haoran Zhang
Chenkun Yin
Yanxin Zhang
S. Jin
Zhenxuan Li
OffRL
21
3
0
23 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
39
60
0
16 Nov 2021
Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning
Yantian Zha
L. Guan
Subbarao Kambhampati
26
5
0
11 Oct 2021
Credit Assignment Safety Learning from Human Demonstrations
A. Prabhakar
A. Billard
25
2
0
09 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
24
33
0
05 Oct 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
Kumar Krishna Agrawal
Aditya Grover
Vidya Muthukumar
A. Pananjady
16
8
0
28 Jun 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
37
10
0
26 Dec 2020
Deep Reinforcement Learning based Local Planner for UAV Obstacle Avoidance using Demonstration Data
Lei He
Nabil Aouf
J. Whidborne
Bifeng Song
26
26
0
06 Aug 2020
A Conceptual Framework for Externally-influenced Agents: An Assisted Reinforcement Learning Review
Adam Bignold
Francisco Cruz
Matthew E. Taylor
Tim Brys
Richard Dazeley
Peter Vamplew
Cameron Foale
20
28
0
03 Jul 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
23
4
0
17 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
46
592
0
16 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
14
7
0
14 Jun 2020
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
Taiyu Zhu
Kezhi Li
P. Herrero
Pantelis Georgiou
24
80
0
18 May 2020
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
21
80
0
02 Apr 2020
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft
Christian Scheller
Yanick Schraner
Manfred Vogel
26
27
0
12 Mar 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
24
8
0
27 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Reinforcement Learning-based Visual Navigation with Information-Theoretic Regularization
Qiaoyun Wu
Kai Xu
Jun Wang
Mingliang Xu
Xiaoxi Gong
Tianyi Zhou
19
30
0
09 Dec 2019
IRIS: Implicit Reinforcement without Interaction at Scale for Learning Control from Offline Robot Manipulation Data
Ajay Mandlekar
Fabio Ramos
Byron Boots
Silvio Savarese
Li Fei-Fei
Animesh Garg
Dieter Fox
OffRL
34
117
0
13 Nov 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Zhilin Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
33
121
0
27 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
19
31
0
09 Oct 2019
Review of Learning-based Longitudinal Motion Planning for Autonomous Vehicles: Research Gaps between Self-driving and Traffic Congestion
Hao Zhou
Jorge A. Laval
Anye Zhou
Yu Wang
W. Wu
Zhuo Qing
S. Peeta
18
37
0
02 Oct 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
23
18
0
05 Jul 2019
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
34
219
0
13 Jun 2019
Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun
Xiaojuan Ma
21
28
0
29 May 2019
How You Act Tells a Lot: Privacy-Leakage Attack on Deep Reinforcement Learning
Xinlei Pan
Weiyao Wang
Xiaoshuai Zhang
Bo-wen Li
Jinfeng Yi
D. Song
MIACV
69
26
0
24 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
19
1
0
05 Apr 2019
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
27
105
0
31 May 2018