Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.09070
Cited By
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
15 May 2023
Xi Yang
Ge Gao
Min Chi
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions"
18 / 18 papers shown
Title
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Daniel Jarrett
Ioana Bica
M. Schaar
OffRL
76
63
0
25 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
576
2,046
0
04 May 2020
Imitation Learning via Off-Policy Distribution Matching
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OOD
OffRL
158
206
0
10 Dec 2019
Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One
Will Grathwohl
Kuan-Chieh Wang
J. Jacobsen
David Duvenaud
Mohammad Norouzi
Kevin Swersky
VLM
100
547
0
06 Dec 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
190
609
0
01 Jan 2019
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
108
259
0
09 Sep 2018
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
David Hallac
Sagar Vare
Stephen P. Boyd
J. Leskovec
AI4TS
56
276
0
10 Jun 2017
Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets
Karol Hausman
Yevgen Chebotar
S. Schaal
Gaurav Sukhatme
Joseph J. Lim
GAN
91
150
0
30 May 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
42
193
0
23 May 2017
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
86
689
0
21 Mar 2017
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Xi Chen
Yan Duan
Rein Houthooft
John Schulman
Ilya Sutskever
Pieter Abbeel
GAN
163
4,240
0
12 Jun 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
225
5,087
0
05 Jun 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
69
40
0
21 Apr 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
112
952
0
01 Mar 2016
A Widely Applicable Bayesian Information Criterion
Sumio Watanabe
103
787
0
31 Aug 2012
Imitation Learning with a Value-Based Prior
Umar Syed
Robert Schapire
69
14
0
20 Jun 2012
Bayesian multitask inverse reinforcement learning
Christos Dimitrakakis
Constantin Rothkopf
BDL
87
107
0
18 Jun 2011
1