ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.09070
  4. Cited By
An Offline Time-aware Apprenticeship Learning Framework for Evolving
  Reward Functions

An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions

15 May 2023
Xi Yang
Ge Gao
Min Chi
    OffRL
ArXiv (abs)PDFHTML

Papers citing "An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions"

18 / 18 papers shown
Title
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Daniel Jarrett
Ioana Bica
M. Schaar
OffRL
76
63
0
25 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRLGP
576
2,046
0
04 May 2020
Imitation Learning via Off-Policy Distribution Matching
Imitation Learning via Off-Policy Distribution Matching
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OODOffRL
158
206
0
10 Dec 2019
Your Classifier is Secretly an Energy Based Model and You Should Treat
  it Like One
Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One
Will Grathwohl
Kuan-Chieh Wang
J. Jacobsen
David Duvenaud
Mohammad Norouzi
Kevin Swersky
VLM
100
547
0
06 Dec 2019
A Theoretical Analysis of Deep Q-Learning
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
190
609
0
01 Jan 2019
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
108
259
0
09 Sep 2018
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series
  Data
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
David Hallac
Sagar Vare
Stephen P. Boyd
J. Leskovec
AI4TS
56
276
0
10 Jun 2017
Multi-Modal Imitation Learning from Unstructured Demonstrations using
  Generative Adversarial Nets
Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets
Karol Hausman
Yevgen Chebotar
S. Schaal
Gaurav Sukhatme
Joseph J. Lim
GAN
91
150
0
30 May 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep
  Reinforcement Learning Approach
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
42
193
0
23 May 2017
One-Shot Imitation Learning
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
86
689
0
21 Mar 2017
InfoGAN: Interpretable Representation Learning by Information Maximizing
  Generative Adversarial Nets
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Xi Chen
Yan Duan
Rein Houthooft
John Schulman
Ilya Sutskever
Pieter Abbeel
GAN
163
4,240
0
12 Jun 2016
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRLODL
225
5,087
0
05 Jun 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks
  with Delayed Rewards
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
69
40
0
21 Apr 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy
  Optimization
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
112
952
0
01 Mar 2016
A Widely Applicable Bayesian Information Criterion
A Widely Applicable Bayesian Information Criterion
Sumio Watanabe
103
787
0
31 Aug 2012
Imitation Learning with a Value-Based Prior
Imitation Learning with a Value-Based Prior
Umar Syed
Robert Schapire
69
14
0
20 Jun 2012
Bayesian multitask inverse reinforcement learning
Bayesian multitask inverse reinforcement learning
Christos Dimitrakakis
Constantin Rothkopf
BDL
87
107
0
18 Jun 2011
1