Improved Policy Optimization for Online Imitation Learning


29 July 2022
J. Lavington
Sharan Vaswani
Mark W. Schmidt
    OffRL

Papers citing "Improved Policy Optimization for Online Imitation Learning"

7 / 7 papers shown
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
29
0
0
12 May 2025
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
B. Moreno
Margaux Brégère
Pierre Gaillard
Nadia Oudjane
OffRL
37
0
0
30 May 2024
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank D. Wood
Adam Scibior
42
7
0
30 May 2022
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
49
21
0
30 Sep 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
34
70
0
18 Feb 2021
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark W. Schmidt
136
1,198
0
16 Aug 2016
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
160
220
0
22 May 2012