ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.06924
  4. Cited By
Online Apprenticeship Learning

Online Apprenticeship Learning

13 February 2021
Lior Shani
Tom Zahavy
Shie Mannor
    OffRL
ArXivPDFHTML

Papers citing "Online Apprenticeship Learning"

25 / 25 papers shown
Title
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
255
30,123
0
01 Mar 2022
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
239
167
0
06 Jan 2021
Toward the Fundamental Limits of Imitation Learning
Toward the Fundamental Limits of Imitation Learning
Nived Rajaraman
Lin F. Yang
Jiantao Jiao
K. Ramachandran
132
87
0
13 Sep 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
90
85
0
20 May 2020
Generative Adversarial Imitation Learning with Neural Networks: Global
  Optimality and Convergence Rate
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate
Yufeng Zhang
Qi Cai
Zhuoran Yang
Zhaoran Wang
164
12
0
08 Mar 2020
Optimistic Policy Optimization with Bandit Feedback
Optimistic Policy Optimization with Bandit Feedback
Yonathan Efroni
Lior Shani
Aviv A. Rosenberg
Shie Mannor
48
90
0
19 Feb 2020
On Computation and Generalization of Generative Adversarial Imitation
  Learning
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen
Yizhou Wang
Tianyi Liu
Zhuoran Yang
Xingguo Li
Zhaoran Wang
T. Zhao
96
40
0
09 Jan 2020
Provably Efficient Exploration in Policy Optimization
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
51
281
0
12 Dec 2019
Apprenticeship Learning via Frank-Wolfe
Apprenticeship Learning via Frank-Wolfe
Tom Zahavy
Alon Cohen
Haim Kaplan
Yishay Mansour
68
18
0
05 Nov 2019
Introduction to Online Convex Optimization
Introduction to Online Convex Optimization
Elad Hazan
OffRL
163
1,928
0
07 Sep 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster
  Rates for Regularized MDPs
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
49
175
0
06 Sep 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and
  Distribution Shift
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
61
320
0
01 Aug 2019
Wasserstein Adversarial Imitation Learning
Wasserstein Adversarial Imitation Learning
Huang Xiao
Michael Herman
Joerg Wagner
Sebastian Ziesche
Jalal Etesami
T. H. Linh
41
71
0
19 Jun 2019
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy
  Policies
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
Yonathan Efroni
Nadav Merlis
Mohammad Ghavamzadeh
Shie Mannor
OffRL
89
68
0
27 May 2019
A Theory of Regularized Markov Decision Processes
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
109
325
0
31 Jan 2019
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning
  without Domain Knowledge using Value Function Bounds
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Andrea Zanette
Emma Brunskill
OffRL
97
276
0
01 Jan 2019
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
76
260
0
09 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
45
47
0
06 Sep 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
292
8,329
0
04 Jan 2018
Improved Training of Wasserstein GANs
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
195
9,545
0
31 Mar 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement
  Learning
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
72
309
0
22 Mar 2017
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,105
0
10 Jun 2016
Model-Free Imitation Learning with Policy Optimization
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
47
149
0
26 May 2016
Deep Exploration via Bootstrapped DQN
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
121
1,307
0
15 Feb 2016
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,764
0
19 Feb 2015
1