Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.06924
Cited By
Online Apprenticeship Learning
13 February 2021
Lior Shani
Tom Zahavy
Shie Mannor
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Online Apprenticeship Learning"
25 / 25 papers shown
Title
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
255
30,123
0
01 Mar 2022
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
239
167
0
06 Jan 2021
Toward the Fundamental Limits of Imitation Learning
Nived Rajaraman
Lin F. Yang
Jiantao Jiao
K. Ramachandran
132
87
0
13 Sep 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
90
85
0
20 May 2020
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate
Yufeng Zhang
Qi Cai
Zhuoran Yang
Zhaoran Wang
164
12
0
08 Mar 2020
Optimistic Policy Optimization with Bandit Feedback
Yonathan Efroni
Lior Shani
Aviv A. Rosenberg
Shie Mannor
48
90
0
19 Feb 2020
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen
Yizhou Wang
Tianyi Liu
Zhuoran Yang
Xingguo Li
Zhaoran Wang
T. Zhao
96
40
0
09 Jan 2020
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
51
281
0
12 Dec 2019
Apprenticeship Learning via Frank-Wolfe
Tom Zahavy
Alon Cohen
Haim Kaplan
Yishay Mansour
68
18
0
05 Nov 2019
Introduction to Online Convex Optimization
Elad Hazan
OffRL
163
1,928
0
07 Sep 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
49
175
0
06 Sep 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
61
320
0
01 Aug 2019
Wasserstein Adversarial Imitation Learning
Huang Xiao
Michael Herman
Joerg Wagner
Sebastian Ziesche
Jalal Etesami
T. H. Linh
41
71
0
19 Jun 2019
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
Yonathan Efroni
Nadav Merlis
Mohammad Ghavamzadeh
Shie Mannor
OffRL
89
68
0
27 May 2019
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
109
325
0
31 Jan 2019
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Andrea Zanette
Emma Brunskill
OffRL
97
276
0
01 Jan 2019
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
76
260
0
09 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
45
47
0
06 Sep 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
292
8,329
0
04 Jan 2018
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
195
9,545
0
31 Mar 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
72
309
0
22 Mar 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,105
0
10 Jun 2016
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
47
149
0
26 May 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
121
1,307
0
15 Feb 2016
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,764
0
19 Feb 2015
1