1908.10479
Cited By
Exploration-Enhanced POLITEX
27 August 2019
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
Gellert Weisz
Papers citing "Exploration-Enhanced POLITEX"
16 / 16 papers shown
Regularized Off-Policy TD-Learning
Bo Liu
Sridhar Mahadevan
Ji Liu
OffRL
42
65
0
06 Jun 2020
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
50
158
0
06 Jun 2020
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
67
297
0
06 Dec 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
69
702
0
13 Aug 2018
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
63
805
0
10 Jul 2018
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
68
95
0
17 Apr 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
143
740
0
02 Mar 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
127
1,133
0
02 Jan 2018
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
80
774
0
16 Mar 2017
Multi-step Reinforcement Learning: A Unifying Algorithm
Kristopher De Asis
Fernando Hernandez-Garcia
Zach Holland
R. Sutton
37
121
0
03 Mar 2017
Dueling Network Architectures for Deep Reinforcement Learning
Ziyu Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,749
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
212
3,787
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
156
7,623
0
22 Sep 2015
Scale-Free Algorithms for Online Linear Optimization
Francesco Orabona
D. Pál
ODL
63
53
0
19 Feb 2015
Generalization and Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Zheng Wen
77
314
0
04 Feb 2014
Off-policy Learning with Eligibility Traces: A Survey
Matthieu Geist
B. Scherrer
OffRL
85
94
0
15 Apr 2013