ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.10479
  4. Cited By
Exploration-Enhanced POLITEX

Exploration-Enhanced POLITEX

27 August 2019
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
Gellert Weisz
ArXivPDFHTML

Papers citing "Exploration-Enhanced POLITEX"

16 / 16 papers shown
Title
Regularized Off-Policy TD-Learning
Regularized Off-Policy TD-Learning
Bo Liu
Sridhar Mahadevan
Ji Liu
OffRL
42
65
0
06 Jun 2020
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
50
158
0
06 Jun 2020
Provably Efficient Maximum Entropy Exploration
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
67
297
0
06 Dec 2018
Large-Scale Study of Curiosity-Driven Learning
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
69
702
0
13 Aug 2018
Is Q-learning Provably Efficient?
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
63
805
0
10 Jul 2018
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
68
95
0
17 Apr 2018
Distributed Prioritized Experience Replay
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
143
740
0
02 Mar 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
127
1,133
0
02 Jan 2018
Minimax Regret Bounds for Reinforcement Learning
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
80
774
0
16 Mar 2017
Multi-step Reinforcement Learning: A Unifying Algorithm
Multi-step Reinforcement Learning: A Unifying Algorithm
Kristopher De Asis
Fernando Hernandez-Garcia
Zach Holland
R. Sutton
37
121
0
03 Mar 2017
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,749
0
20 Nov 2015
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
212
3,787
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
156
7,623
0
22 Sep 2015
Scale-Free Algorithms for Online Linear Optimization
Scale-Free Algorithms for Online Linear Optimization
Francesco Orabona
D. Pál
ODL
63
53
0
19 Feb 2015
Generalization and Exploration via Randomized Value Functions
Generalization and Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Zheng Wen
77
314
0
04 Feb 2014
Off-policy Learning with Eligibility Traces: A Survey
Off-policy Learning with Eligibility Traces: A Survey
Matthieu Geist
B. Scherrer
OffRL
85
94
0
15 Apr 2013
1