Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.18289
Cited By
Highway Reinforcement Learning
28 May 2024
Yuhui Wang
M. Strupl
Francesco Faccio
Qingyuan Wu
Haozhe Liu
Michal Grudzieñ
Xiaoyang Tan
Jürgen Schmidhuber
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Highway Reinforcement Learning"
15 / 15 papers shown
Title
Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Albert Wilcox
Ashwin Balakrishna
Jules Dedieu
Wyame Benslimane
Daniel S. Brown
Ken Goldberg
OffRL
71
20
0
14 Oct 2022
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
73
180
0
16 Feb 2020
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
116
22
0
16 Oct 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target
J. F. Hernandez-Garcia
R. Sutton
62
63
0
22 Jan 2019
How to Combine Tree-Search Methods in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
58
32
0
06 Sep 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
151
741
0
02 Mar 2018
Beyond the One Step Greedy Approach in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
104
51
0
10 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
247
1,609
0
05 Feb 2018
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
115
349
0
21 Apr 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
118
1,350
0
27 Feb 2017
A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning
Martha White
Adam White
65
48
0
02 Jul 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
179
7,678
0
22 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
135
3,442
0
08 Jun 2015
Potential-Based Shaping and Q-Value Initialization are Equivalent
Eric Wiewiora
OffRL
76
179
0
26 Jun 2011
1