Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07461
Cited By
Shallow Updates for Deep Reinforcement Learning
21 May 2017
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Shallow Updates for Deep Reinforcement Learning"
19 / 19 papers shown
Title
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
36
2
0
11 Dec 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
34
2
0
29 May 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
49
15
0
20 Nov 2022
Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind Conditions
Michael O'Connell
Guanya Shi
Xichen Shi
Soon-Jo Chung
49
22
0
02 Mar 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
32
18
0
07 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
64
73
0
01 Jan 2021
Randomized Policy Learning for Continuous State and Action MDPs
Hiteshi Sharma
Rahul Jain
31
1
0
08 Jun 2020
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
Huimin Peng
VLM
OffRL
39
35
0
17 Apr 2020
Distributional Robustness and Regularization in Reinforcement Learning
E. Derman
Shie Mannor
34
44
0
05 Mar 2020
How Transferable are the Representations Learned by Deep Q Agents?
Jacob Tyo
Zachary Chase Lipton
OffRL
27
6
0
24 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
48
189
0
23 Dec 2019
A Geometric Perspective on Optimal Representations for Reinforcement Learning
Marc G. Bellemare
Will Dabney
Robert Dadashi
Adrien Ali Taïga
Pablo Samuel Castro
Nicolas Le Roux
Dale Schuurmans
Tor Lattimore
Clare Lyle
24
90
0
31 Jan 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
41
30
0
24 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
29
7
0
23 Jan 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
48
598
0
01 Jan 2019
Context-Dependent Upper-Confidence Bounds for Directed Exploration
Raksha Kumaraswamy
M. Schlegel
Adam White
Martha White
OffRL
31
12
0
15 Nov 2018
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
David Janz
Jiri Hron
Przemysław Mazur
Katja Hofmann
José Miguel Hernández-Lobato
Sebastian Tschiatschek
65
51
0
15 Oct 2018
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
53
189
0
06 Sep 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
323
2,908
0
15 Sep 2016
1