ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.07461
  4. Cited By
Shallow Updates for Deep Reinforcement Learning

Shallow Updates for Deep Reinforcement Learning

21 May 2017
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
    OffRL
ArXivPDFHTML

Papers citing "Shallow Updates for Deep Reinforcement Learning"

19 / 19 papers shown
Title
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
36
2
0
11 Dec 2023
Towards a Better Understanding of Representation Dynamics under
  TD-learning
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
34
2
0
29 May 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
49
15
0
20 Nov 2022
Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind
  Conditions
Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind Conditions
Michael O'Connell
Guanya Shi
Xichen Shi
Soon-Jo Chung
49
22
0
02 Mar 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
32
18
0
07 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
64
73
0
01 Jan 2021
Randomized Policy Learning for Continuous State and Action MDPs
Randomized Policy Learning for Continuous State and Action MDPs
Hiteshi Sharma
Rahul Jain
31
1
0
08 Jun 2020
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
Huimin Peng
VLM
OffRL
39
35
0
17 Apr 2020
Distributional Robustness and Regularization in Reinforcement Learning
Distributional Robustness and Regularization in Reinforcement Learning
E. Derman
Shie Mannor
34
44
0
05 Mar 2020
How Transferable are the Representations Learned by Deep Q Agents?
How Transferable are the Representations Learned by Deep Q Agents?
Jacob Tyo
Zachary Chase Lipton
OffRL
27
6
0
24 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
48
189
0
23 Dec 2019
A Geometric Perspective on Optimal Representations for Reinforcement
  Learning
A Geometric Perspective on Optimal Representations for Reinforcement Learning
Marc G. Bellemare
Will Dabney
Robert Dadashi
Adrien Ali Taïga
Pablo Samuel Castro
Nicolas Le Roux
Dale Schuurmans
Tor Lattimore
Clare Lyle
24
90
0
31 Jan 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through
  Likelihood Matching
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
41
30
0
24 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
29
7
0
23 Jan 2019
A Theoretical Analysis of Deep Q-Learning
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
48
598
0
01 Jan 2019
Context-Dependent Upper-Confidence Bounds for Directed Exploration
Context-Dependent Upper-Confidence Bounds for Directed Exploration
Raksha Kumaraswamy
M. Schlegel
Adam White
Martha White
OffRL
31
12
0
15 Nov 2018
Successor Uncertainties: Exploration and Uncertainty in Temporal
  Difference Learning
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
David Janz
Jiri Hron
Przemysław Mazur
Katja Hofmann
José Miguel Hernández-Lobato
Sebastian Tschiatschek
65
51
0
15 Oct 2018
Learn What Not to Learn: Action Elimination with Deep Reinforcement
  Learning
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
53
189
0
06 Sep 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
323
2,908
0
15 Sep 2016
1