1603.00748
Continuous Deep Q-Learning with Model-based Acceleration
2 March 2016
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
Papers citing
"Continuous Deep Q-Learning with Model-based Acceleration"
50 / 308 papers shown
Title
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
207
1
0
26 Mar 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kolobov
Abhishek Gupta
179
4
0
04 Feb 2025
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
190
7
0
23 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
197
0
0
07 Oct 2024
Online Control-Informed Learning
Zihao Liang
Tianyu Zhou
Zehui Lu
Shaoshuai Mou
122
1
0
04 Oct 2024
q-exponential family for policy optimization
Lingwei Zhu
Haseeb Shah
Han Wang
Yukie Nagai
Martha White
OffRL
140
0
0
14 Aug 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
335
54
0
23 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
121
8
0
09 Apr 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods
Zheyu Zhang
AAML
47
0
0
23 Feb 2024
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing
Mohammad Safeea
Pedro Neto
27
10
0
05 Dec 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
T. Westenbroek
Jacob Levy
David Fridovich-Keil
77
0
0
16 Jul 2023
TD Convergence: An Optimization Perspective
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
88
8
0
30 Jun 2023
Deep Deterministic Policy Gradient for End-to-End Communication Systems without Prior Channel Knowledge
Bolun Zhang
Nguyen Van Huynh
72
5
0
12 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núñez-Molina
Pablo Mesejo
Juan Fernández-Olivares
126
3
0
20 Apr 2023
Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition
M. Valancius
M. Lennon
Junier Oliva
74
1
0
27 Feb 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
94
617
0
10 Jan 2023
Holistic Network Virtualization and Pervasive Network Intelligence for 6G
Xuemin Shen
Jie Gao
Wen Wu
Mushu Li
Conghao Zhou
W. Zhuang
106
238
0
02 Jan 2023
Risk-Sensitive Reinforcement Learning with Exponential Criteria
Erfaun Noorani
Christos N. Mavridis
John S. Baras
99
9
0
18 Dec 2022
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
82
3
0
17 Dec 2022
CT-DQN: Control-Tutored Deep Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
M. D. Bernardo
55
4
0
02 Dec 2022
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
142
17
0
01 Dec 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
78
16
0
12 Nov 2022
Job Scheduling in Datacenters using Constraint Controlled RL
V. Venkataswamy
41
1
0
10 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
52
56
0
03 Nov 2022
MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling
Julius Ott
Lorenzo Servadei
Jose A. Arjona-Medina
E. Rinaldi
Gianfranco Mauro
Daniela Sanchez Lopera
Michael Stephan
Thomas Stadelmayer
Avik Santra
Robert Wille
66
0
0
24 Oct 2022
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter
Ruben Villarreal
Nikolaos N. Vlassis
Nhon N. Phan
Tommie A. Catanach
Reese E. Jones
N. Trask
S. Kramer
WaiChing Sun
OffRL
59
12
0
27 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan
Eniola Ajiboye
Peruth Kamashazi
Timothy Mbaka
OffRL
59
1
0
21 Sep 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
OffRL
67
11
0
25 Aug 2022
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
Bojun Huang
56
1
0
22 Jul 2022
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
158
78
0
02 Jul 2022
Action-modulated midbrain dopamine activity arises from distributed control policies
Jack W Lindsey
Ashok Litwin-Kumar
MLAU
51
12
0
01 Jul 2022
Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars
Mingze Wang
Ziyang Zhang
Grace Hui Yang
58
1
0
21 Jun 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Yaoyu Zhang
Peilin Liu
Z. Xu
85
2
0
25 May 2022
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming
Supriyo Ghosh
L. Wynter
Shiau Hong Lim
D. Nguyen
63
0
0
27 Feb 2022
Online Decision Transformer
Qinqing Zheng
Amy Zhang
Aditya Grover
OffRL
93
209
0
11 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
117
32
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
242
96
0
28 Jan 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
OffRL
160
16
0
13 Jan 2022
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
126
180
0
08 Dec 2021
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay
Xingxing Liang
Yang Ma
Yanghe Feng
Zhong Liu
61
10
0
07 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
116
104
0
19 Nov 2021
Physics-informed neural networks via stochastic Hamiltonian dynamics learning
Minh Nguyen
Chandrajit Bajaj
40
1
0
15 Nov 2021
Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching
Shengheng Liu
Chong Zheng
Yongming Huang
Tony Q.S. Quek
67
61
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
105
23
0
19 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
72
15
0
05 Oct 2021
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
70
9
0
05 Oct 2021
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
25
2
0
28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles
Shengduo Chen
Yao Sun
Dachuan Li
Qiang Wang
Qi Hao
J. Sifakis
82
18
0
28 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021