Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.00748
Cited By
Continuous Deep Q-Learning with Model-based Acceleration
2 March 2016
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuous Deep Q-Learning with Model-based Acceleration"
50 / 170 papers shown
Title
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
93
1
0
26 Mar 2025
Is Bellman Equation Enough for Learning Control?
Haoxiang You
Lekan Molu
Ian Abraham
68
0
0
04 Mar 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
80
2
0
04 Feb 2025
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
116
2
0
23 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
Online Control-Informed Learning
Zihao Liang
Tianyu Zhou
Zehui Lu
Shaoshuai Mou
33
1
0
04 Oct 2024
q-exponential family for policy optimization
Lingwei Zhu
Haseeb Shah
Han Wang
Yukie Nagai
Martha White
OffRL
78
0
0
14 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
35
3
0
11 Aug 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
49
5
0
29 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
45
0
23 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
6
0
22 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks
Ruigang Wang
Krishnamurthy Dvijotham
I. Manchester
36
5
0
02 Feb 2024
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
37
1
0
21 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing
Mohammad Safeea
Pedro Neto
20
7
0
05 Dec 2023
Active Control of Flow over Rotating Cylinder by Multiple Jets using Deep Reinforcement Learning
Kamyar Dobakhti
J. Ghazanfarian
AI4CE
29
0
0
22 Jul 2023
Deep Deterministic Policy Gradient for End-to-End Communication Systems without Prior Channel Knowledge
Bolun Zhang
Nguyen Van Huynh
30
4
0
12 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
Allen Z. Ren
Hongkai Dai
Benjamin Burchfiel
Anirudha Majumdar
27
14
0
09 Feb 2023
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
24
3
0
17 Dec 2022
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
34
14
0
01 Dec 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
30
15
0
12 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
23
12
0
08 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
49
8
0
06 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
23
52
0
03 Nov 2022
Improving aircraft performance using machine learning: a review
S. L. Clainche
E. Ferrer
Sam Gibson
Elisabeth Cross
A. Parente
Ricardo Vinuesa
AI4CE
36
93
0
20 Oct 2022
Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations
Francisco Roldan Sanchez
Qiang-qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
18
1
0
03 Oct 2022
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter
Ruben Villarreal
Nikolaos N. Vlassis
Nhon N. Phan
Tommie A. Catanach
Reese E. Jones
N. Trask
S. Kramer
WaiChing Sun
OffRL
30
11
0
27 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan
Eniola Ajiboye
Peruth Kamashazi
Timothy Mbaka
OffRL
19
1
0
21 Sep 2022
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
51
69
0
02 Jul 2022
Action-modulated midbrain dopamine activity arises from distributed control policies
Jack W Lindsey
Ashok Litwin-Kumar
MLAU
21
11
0
01 Jul 2022
Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars
Mingze Wang
Ziyang Zhang
Grace Hui Yang
29
1
0
21 Jun 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Tao Luo
Peilin Liu
Z. Xu
18
2
0
25 May 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
30
9
0
09 Apr 2022
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming
Supriyo Ghosh
L. Wynter
Shiau Hong Lim
D. Nguyen
34
0
0
27 Feb 2022
A Survey on Deep Reinforcement Learning-based Approaches for Adaptation and Generalization
Pamul Yadav
Ashutosh Mishra
Junyong Lee
Shiho Kim
OffRL
AI4CE
21
10
0
17 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRL
AI4CE
39
5
0
21 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
29
167
0
08 Dec 2021
Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching
Shengheng Liu
Chong Zheng
Yongming Huang
Tony Q.S. Quek
16
60
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
27
15
0
05 Oct 2021
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
10
9
0
05 Oct 2021
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
20
2
0
28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles
Shengduo Chen
Yao Sun
Dachuan Li
Qiang Wang
Qi Hao
J. Sifakis
57
15
0
28 Sep 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
25
29
0
17 Jul 2021
Species Distribution Modeling for Machine Learning Practitioners: A Review
Sara Beery
Elijah Cole
Joseph Parker
Pietro Perona
Kevin Winner
21
69
0
03 Jul 2021
1
2
3
4
Next