Continuous Deep Q-Learning with Model-based Acceleration

2 March 2016

Papers citing "Continuous Deep Q-Learning with Model-based Acceleration"

50 / 308 papers shown

Title
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning Yongshuai Liu Xin Liu 207 1 0 26 Mar 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning Patrick Yin Tyler Westenbroek Simran Bagaria Kevin Huang Ching-an Cheng Andrey Kobolov Abhishek Gupta 179 4 0 04 Feb 2025
Prioritized Generative Replay Renhao Wang Kevin Frans Pieter Abbeel Sergey Levine Alexei A. Efros OnRL DiffM 190 7 0 23 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling Jasmine Bayrooti Carl Henrik Ek Amanda Prorok 197 0 0 07 Oct 2024
Online Control-Informed Learning Zihao Liang Tianyu Zhou Zehui Lu Shaoshuai Mou 122 1 0 04 Oct 2024
q-exponential family for policy optimization Lingwei Zhu Haseeb Shah Han Wang Yukie Nagai Martha White OffRL 140 0 0 14 Aug 2024
A Survey on Vision-Language-Action Models for Embodied AI Yueen Ma Zixing Song Yuzheng Zhuang Jianye Hao Irwin King LM&Ro 335 54 0 23 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective Victor-Alexandru Darvariu Stephen Hailes Mirco Musolesi AI4CE 121 8 0 09 Apr 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods Zheyu Zhang AAML 47 0 0 23 Feb 2024
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing Mohammad Safeea Pedro Neto 27 10 0 05 Dec 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models T. Westenbroek Jacob Levy David Fridovich-Keil 77 0 0 16 Jul 2023
TD Convergence: An Optimization Perspective Kavosh Asadi Shoham Sabach Yao Liu Omer Gottesman Rasool Fakoor MU 88 8 0 30 Jun 2023
Deep Deterministic Policy Gradient for End-to-End Communication Systems without Prior Channel Knowledge Bolun Zhang Nguyen Van Huynh 72 5 0 12 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making Carlos Núnez-Molina Pablo Mesejo Juan Fernández-Olivares 126 3 0 20 Apr 2023
Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition M. Valancius M. Lennon Junier Oliva 74 1 0 27 Feb 2023
Mastering Diverse Domains through World Models Danijar Hafner J. Pašukonis Jimmy Ba Timothy Lillicrap 94 617 0 10 Jan 2023
Holistic Network Virtualization and Pervasive Network Intelligence for 6G Xuemin Shen Shen Jie Gao Wen Wu Mushu Li Conghao Zhou W. Zhuang 106 238 0 02 Jan 2023
Risk-Sensitive Reinforcement Learning with Exponential Criteria Erfaun Noorani Christos N. Mavridis John S. Baras 99 9 0 18 Dec 2022
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off Zichen Zhang Johannes Kirschner Junxi Zhang Francesco Zanini Alex Ayoub Masood Dehghan Dale Schuurmans OffRL 82 3 0 17 Dec 2022
CT-DQN: Control-Tutored Deep Reinforcement Learning F. D. Lellis M. Coraggio G. Russo Mirco Musolesi M. D. Bernardo 55 4 0 02 Dec 2022
On-device Training: A First Overview on Existing Systems Shuai Zhu Thiemo Voigt Jeonggil Ko Fatemeh Rahimian 142 17 0 01 Dec 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality Gianluigi Grandesso Elisa Alboni G. P. R. Papini Patrick M. Wensing Andrea Del Prete 78 16 0 12 Nov 2022
Job Scheduling in Datacenters using Constraint Controlled RL V. Venkataswamy 41 1 0 10 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV Jincheng Hu Yang Lin Liang Chu Zhuoran Hou Jihan Li Jingjing Jiang Yuanjian Zhang 130 13 0 08 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications Pouria Razzaghi Amin Tabrizian Wei Guo Shulu Chen Abenezer Taye Ellis E. Thompson Alexis Bregeon Ali Baheri Peng Wei OffRL 52 56 0 03 Nov 2022
MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling Julius Ott Lorenzo Servadei Jose A. Arjona-Medina E. Rinaldi Gianfranco Mauro Daniela Sanchez Lopera Michael Stephan Thomas Stadelmayer Avik Santra Robert Wille 66 0 0 24 Oct 2022
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter Ruben Villarreal Nikolaos N. Vlassis Nhon N. Phan Tommie A. Catanach Reese E. Jones N. Trask S. Kramer WaiChing Sun OffRL 59 12 0 27 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation Adebayo Oshingbesan Eniola Ajiboye Peruth Kamashazi Timothy Mbaka OffRL 59 1 0 21 Sep 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks Paulina Stevia Nouwou Mindom Amin Nikanjam Foutse Khomh OffRL 67 11 0 25 Aug 2022
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) Bojun Huang 56 1 0 22 Jul 2022
q-Learning in Continuous Time Yanwei Jia X. Zhou OffRL 158 78 0 02 Jul 2022
Action-modulated midbrain dopamine activity arises from distributed control policies Jack W Lindsey Ashok Litwin-Kumar MLAU 51 12 0 01 Jul 2022
Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars Mingze Wang Ziyang Zhang Grace Hui Yang 58 1 0 21 Jun 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation Shuyu Yin Yaoyu Zhang Peilin Liu Z. Xu 85 2 0 25 May 2022
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming Supriyo Ghosh L. Wynter Shiau Hong Lim D. Nguyen 63 0 0 27 Feb 2022
Online Decision Transformer Qinqing Zheng Amy Zhang Aditya Grover OffRL 93 209 0 11 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error Scott Fujimoto David Meger Doina Precup Ofir Nachum S. Gu 117 32 0 28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning? Machel Reid Yutaro Yamada S. Gu 3DV RALM OffRL 242 96 0 28 Jan 2022
Automated Reinforcement Learning: An Overview Reza Refaei Afshar Yingqian Zhang Joaquin Vanschoren U. Kaymak OffRL 160 16 0 13 Jan 2022
Recent Advances in Reinforcement Learning in Finance B. Hambly Renyuan Xu Huining Yang OffRL 126 180 0 08 Dec 2021
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay Xingxing Liang Yang Ma Yanghe Feng Zhong Liu 61 10 0 07 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information Matching Hiroki Furuta Y. Matsuo S. Gu OffRL 116 104 0 19 Nov 2021
Physics-informed neural networks via stochastic Hamiltonian dynamics learning Minh Nguyen Chandrajit Bajaj 40 1 0 15 Nov 2021
Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching Shengheng Liu Chong Zheng Yongming Huang Tony Q.S. Quek 67 61 0 20 Oct 2021
Continuous Control with Action Quantization from Demonstrations Robert Dadashi Léonard Hussenot Damien Vincent Sertan Girgin Anton Raichuk Matthieu Geist Olivier Pietquin OffRL 105 23 0 19 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks Robert McCarthy Qiang Wang S. Redmond OffRL 72 15 0 05 Oct 2021
Continuous-Time Fitted Value Iteration for Robust Policies M. Lutter Boris Belousov Shie Mannor Dieter Fox Animesh Garg Jan Peters 70 9 0 05 Oct 2021
Deep Reinforcement Learning with Adjustments H. Khorasgani Haiyan Wang Chetan Gupta Susumu Serita 25 2 0 28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles Shengduo Chen Yao Sun Dachuan Li Qiang Wang Qi Hao J. Sifakis 82 18 0 28 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms Ruizhi Chen Xiaoyu Wu Yansong Pan Kaizhao Yuan Ling Li ... Shaohui Peng Xishan Zhang Zidong Du Qi Guo Yunji Chen OffRL 61 3 0 04 Sep 2021