Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning

25 May 2025

Papers citing "Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning"

20 / 20 papers shown

Title
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning Shengyi Huang Quentin Gallouedec Florian Felten Antonin Raffin Rousslan Fernand Julien Dossa ... Alexander Nikulin Xiao Hu Tianlin Liu Jongwook Choi Brent Yi OffRL 50 9 0 05 Feb 2024
CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion Guillaume Bellegarda A. Ijspeert 48 80 0 01 Nov 2022
RMA: Rapid Motor Adaptation for Legged Robots Ashish Kumar Zipeng Fu Deepak Pathak Jitendra Malik 96 564 0 08 Jul 2021
Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning J. Siekmann Kevin R. Green John Warila Alan Fern J. Hurst 43 187 0 18 May 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning Guillaume Bellegarda Yiyu Chen Zhuochen Liu Quan Nguyen 41 45 0 11 Mar 2021
Deep Randomized Neural Networks Claudio Gallicchio Simone Scardapane OOD 60 62 0 27 Feb 2020
Learning to Drive in a Day Alex Kendall Jeffrey Hawke David Janz Przemyslaw Mazur Daniele Reda John M. Allen Vinh-Dieu Lam Alex Bewley Amar Shah 60 649 0 01 Jul 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 120 5,121 0 26 Feb 2018
Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data Y. Liu Ning Liu B. Logan Zhiyuan Xu Jian Tang Yanzhi Wang OffRL OOD 45 102 0 28 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 152 8,236 0 04 Jan 2018
Deep Reinforcement Learning for Sepsis Treatment Aniruddh Raghu Matthieu Komorowski Imran Ahmed Leo Anthony Celi Peter Szolovits Marzyeh Ghassemi OffRL 39 172 0 27 Nov 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Yuhuai Wu Elman Mansimov Shun Liao Roger C. Grosse Jimmy Ba OffRL 33 624 0 17 Aug 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 179 18,685 0 20 Jul 2017
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic S. Gu Timothy Lillicrap Zoubin Ghahramani Richard Turner Sergey Levine OffRL BDL 56 344 0 07 Nov 2016
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving Shai Shalev-Shwartz Shaked Shammah Amnon Shashua 21 828 0 11 Oct 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 134 8,805 0 04 Feb 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 104 13,174 0 09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel OffRL 20 3,368 0 08 Jun 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 206 6,722 0 19 Feb 2015
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 41 12,163 0 19 Dec 2013