Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.19054
Cited By
Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning
25 May 2025
Zhuochen Liu
Rahul Jain
Quan Nguyen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning"
20 / 20 papers shown
Title
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
50
9
0
05 Feb 2024
CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion
Guillaume Bellegarda
A. Ijspeert
48
80
0
01 Nov 2022
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
96
564
0
08 Jul 2021
Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning
J. Siekmann
Kevin R. Green
John Warila
Alan Fern
J. Hurst
43
187
0
18 May 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning
Guillaume Bellegarda
Yiyu Chen
Zhuochen Liu
Quan Nguyen
41
45
0
11 Mar 2021
Deep Randomized Neural Networks
Claudio Gallicchio
Simone Scardapane
OOD
60
62
0
27 Feb 2020
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
60
649
0
01 Jul 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
120
5,121
0
26 Feb 2018
Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data
Y. Liu
Ning Liu
B. Logan
Zhiyuan Xu
Jian Tang
Yanzhi Wang
OffRL
OOD
45
102
0
28 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
152
8,236
0
04 Jan 2018
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
39
172
0
27 Nov 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
33
624
0
17 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
179
18,685
0
20 Jul 2017
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
56
344
0
07 Nov 2016
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving
Shai Shalev-Shwartz
Shaked Shammah
Amnon Shashua
21
828
0
11 Oct 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
134
8,805
0
04 Feb 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
104
13,174
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
20
3,368
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
206
6,722
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
41
12,163
0
19 Dec 2013
1