Return-based Scaling: Yet Another Normalisation Trick for Deep RL

11 May 2021

Georg Ostrovski

Papers citing "Return-based Scaling: Yet Another Normalisation Trick for Deep RL"

13 / 13 papers shown

Title | Authors | Tag | Counts | Date
Hyperspherical Normalization for Scalable Deep Reinforcement Learning | Hojoon Lee, Youngdo Lee, Takuma Seno, Donghu Kim, Peter Stone, Jaegul Choo | - | 181 4 0 | 21 Feb 2025
Streaming Deep Reinforcement Learning Finally Works | Mohamed Elsayed, Gautham Vasan, A. R. Mahmood | OffRL | 116 6 0 | 18 Oct 2024
OPTIMA: Optimized Policy for Intelligent Multi-Agent Systems Enables Coordination-Aware Autonomous Vehicles | Rui Du, Kai Zhao, Jinlong Hou, Qiang Zhang, Peter Zhang | - | 63 0 0 | 09 Oct 2024
Variational Best-of-N Alignment | Afra Amini, Tim Vieira, Ryan Cotterell | BDL | 109 23 0 | 08 Jul 2024
Reward Centering | Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton | - | 62 7 0 | 16 May 2024
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations | Robert J. Moss, Anthony Corso, J. Caers, Mykel J. Kochenderfer | - | 71 7 0 | 31 May 2023
Decision-Focused Model-based Reinforcement Learning for Reward Transfer | Abhishek Sharma, S. Parbhoo, Omer Gottesman, Finale Doshi-Velez | OffRL | 47 0 0 | 06 Apr 2023
Backward Curriculum Reinforcement Learning | Kyungmin Ko | OnRL | 40 0 0 | 29 Dec 2022
Human-level Atari 200x faster | Steven Kapturowski, Victor Campos, Ray Jiang, Nemanja Rakićević, Hado van Hasselt, Charles Blundell, Adria Puigdomenech Badia | OffRL | 94 30 0 | 15 Sep 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels | Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan | - | 114 38 0 | 03 Jul 2022
The Phenomenon of Policy Churn | Tom Schaul, André Barreto, John Quan, Georg Ostrovski | - | 89 28 0 | 01 Jun 2022
Deep Reinforcement Learning at the Edge of the Statistical Precipice | Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare | OffRL | 193 680 0 | 30 Aug 2021
When should agents explore? | Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul | - | 81 22 0 | 26 Aug 2021