ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.05347
  4. Cited By
Return-based Scaling: Yet Another Normalisation Trick for Deep RL

Return-based Scaling: Yet Another Normalisation Trick for Deep RL

11 May 2021
Tom Schaul
Georg Ostrovski
Iurii Kemaev
Diana Borsa
ArXiv (abs)PDFHTML

Papers citing "Return-based Scaling: Yet Another Normalisation Trick for Deep RL"

13 / 13 papers shown
Title
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
181
4
0
21 Feb 2025
Streaming Deep Reinforcement Learning Finally Works
Streaming Deep Reinforcement Learning Finally Works
Mohamed Elsayed
Gautham Vasan
A. R. Mahmood
OffRL
116
6
0
18 Oct 2024
OPTIMA: Optimized Policy for Intelligent Multi-Agent Systems Enables
  Coordination-Aware Autonomous Vehicles
OPTIMA: Optimized Policy for Intelligent Multi-Agent Systems Enables Coordination-Aware Autonomous Vehicles
Rui Du
Kai Zhao
Jinlong Hou
Qiang Zhang
Peter Zhang
63
0
0
09 Oct 2024
Variational Best-of-N Alignment
Variational Best-of-N Alignment
Afra Amini
Tim Vieira
Ryan Cotterell
Ryan Cotterell
BDL
109
23
0
08 Jul 2024
Reward Centering
Reward Centering
Abhishek Naik
Yi Wan
Manan Tomar
Richard S. Sutton
62
7
0
16 May 2024
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned
  Approximations
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations
Robert J. Moss
Anthony Corso
J. Caers
Mykel J. Kochenderfer
71
7
0
31 May 2023
Decision-Focused Model-based Reinforcement Learning for Reward Transfer
Decision-Focused Model-based Reinforcement Learning for Reward Transfer
Abhishek Sharma
S. Parbhoo
Omer Gottesman
Finale Doshi-Velez
OffRL
47
0
0
06 Apr 2023
Backward Curriculum Reinforcement Learning
Backward Curriculum Reinforcement Learning
Kyungmin Ko
OnRL
40
0
0
29 Dec 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
94
30
0
15 Sep 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
114
38
0
03 Jul 2022
The Phenomenon of Policy Churn
The Phenomenon of Policy Churn
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
89
28
0
01 Jun 2022
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
193
680
0
30 Aug 2021
When should agents explore?
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
81
22
0
26 Aug 2021
1