Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.14826
Cited By
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
20 November 2020
J. Obando-Ceron
Pablo Samuel Castro
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research"
50 / 61 papers shown
Title
Unraveling the Rainbow: can value-based methods schedule?
Arthur Corrêa
Alexandre Jesus
Cristóvão Silva
Samuel Moniz
OffRL
37
0
0
06 May 2025
β
\beta
β
-DQN: Improving Deep Q-Learning By Evolving the Behavior
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
35
2
0
03 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
142
2
0
02 Oct 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
23
2
0
26 Jul 2024
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
49
7
0
26 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
48
6
0
25 Jun 2024
Neural-Kernel Conditional Mean Embeddings
Eiki Shimizu
Kenji Fukumizu
Dino Sejdinovic
43
3
0
16 Mar 2024
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother
Jordi Orbay
Q. Vuong
Adrien Ali Taïga
Yevgen Chebotar
...
Sergey Levine
Pablo Samuel Castro
Aleksandra Faust
Aviral Kumar
Rishabh Agarwal
OffRL
56
56
0
06 Mar 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
38
17
0
19 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
58
29
0
13 Feb 2024
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Maciej Wolczyk
Bartłomiej Cupiał
M. Ostaszewski
Michal Bortkiewicz
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
48
13
0
05 Feb 2024
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
I-Chen Wu
21
8
0
19 Dec 2023
Meta-Learning Strategies through Value Maximization in Neural Networks
Rodrigo Carrasco-Davis
Javier Masís
Andrew M. Saxe
27
1
0
30 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
30
1
0
26 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
41
15
0
19 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
34
14
0
05 Oct 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
24
1
0
14 Aug 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
40
20
0
31 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
26
19
0
12 Jul 2023
Hyperparameters in Reinforcement Learning and How To Tune Them
Theresa Eimer
Marius Lindauer
Roberta Raileanu
OffRL
29
35
0
02 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
83
0
30 May 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
28
20
0
29 May 2023
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
16
2
0
23 May 2023
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models
Junmo Kang
Wei-ping Xu
Alan Ritter
47
15
0
02 May 2023
Empirical Design in Reinforcement Learning
Andrew Patterson
Samuel Neumann
Martha White
Adam White
17
21
0
03 Apr 2023
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
Sotetsu Koyamada
Shinri Okano
Soichiro Nishimori
Y. Murata
Keigo Habara
Haruka Kita
Shin Ishii
21
24
0
29 Mar 2023
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
26
4
0
21 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
23
0
0
10 Dec 2022
Learning on Graphs for Mineral Asset Valuation Under Supply and Demand Uncertainty
Yassine Yaakoubi
Hager Radi
R. Dimitrakopoulos
25
0
0
07 Dec 2022
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
29
8
0
15 Oct 2022
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks
Adrian Ly
Richard Dazeley
Peter Vamplew
Francisco Cruz
Sunil Aryal
15
8
0
07 Oct 2022
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison
Penny Sweetser
Marcus Hutter
50
19
0
05 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
16
6
0
03 Oct 2022
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Shivakanth Sujit
Somjit Nath
Pedro H. M. Braga
Samira Ebrahimi Kahou
50
15
0
22 Aug 2022
Robots Enact Malignant Stereotypes
Andrew Hundt
William Agnew
V. Zeng
Severin Kacianka
Matthew C. Gombolay
LM&Ro
35
41
0
23 Jul 2022
Reinforcement Learning for Economic Policy: A New Frontier?
C. Tilbury
OffRL
8
3
0
16 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Han Wang
Archit Sakhadeo
Adam White
James Bell
Vincent Liu
Xutong Zhao
Puer Liu
Tadashi Kozuno
Alona Fyshe
Martha White
OffRL
OnRL
22
7
0
18 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
28
12
0
17 May 2022
Continual Learning with Foundation Models: An Empirical Study of Latent Replay
O. Ostapenko
Timothée Lesort
P. Rodríguez
Md Rifat Arefin
Arthur Douillard
Irina Rish
Laurent Charlin
34
51
0
30 Apr 2022
Proper Reuse of Image Classification Features Improves Object Detection
C. N. Vasconcelos
Vighnesh Birodkar
Vincent Dumoulin
VLM
17
32
0
01 Apr 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives
Toshinori Kitamura
Ryo Yonetani
OffRL
32
4
0
08 Dec 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
20
12
0
19 Nov 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
16
57
0
26 Oct 2021
Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
Hsuan-Yu Yao
Kai-Chun Hu
Liang-Chun Ouyang
I-Chen Wu
32
1
0
26 Oct 2021
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
34
2
0
06 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
53
13
0
04 Oct 2021
1
2
Next