Rainbow: Combining Improvements in Deep Reinforcement Learning

6 October 2017

Dan Horgan

Bilal Piot

M. G. Azar

David Silver

OffRL

Papers citing "Rainbow: Combining Improvements in Deep Reinforcement Learning"

50 / 303 papers shown

Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning Simo Alami C. Rim Kaddah Jesse Read Marie-Paule Cani 48 0 0 07 May 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model Moritz A. Zanger Pascal R. van der Vaart Wendelin Bohmer M. Spaan UQCV BDL 149 0 0 14 Mar 2025
Reinforcement Learning-based Threat Assessment Wuzhou Sun Siyi Li Qingxiang Zou Zixing Liao AAML 76 0 0 04 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits Lily Xu Bryan Wilder Elias B. Khalil Milind Tambe 72 1 0 01 Mar 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Thomas Schmied Thomas Adler Vihang Patil M. Beck Korbinian Poppel Johannes Brandstetter G. Klambauer Razvan Pascanu Sepp Hochreiter 75 5 0 21 Feb 2025
Evolution and The Knightian Blindspot of Machine Learning Joel Lehman Elliot Meyerson Tarek El-Gaaly Kenneth O. Stanley Tarin Ziyaee 86 1 0 22 Jan 2025
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey Zhihong Liu Xin Xu Peng Qiao Dongsheng Li OffRL 22 2 0 08 Nov 2024
Entity-based Reinforcement Learning for Autonomous Cyber Defence Isaac Symes Thompson Alberto Caron Chris Hicks V. Mavroudis AAML 51 2 0 23 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL Ömer Veysel Çağatan Barış Akgün OffRL 34 0 0 22 Oct 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning Hikaru Shindo Quentin Delfosse Devendra Singh Dhami Kristian Kersting 43 3 0 15 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control Ehsan Futuhi Shayan Karimi Chao Gao Martin Müller 38 1 0 07 Oct 2024
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization Hanyang Zhao Genta Indra Winata Anirban Das Shi-Xiong Zhang D. Yao Wenpin Tang Sambit Sahu 54 5 0 05 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL Ghada Sokar J. Obando-Ceron Aaron C. Courville Hugo Larochelle Pablo Samuel Castro MoE 127 2 0 02 Oct 2024
Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids Caio Fabio Oliveira da Silva Azita Dabiri B. de Schutter 50 4 0 17 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL Denis Tarasov Anja Surina Çağlar Gülçehre OffRL AI4CE 53 1 0 11 Sep 2024
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning Shreyas S R OffRL OnRL 28 0 0 10 Sep 2024
Reinforcement Learning for Sustainable Energy: A Survey Koen Ponse Felix Kleuker Márton Fejér Álvaro Serra-Gómez Aske Plaat Thomas M. Moerland OffRL AI4CE 40 1 0 26 Jul 2024
Functional Acceleration for Policy Mirror Descent Veronica Chelu Doina Precup 30 0 0 23 Jul 2024
Simplifying Deep Temporal Difference Learning Matteo Gallici Mattie Fellows Benjamin Ellis B. Pou Ivan Masmitja Jakob Foerster Mario Martin OffRL 62 15 0 05 Jul 2024
Normalization and effective learning rates in reinforcement learning Clare Lyle Zeyu Zheng Khimya Khetarpal James Martens H. V. Hasselt Razvan Pascanu Will Dabney 19 7 0 01 Jul 2024
Towards shutdownable agents via stochastic choice Elliott Thornley Alexander Roman Christos Ziakas Leyton Ho Louis Thomson 38 0 0 30 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu Ziyu Huan Xiyao Wang Jiafei Lyu Jian Tao Xiu Li Furong Huang Huazhe Xu LM&Ro LRM AI4CE 46 1 0 11 Jun 2024
Mimicry and the Emergence of Cooperative Communication Dylan R. Cope Peter McBurney 35 0 0 26 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning Zhepeng Cen Yi-Fan Yao Zuxin Liu Ding Zhao OffRL 40 3 0 20 May 2024
Dominion: A New Frontier for AI Research Danny Halawi Aron Sarmasi Siena Saltzen Joshua McCoy OffRL 19 0 0 10 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes Kyle Stachowicz Sergey Levine 17 6 0 07 May 2024
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments Diego Martínez Baselga L. Riazuelo Luis Montano 92 1 0 25 Apr 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning Weiwei Gu Senquan Wang 43 5 0 12 Mar 2024
Koopman-Assisted Reinforcement Learning Preston Rozwood Edward Mehrez Ludger Paehler Wen Sun Steven L. Brunton 40 6 0 04 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang 42 3 0 28 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides Paul Daoudi Bogdan Robu Christophe Prieur Ludovic Dos Santos M. Barlier OnRL 31 3 0 21 Feb 2024
Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks using Adaptive Potential Functions Yifei Chen Lambert Schomaker Francisco Cruz 38 0 0 07 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent Yingru Li Jiawei Xu Lei Han Zhi-Quan Luo BDL OffRL 26 6 0 05 Feb 2024
A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning Xiaolong Zhao Yiming Zhao Ming Li Tingting Li Qian Liu Shuai Guo Xuexi Yi 15 1 0 29 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation Paul Daoudi Mathias Formoso Othman Gaizi Achraf Azize Evrard Garcelon OffRL 26 0 0 24 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning Md Saiful Islam Srijita Das S. Gottipati William Duguay Clodéric Mars Jalal Arabneydi Antoine Fagette Matthew J. Guzdial Matthew E. Taylor 38 1 0 23 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey Dom Huh Prasant Mohapatra AI4CE 36 8 0 15 Dec 2023
An Invitation to Deep Reinforcement Learning Bernhard Jaeger Andreas Geiger OffRL OOD 78 5 0 13 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering Yuanyuan Guo Zehua Zang Hang Gao Xiao Xu Rui Wang Lixiang Liu Jiangmeng Li 26 5 0 08 Dec 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications Jun Wang Hosein Hasanbeig Kaiyuan Tan Zihe Sun Y. Kantaros 35 3 0 28 Nov 2023
From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex? Yannik Keller Jannis Blüml Gopika Sudhakaran Kristian Kersting GNN 24 0 0 22 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning M. Gerstgrasser Tom Danino Sarah Keren 23 5 0 01 Nov 2023
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes Pablo Samuel Castro Tyler Kastner Prakash Panangaden Mark Rowland 38 4 0 05 Oct 2023
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning Hao Qin Zhaozhou Wu Xingqi Zhang 16 0 0 31 Aug 2023
Data-Efficient Online Learning of Ball Placement in Robot Table Tennis Philip Tobuschat Hao Ma Le Chen Bernhard Schölkopf Michael Muehlebach 30 1 0 28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning Jacob Wiebe Ranwa Al Mallah Li Li AAML 36 3 0 25 Aug 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents Amirhossein Zolfagharian Manel Abdellatif Lionel C. Briand S. Ramesh 25 5 0 03 Aug 2023
On-Robot Bayesian Reinforcement Learning for POMDPs Hai V. Nguyen Sammie Katt Yuchen Xiao Chris Amato OffRL 21 1 0 22 Jul 2023
PASTA: Pretrained Action-State Transformer Agents Raphael Boige Yannis Flet-Berliac Arthur Flajolet Guillaume Richard Thomas Pierrot LM&Ro OffRL 37 5 0 20 Jul 2023
Meta-Value Learning: a General Framework for Learning with Learning Awareness Tim Cooijmans Milad Aghajohari Aaron C. Courville 21 6 0 17 Jul 2023