Rainbow: Combining Improvements in Deep Reinforcement Learning

arXiv: 1710.02298

6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
    OffRL

Papers citing "Rainbow: Combining Improvements in Deep Reinforcement Learning"

50 / 307 papers shown
Title
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
40
24
0
13 Nov 2021
Deep Reinforcement Model Selection for Communications Resource Allocation in On-Site Medical Care
Steffen Gracla
Edgar Beck
C. Bockelmann
Armin Dekorsy
19
1
0
12 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
37
1
0
11 Nov 2021
Hybrid BYOL-ViT: Efficient approach to deal with small datasets
Safwen Naimi
Rien van Leeuwen
W. Souidène
S. B. Saoud
SSL
ViT
25
2
0
08 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
60
100
0
06 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
32
41
0
04 Nov 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
40
222
0
30 Oct 2021
GrowSpace: Learning How to Shape Plants
Yasmeen Hitti
Ionelia Buzatu
Manuel Del Verme
M. Lefsrud
Florian Golemo
A. Durand
19
2
0
15 Oct 2021
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
Arthur Szlam
Yuxuan Sun
Katja Hofmann
Michel Galley
Ahmed Hassan Awadallah
LLMAG
70
15
0
13 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
42
16
0
07 Oct 2021
Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model
Alexander Sieusahai
Matthew J. Guzdial
35
13
0
07 Oct 2021
Learning Multi-Objective Curricula for Robotic Policy Learning
Jikun Kang
Miao Liu
Abhinav Gupta
C. Pal
Xue Liu
Jie Fu
39
4
0
06 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
16
33
0
05 Oct 2021
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
21
17
0
29 Sep 2021
The $f$-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
34
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
11
30
0
24 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
16
58
0
22 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
8
4
0
16 Sep 2021
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
33
127
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
25
6
0
13 Sep 2021
An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment
J. Plank
Catherine D. Schuman
Robert M. Patton
21
0
0
02 Sep 2021
Interactive Machine Comprehension with Dynamic Knowledge Graphs
Xingdi Yuan
34
3
0
31 Aug 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
38
119
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
59
633
0
30 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
Graph Attention Network-based Multi-agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network
Yan Shao
Rongpeng Li
Bing Hu
Yingxiao Wu
Zhifeng Zhao
Honggang Zhang
33
46
0
11 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
16
0
0
04 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
25
38
0
31 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
36
337
0
20 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
70
78
0
12 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adrià Puigdomènech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
30
36
0
12 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
26
134
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
20
7
0
30 Jun 2021
Learning Task Informed Abstractions
Xiang Fu
Ge Yang
Pulkit Agrawal
Tommi Jaakkola
15
65
0
29 Jun 2021
Zoo-Tuning: Adaptive Transfer from a Zoo of Models
Yang Shu
Zhi Kou
Zhangjie Cao
Jianmin Wang
Mingsheng Long
29
44
0
29 Jun 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
24
12
0
29 Jun 2021
Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots
Katie Kang
G. Kahn
Sergey Levine
37
5
0
24 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
28
15
0
10 Jun 2021
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
20
46
0
08 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
19
46
0
05 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
43
35
0
03 Jun 2021
OctoPath: An OcTree Based Self-Supervised Learning Approach to Local Trajectory Planning for Mobile Robots
Bogdan Trasnea
Cosmin Ginerica
Mihai V. Zaha
G. Macesanu
C. Pozna
Sorin Grigorescu
24
8
0
02 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
18
5
0
01 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
21
52
0
11 May 2021
A Deep Reinforcement Learning Approach for the Meal Delivery Problem
H. Jahanshahi
Aysun Bozanta
Mucahit Cevik
E. M. Kavuk
Ayse Tosun Misirli
Sibel B. Sonuc
Bilgin Kosucu
Ayse Basar
42
28
0
24 Apr 2021
Learning on a Budget via Teacher Imitation
Ercüment Ilhan
Jeremy Gow
Diego Perez-Liebana
OffRL
22
2
0
17 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
28
37
0
17 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021