Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.02298
Cited By
Rainbow: Combining Improvements in Deep Reinforcement Learning
6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rainbow: Combining Improvements in Deep Reinforcement Learning"
50 / 303 papers shown
Title
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Y. Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
44
2
0
27 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
33
7
0
14 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
82
0
30 May 2023
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
15
6
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
31
9
0
29 May 2023
Accelerating Value Iteration with Anchoring
Jongmin Lee
Ernest K. Ryu
18
7
0
26 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
41
19
0
22 May 2023
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
28
22
0
21 May 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
24
31
0
20 Apr 2023
Autonomous Agent for Beyond Visual Range Air Combat: A Deep Reinforcement Learning Approach
Joao P. A. Dantas
Marcos R. O. A. Máximo
Takashi Yoneyama
24
4
0
19 Apr 2023
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
M. Neves
Pedro Neto
OffRL
18
17
0
13 Apr 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
47
8
0
07 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
33
15
0
07 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
88
0
24 Feb 2023
Selective experience replay compression using coresets for lifelong deep reinforcement learning in medical imaging
Guangyao Zheng
Samson Zhou
Vladimir Braverman
M. Jacobs
V. Parekh
OffRL
CLL
24
3
0
22 Feb 2023
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Robot path planning using deep reinforcement learning
Miguel Quinones-Ramirez
Jorge Ríos-Martínez
Víctor Uc Cetina
SSL
25
5
0
17 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
45
12
0
13 Feb 2023
Distributional GFlowNets with Quantile Flows
Dinghuai Zhang
L. Pan
Ricky T. Q. Chen
Aaron Courville
Yoshua Bengio
29
25
0
11 Feb 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
L. Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
18
14
0
27 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
16
18
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
26
0
0
26 Jan 2023
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning
Maxime Bouton
Jaeseong Jeong
José Outes Carnero
Adriano Mendo
Alexandros Nikou
24
1
0
20 Jan 2023
Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles at Complex and Unsignalized Intersections
Dawei Wang
Weizi Li
Lei Zhu
Jia-Yu Pan
40
15
0
12 Jan 2023
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
29
5
0
12 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
58
9
0
30 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
37
13
0
05 Dec 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
35
0
0
23 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
32
186
0
08 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
49
8
0
06 Nov 2022
Achieving mouse-level strategic evasion performance using real-time computational planning
German Espinosa
Gabrielle E. Wink
Alexander T. Lai
D. Dombeck
Malcolm A. MacIver
11
2
0
04 Nov 2022
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Jiachen Yang
K. Mittal
T. Dzanic
S. Petrides
B. Keith
Brenden K. Petersen
Daniel Faissol
R. Anderson
31
8
0
02 Nov 2022
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
35
14
0
26 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
18
0
0
23 Oct 2022
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
35
22
0
22 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Deep Reinforcement Learning for Inverse Inorganic Materials Design
Elton Pan
Christopher Karpovich
E. Olivetti
AI4CE
24
11
0
21 Oct 2022
Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning
Arsène Fansi Tchango
Rishab Goel
Julien Martel
Zhi Wen
G. Caron
J. Ghosn
29
11
0
13 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
13
0
0
10 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
21
7
0
07 Oct 2022
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
43
20
0
04 Oct 2022
Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders
Per-Arne Andersen
Ole-Christoffer Granmo
Morten Goodwin
OOD
28
0
0
03 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
D. Meger
Gregory Dudek
19
2
0
01 Oct 2022
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
38
6
0
22 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
46
12
0
21 Sep 2022
Deep Generalized Schrödinger Bridge
Guan-Horng Liu
T. Chen
Oswin So
Evangelos A. Theodorou
OT
AI4CE
16
35
0
20 Sep 2022
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
21
16
0
16 Sep 2022
Previous
1
2
3
4
5
6
7
Next