Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.02298
Cited By
Rainbow: Combining Improvements in Deep Reinforcement Learning
6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rainbow: Combining Improvements in Deep Reinforcement Learning"
50 / 303 papers shown
Title
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
18
1
0
06 Sep 2022
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Zijian Gao
Yiying Li
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
35
0
0
24 Aug 2022
Reproducibility Report: Contrastive Learning of Socially-aware Motion Representations
Roop Sen
Sidharth Sinha
Parv Maheshwari
Animesh Jha
Debashish Chakravarty
16
0
0
18 Aug 2022
Recurrent networks, hidden states and beliefs in partially observable environments
Gaspard Lambrechts
Adrien Bolland
D. Ernst
16
12
0
06 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
29
13
0
05 Aug 2022
DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN
Chenwei Zhao
Miao Ye
Xingsi Xue
Jianhui Lv
Qiuxiang Jiang
Yong Wang
19
17
0
31 Jul 2022
Associative Memory Based Experience Replay for Deep Reinforcement Learning
Mengyuan Li
Arman Kazemi
Ann Franchesca Laguna
Sharon Hu
VLM
16
8
0
16 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
30
36
0
03 Jul 2022
DayDreamer: World Models for Physical Robot Learning
Philipp Wu
Alejandro Escontrela
Danijar Hafner
Ken Goldberg
Pieter Abbeel
52
277
0
28 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
27
31
0
10 Jun 2022
Hub-Pathway: Transfer Learning from A Hub of Pre-trained Models
Yang Shu
Zhangjie Cao
Ziyang Zhang
Jianmin Wang
Mingsheng Long
17
4
0
08 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron C. Courville
Marc G. Bellemare
OffRL
OnRL
31
63
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
32
35
0
02 Jun 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank D. Wood
Adam Scibior
47
7
0
30 May 2022
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
19
19
0
28 May 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Tao Luo
Peilin Liu
Z. Xu
16
2
0
25 May 2022
MetaSlicing: A Novel Resource Allocation Framework for Metaverse
N. Chu
D. Hoang
Diep N. Nguyen
Khoa T. Phan
E. Dutkiewicz
Dusist Niyato
Tao Shu
36
46
0
23 May 2022
Nuclear Norm Maximization Based Curiosity-Driven Learning
Chao Chen
Zijian Gao
Kele Xu
Sen Yang
Yiying Li
Bo Ding
Dawei Feng
Huaimin Wang
143
5
0
21 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
30
8
0
20 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
25
12
0
17 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Aaron C. Courville
OnRL
96
180
0
16 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
A. Schwing
RALM
19
12
0
12 May 2022
Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
I. Churin
Putra Manggala
Kata Naszádi
Michiel van der Meer
Taewoon Kim
LLMAG
33
30
0
05 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
C. Miao
OffRL
26
12
0
02 May 2022
Understanding and Preventing Capacity Loss in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
CLL
36
109
0
20 Apr 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
21
0
0
11 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
57
67
0
06 Apr 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances
Robby Costales
Shariq Iqbal
Fei Sha
26
5
0
23 Mar 2022
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies
Lukas M. Schmidt
Sebastian Rietsch
Axel Plinge
Bjoern M. Eskofier
Christopher Mutschler
OffRL
35
5
0
16 Mar 2022
Orchestrated Value Mapping for Reinforcement Learning
Mehdi Fatemi
Arash Tavakoli
19
8
0
14 Mar 2022
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation
Alex Long
Alan Blair
H. V. Hoof
23
3
0
07 Mar 2022
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking
Zeyu Fang
Jian Zhao
Mingyu Yang
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
28
10
0
22 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
32
90
0
19 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Regularized Q-learning
Han-Dong Lim
Donghwan Lee
19
10
0
11 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Planning and Learning with Adaptive Lookahead
Aviv A. Rosenberg
Assaf Hallak
Shie Mannor
Gal Chechik
Gal Dalal
21
7
0
28 Jan 2022
Quantile-Based Policy Optimization for Reinforcement Learning
Jinyang Jiang
Jiaqiao Hu
Yijie Peng
33
7
0
27 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning
Tong Mu
Georgios Theocharous
David Arbour
Emma Brunskill
30
6
0
30 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
19
30
0
16 Dec 2021
Human-Level Control through Directly-Trained Deep Spiking Q-Networks
Guisong Liu
Wenjie Deng
Xiurui Xie
Li Huang
Huajin Tang
OffRL
24
43
0
13 Dec 2021
Godot Reinforcement Learning Agents
E. Beeching
Jilles Debangoye
Olivier Simonin
Christian Wolf
GP
OnRL
21
5
0
07 Dec 2021
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
32
34
0
07 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
16
1
0
06 Dec 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
30
9
0
24 Nov 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
39
60
0
16 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
40
24
0
13 Nov 2021
Previous
1
2
3
4
5
6
7
Next