Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06887
Cited By
A Distributional Perspective on Reinforcement Learning
21 July 2017
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Distributional Perspective on Reinforcement Learning"
50 / 257 papers shown
Title
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
13
4
0
16 Sep 2021
Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition
Paul Festor
Giulia Luise
Matthieu Komorowski
A. Faisal
UD
OffRL
22
10
0
16 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
59
637
0
30 Aug 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
70
78
0
12 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
Amrit Singh Bedi
Anjaly Parayil
Junyu Zhang
Mengdi Wang
Alec Koppel
33
15
0
15 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
43
35
0
03 Jun 2021
Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making
Conor F. Hayes
T. Verstraeten
D. Roijers
Enda Howley
Patrick Mannion
34
14
0
02 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
42
77
0
01 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
53
0
11 May 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
34
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
27
36
0
18 Apr 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
31
37
0
17 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Chuheng Zhang
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
19
47
0
22 Feb 2021
Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
Jaskirat Singh
Liang Zheng
OffRL
21
3
0
14 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
48
143
0
08 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
47
73
0
01 Jan 2021
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
35
10
0
26 Dec 2020
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
22
4
0
24 Dec 2020
A case for new neural network smoothness constraints
Mihaela Rosca
T. Weber
Arthur Gretton
S. Mohamed
AAML
35
48
0
14 Dec 2020
Semi-supervised reward learning for offline reinforcement learning
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
68
23
0
12 Dec 2020
A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance
Minhui Huang
Shiqian Ma
Lifeng Lai
26
42
0
09 Dec 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
24
7
0
27 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
20
105
0
20 Nov 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
24
11
0
20 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
14
7
0
19 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
48
814
0
05 Oct 2020
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Seth Austin Harding
Haibin Wu
Siyue Hu
Shih-Wei Liao
29
9
0
09 Sep 2020
DRLE: Decentralized Reinforcement Learning at the Edge for Traffic Light Control in the IoV
Pengyuan Zhou
Xianfu Chen
Zhi Liu
Tristan Braud
Pan Hui
J. Kangasharju
6
54
0
03 Sep 2020
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
30
233
0
13 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
robo-gym -- An Open Source Toolkit for Distributed Deep Reinforcement Learning on Real and Simulated Robots
M. Lucchi
Friedemann Zindler
Stephan Mühlbacher-Karrer
Horst Pichler
OffRL
30
29
0
06 Jul 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
317
0
26 Jun 2020
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
33
9
0
26 Jun 2020
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
25
108
0
12 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
26
124
0
08 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
49
1,061
0
08 Apr 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
Amir H. Mosavi
Pedram Ghamisi
Yaser Faghan
Puhong Duan
OffRL
21
152
0
21 Mar 2020
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
67
772
0
19 Mar 2020
Distributional Robustness and Regularization in Reinforcement Learning
E. Derman
Shie Mannor
27
44
0
05 Mar 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
29
60
0
22 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
22
174
0
09 Jan 2020
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
21
25
0
31 Dec 2019
Previous
1
2
3
4
5
6
Next