Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.06923
Cited By
Implicit Quantile Networks for Distributional Reinforcement Learning
14 June 2018
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Implicit Quantile Networks for Distributional Reinforcement Learning"
50 / 112 papers shown
Title
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
24
0
21 Jul 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
16
8
0
20 Jun 2022
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
Chengyang Ying
Xinning Zhou
Hang Su
Dong Yan
Ning Chen
Jun Zhu
24
41
0
09 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
26
0
0
31 May 2022
Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
Yoshinari Motokawa
T. Sugawara
30
2
0
19 May 2022
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
46
9
0
21 Apr 2022
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning
Cheng Liu
E. Kampen
Guido de Croon
39
16
0
28 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Multivariate Quantile Function Forecaster
Kelvin K. Kan
Franccois-Xavier Aubet
Tim Januschowski
Youngsuk Park
Konstantinos Benidis
Lars Ruthotto
Jan Gasthaus
AI4TS
39
22
0
23 Feb 2022
MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning
Jian Zhao
Mingyu Yang
Youpeng Zhao
Xu Hu
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
25
3
0
21 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
34
91
0
19 Feb 2022
MuZero with Self-competition for Rate Control in VP9 Video Compression
Amol Mandhane
A. Zhernov
Maribeth Rauh
Chenjie Gu
Miaosen Wang
...
Jackson Broshear
Julian Schrittwieser
Thomas Hubert
Oriol Vinyals
Timothy A. Mann
37
44
0
14 Feb 2022
Deep Non-Crossing Quantiles through the Partial Derivative
Axel Brando
J. Gimeno
Jose A. Rodríguez-Serrano
Jordi Vitrià
48
13
0
30 Jan 2022
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
J. Luis
Skylar Eiskowitz
Nils Pachler de la Osa
E. Crawley
B. Cameron
23
5
0
11 Dec 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
39
60
0
16 Nov 2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
51
46
0
06 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
22
6
0
02 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
852
0
12 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
56
13
0
04 Oct 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
38
1
0
08 Sep 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
61
639
0
30 Aug 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
27
6
0
07 Jul 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
48
35
0
03 Jun 2021
Learning to drive from a world on rails
Di Chen
V. Koltun
Philipp Krahenbuhl
98
116
0
03 May 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
39
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
29
36
0
18 Apr 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
37
10
0
26 Dec 2020
A case for new neural network smoothness constraints
Mihaela Rosca
T. Weber
Arthur Gretton
S. Mohamed
AAML
35
48
0
14 Dec 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
24
7
0
27 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
20
105
0
20 Nov 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
53
823
0
05 Oct 2020
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Seth Austin Harding
Haibin Wu
Siyue Hu
Shih-Wei Liao
29
9
0
09 Sep 2020
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
49
146
0
17 Jul 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles
Pengyue Wang
Yuante Li
Shashi Shekhar
W. Northrop
AAML
21
8
0
01 Jun 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
174
0
09 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
44
205
0
25 Nov 2019
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
27
75
0
09 Nov 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
28
50
0
01 Oct 2019
Harnessing Structures for Value-Based Planning and Reinforcement Learning
Yuzhe Yang
Guo Zhang
Zhi Xu
Dina Katabi
OffRL
27
31
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019
Previous
1
2
3
Next