Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.10044
Cited By
Distributional Reinforcement Learning with Quantile Regression
27 October 2017
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Distributional Reinforcement Learning with Quantile Regression"
50 / 401 papers shown
Title
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
40
12
0
14 Jun 2022
Robust Reinforcement Learning with Distributional Risk-averse formulation
Pierre Clavier
S. Allassonnière
E. L. Pennec
OOD
39
7
0
14 Jun 2022
Conformal Off-policy Prediction
Yingying Zhang
C. Shi
Shuang Luo
OffRL
38
10
0
14 Jun 2022
Conformal Prediction Intervals for Markov Decision Process Trajectories
Thomas G. Dietterich
Jesse Hostetler
11
17
0
10 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning
David Brandfonbrener
Rémi Tachet des Combes
Romain Laroche
OffRL
39
5
0
02 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
26
0
0
31 May 2022
A Simulation Environment and Reinforcement Learning Method for Waste Reduction
Sami Jullien
Mozhdeh Ariannezhad
Paul T. Groth
Maarten de Rijke
OOD
OffRL
11
4
0
30 May 2022
Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis
Tim Pearce
Jong-Hyeon Jeong
Yichen Jia
Jun Zhu
29
2
0
26 May 2022
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
Harley Wiltzer
David Meger
Marc G. Bellemare
19
13
0
24 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
30
12
0
17 May 2022
q
q
q
-Munchausen Reinforcement Learning
Lingwei Zhu
Zheng Chen
E. Uchibe
Takamitsu Matsubara
OffRL
14
0
0
16 May 2022
Enforcing KL Regularization in General Tsallis Entropy Reinforcement Learning via Advantage Learning
Lingwei Zhu
Zheng Chen
E. Uchibe
Takamitsu Matsubara
14
1
0
16 May 2022
Interpretable Stochastic Model Predictive Control using Distributional Reinforced Estimation for Quadrotor Tracking Systems
Yanran Wang
James O’Keeffe
Qiuchen Qian
David E. Boyle
16
3
0
14 May 2022
Efficient Risk-Averse Reinforcement Learning
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
41
39
0
10 May 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
30
88
0
08 May 2022
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
46
9
0
21 Apr 2022
Understanding and Preventing Capacity Loss in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
CLL
41
110
0
20 Apr 2022
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics
Yannis Flet-Berliac
D. Basu
AAML
28
8
0
20 Apr 2022
JORLDY: a fully customizable open source framework for reinforcement learning
Kyushik Min
Hyunho Lee
Kwansu Shin
Tae-woo Lee
Hojoon Lee
Jinwon Choi
Sung-Hyun Son
OnRL
19
0
0
11 Apr 2022
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Bingqin Zhu
Yongkang Wang
Xingxing Wang
Dong Wang
25
4
0
02 Apr 2022
Investigating the Properties of Neural Network Representations in Reinforcement Learning
Han Wang
Erfan Miahi
Martha White
Marlos C. Machado
Zaheer Abbas
Raksha Kumaraswamy
Vincent Liu
Adam White
25
26
0
30 Mar 2022
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning
Cheng Liu
E. Kampen
Guido de Croon
39
16
0
28 Mar 2022
Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks
Haobo Jiang
Jin Xie
Jian Yang
OffRL
11
10
0
22 Mar 2022
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks
Fan Wu
Linyi Li
Chejian Xu
Huan Zhang
B. Kailkhura
K. Kenthapadi
Ding Zhao
Bo-wen Li
AAML
OffRL
34
34
0
16 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning
Joar Skalse
Matthew Farrugia-Roberts
Stuart J. Russell
Alessandro Abate
Adam Gleave
11
46
0
14 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRL
OnRL
32
10
0
26 Feb 2022
MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning
Jian Zhao
Mingyu Yang
Youpeng Zhao
Xu Hu
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
25
3
0
21 Feb 2022
MuZero with Self-competition for Rate Control in VP9 Video Compression
Amol Mandhane
A. Zhernov
Maribeth Rauh
Chenjie Gu
Miaosen Wang
...
Jackson Broshear
Julian Schrittwieser
Thomas Hubert
Oriol Vinyals
Timothy A. Mann
37
44
0
14 Feb 2022
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Michael Teng
M. van de Panne
Frank Wood
OOD
OffRL
14
1
0
06 Feb 2022
Distributional Reinforcement Learning by Sinkhorn Divergence
Ke Sun
Yingnan Zhao
Wulong Liu
Bei Jiang
Linglong Kong
35
0
0
01 Feb 2022
Reinforcement Learning with Heterogeneous Data: Estimation and Inference
Elynn Y. Chen
Rui Song
Michael I. Jordan
OffRL
24
10
0
31 Jan 2022
Deep Non-Crossing Quantiles through the Partial Derivative
Axel Brando
J. Gimeno
Jose A. Rodríguez-Serrano
Jordi Vitrià
48
13
0
30 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
Robustness and risk management via distributional dynamic programming
Mastane Achab
Gergely Neu
17
7
0
28 Dec 2021
Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs
Ezgi Korkmaz
27
25
0
16 Dec 2021
Conjugated Discrete Distributions for Distributional Reinforcement Learning
Björn Lindenberg
Jonas Nordqvist
Karl-Olof Lindahl
OffRL
19
2
0
14 Dec 2021
Autoregressive Quantile Flows for Predictive Uncertainty Estimation
Phillip Si
Allan Bishop
Volodymyr Kuleshov
BDL
UQCV
AI4TS
25
20
0
09 Dec 2021
Quantile Filtered Imitation Learning
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
33
6
0
02 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
39
4
0
29 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
32
9
0
24 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
21
99
0
19 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
28
12
0
19 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
A Dataset Perspective on Offline Reinforcement Learning
Kajetan Schweighofer
Andreas Radler
Marius-Constantin Dinu
M. Hofmarcher
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
30
17
0
08 Nov 2021
Batch Reinforcement Learning from Crowds
Guoxi Zhang
H. Kashima
OffRL
40
5
0
08 Nov 2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
51
46
0
06 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
Previous
1
2
3
4
5
6
7
8
9
Next