ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.10044
  4. Cited By
Distributional Reinforcement Learning with Quantile Regression

Distributional Reinforcement Learning with Quantile Regression

27 October 2017
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
ArXivPDFHTML

Papers citing "Distributional Reinforcement Learning with Quantile Regression"

50 / 401 papers shown
Title
Resource-Constrained Station-Keeping for Helium Balloons using
  Reinforcement Learning
Resource-Constrained Station-Keeping for Helium Balloons using Reinforcement Learning
Jack D. Saunders
Loïc Prenevost
Özgür Simsek
Alan Hunter
Wenbin Li
12
1
0
02 Mar 2023
A Survey on Uncertainty Quantification Methods for Deep Learning
A Survey on Uncertainty Quantification Methods for Deep Learning
Wenchong He
Zhe Jiang
Tingsong Xiao
Zelin Xu
Yukun Li
BDL
UQCV
AI4CE
19
19
0
26 Feb 2023
Constrained Reinforcement Learning using Distributional Representation
  for Trustworthy Quadrotor UAV Tracking Control
Constrained Reinforcement Learning using Distributional Representation for Trustworthy Quadrotor UAV Tracking Control
Yanran Wang
David E. Boyle
33
8
0
22 Feb 2023
Distributional Offline Policy Evaluation with Predictive Error
  Guarantees
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
38
13
0
19 Feb 2023
Distributional GFlowNets with Quantile Flows
Distributional GFlowNets with Quantile Flows
Dinghuai Zhang
L. Pan
Ricky T. Q. Chen
Aaron Courville
Yoshua Bengio
34
25
0
11 Feb 2023
Distributional constrained reinforcement learning for supply chain
  optimization
Distributional constrained reinforcement learning for supply chain optimization
J. Berm\údez
Antonio del Rio-Chanona
Calvin Tsay
26
5
0
03 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
Revisiting Bellman Errors for Offline Model Selection
Revisiting Bellman Errors for Offline Model Selection
Joshua P. Zitovsky
Daniel de Marchi
Rishabh Agarwal
Michael R. Kosorok University of North Carolina at Chapel Hill
OffRL
35
5
0
31 Jan 2023
Generalized Munchausen Reinforcement Learning using Tsallis KL
  Divergence
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Lingwei Zhu
Zheng Chen
Takamitsu Matsubara
Martha White
14
1
0
27 Jan 2023
Trust Region-Based Safe Distributional Reinforcement Learning for
  Multiple Constraints
Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints
Dohyeong Kim
Kyungjae Lee
Songhwai Oh
25
10
0
26 Jan 2023
Quasi-optimal Reinforcement Learning with Continuous Actions
Quasi-optimal Reinforcement Learning with Continuous Actions
Yuhan Li
Wenzhuo Zhou
Ruoqing Zhu
OffRL
32
5
0
21 Jan 2023
An Analysis of Quantile Temporal-Difference Learning
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland
Rémi Munos
M. G. Azar
Yunhao Tang
Georg Ostrovski
Anna Harutyunyan
K. Tuyls
Marc G. Bellemare
Will Dabney
24
23
0
11 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
34
0
0
03 Jan 2023
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
39
8
0
30 Dec 2022
Invariance to Quantile Selection in Distributional Continuous Control
Invariance to Quantile Selection in Distributional Continuous Control
Felix Grün
Muhammad Saif-ur-Rehman
Tobias Glasmachers
Ioannis Iossifidis
23
0
0
29 Dec 2022
Reinforcement Learning in System Identification
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
20
0
0
14 Dec 2022
Confidence-Conditioned Value Functions for Offline Reinforcement
  Learning
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
Joey Hong
Aviral Kumar
Sergey Levine
OffRL
39
20
0
08 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce
  Order Fraud Evaluation
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
27
1
0
05 Dec 2022
Flow to Control: Offline Reinforcement Learning with Lossless Primitive
  Discovery
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Yiqin Yang
Haotian Hu
Wenzhe Li
Siyuan Li
Jun Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
28
9
0
02 Dec 2022
Offline Reinforcement Learning with Closed-Form Policy Improvement
  Operators
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu-Xiang Wang
William Yang Wang
OffRL
39
15
0
29 Nov 2022
Quantile Constrained Reinforcement Learning: A Reinforcement Learning
  Framework Constraining Outage Probability
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung
Myungsik Cho
Jongeui Park
Young-Jin Sung
38
4
0
28 Nov 2022
Decision-making with Speculative Opponent Models
Decision-making with Speculative Opponent Models
Jing-rong Sun
Shuo Chen
Cong Zhang
Yining Ma
Jie Zhang
28
1
0
22 Nov 2022
A Survey on Quantum Reinforcement Learning
A Survey on Quantum Reinforcement Learning
Nico Meyer
Christian Ufrecht
Maniraman Periyasamy
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
AI4CE
LRM
47
54
0
07 Nov 2022
Bridging Distributional and Risk-sensitive Reinforcement Learning with
  Provable Regret Bounds
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
Hao Liang
Zhihui Luo
28
14
0
25 Oct 2022
Solving Continuous Control via Q-learning
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
39
22
0
22 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
33
2
0
14 Oct 2022
Distributional Reward Estimation for Effective Multi-Agent Deep
  Reinforcement Learning
Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Jifeng Hu
Yanchao Sun
Hechang Chen
Sili Huang
Haiyin Piao
Yi-Ju Chang
Lichao Sun
28
5
0
14 Oct 2022
Regret Bounds for Risk-Sensitive Reinforcement Learning
Regret Bounds for Risk-Sensitive Reinforcement Learning
Osbert Bastani
Y. Ma
E. Shen
Wei Xu
44
18
0
11 Oct 2022
Elastic Step DQN: A novel multi-step algorithm to alleviate
  overestimation in Deep QNetworks
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks
Adrian Ly
Richard Dazeley
Peter Vamplew
Francisco Cruz
Sunil Aryal
18
8
0
07 Oct 2022
Accelerating Laboratory Automation Through Robot Skill Learning For
  Sample Scraping
Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping
Gabriella Pizzuto
Hetong Wang
Hatem Fakhruldeen
Bei Peng
K. Luck
Andrew I. Cooper
28
2
0
29 Sep 2022
MAN: Multi-Action Networks Learning
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Sample-based Uncertainty Quantification with a Single Deterministic
  Neural Network
Sample-based Uncertainty Quantification with a Single Deterministic Neural Network
T. Kanazawa
Chetan Gupta
UQCV
37
4
0
17 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with
  Linear Reward Shaping
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
45
10
0
15 Sep 2022
Risk-aware linear bandits with convex loss
Risk-aware linear bandits with convex loss
Patrick Saux
Odalric-Ambrym Maillard
27
2
0
15 Sep 2022
DEQGAN: Learning the Loss Function for PINNs with Generative Adversarial
  Networks
DEQGAN: Learning the Loss Function for PINNs with Generative Adversarial Networks
Blake Bullwinkel
Dylan Randle
P. Protopapas
David Sondak
26
3
0
15 Sep 2022
Cell-Free Latent Go-Explore
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandrea
24
1
0
31 Aug 2022
Normality-Guided Distributional Reinforcement Learning for Continuous
  Control
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
16
0
0
28 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement
  Learning: A Systematic Review
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
30
73
0
25 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement
  learning algorithm
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
A Risk-Sensitive Approach to Policy Optimization
A Risk-Sensitive Approach to Policy Optimization
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
34
6
0
19 Aug 2022
A Review of Uncertainty for Deep Reinforcement Learning
A Review of Uncertainty for Deep Reinforcement Learning
Owen Lockwood
Mei Si
22
38
0
18 Aug 2022
Quantitative Universal Approximation Bounds for Deep Belief Networks
Quantitative Universal Approximation Bounds for Deep Belief Networks
J. Sieber
Johann Gehringer
30
1
0
18 Aug 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous
  Control
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
34
4
0
27 Jul 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting
  Uncertain Outcomes
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
31
8
0
27 Jul 2022
QuaDUE-CCM: Interpretable Distributional Reinforcement Learning using
  Uncertain Contraction Metrics for Precise Quadrotor Trajectory Tracking
QuaDUE-CCM: Interpretable Distributional Reinforcement Learning using Uncertain Contraction Metrics for Precise Quadrotor Trajectory Tracking
Yanran Wang
James O’Keeffe
Qiuchen Qian
David E. Boyle
33
8
0
15 Jul 2022
The Nature of Temporal Difference Errors in Multi-step Distributional
  Reinforcement Learning
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
32
11
0
15 Jul 2022
Deep Hedging: Continuous Reinforcement Learning for Hedging of General
  Portfolios across Multiple Risk Aversions
Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions
Phillip Murray
Ben Wood
Hans Buehler
Magnus Wiese
Mikko S. Pakkanen
38
19
0
15 Jul 2022
The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks
  and Environmental Factors without Precise Reward Functions
The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions
Mingyu Kim
Ji-Yun Oh
Yongsik Lee
Joonkee Kim
S. Kim
Song Chong
Se-Young Yun
21
2
0
05 Jul 2022
Risk Perspective Exploration in Distributional Reinforcement Learning
Risk Perspective Exploration in Distributional Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Se-Young Yun
14
5
0
28 Jun 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided
  Stochastic Exploration
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
13
8
0
20 Jun 2022
Previous
123456789
Next