ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.10044
  4. Cited By
Distributional Reinforcement Learning with Quantile Regression

Distributional Reinforcement Learning with Quantile Regression

27 October 2017
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
ArXivPDFHTML

Papers citing "Distributional Reinforcement Learning with Quantile Regression"

50 / 401 papers shown
Title
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Simo Alami C.
Rim Kaddah
Jesse Read
Marie-Paule Cani
53
0
0
07 May 2025
Unraveling the Rainbow: can value-based methods schedule?
Unraveling the Rainbow: can value-based methods schedule?
Arthur Corrêa
Alexandre Jesus
Cristóvão Silva
Samuel Moniz
OffRL
40
0
0
06 May 2025
Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
Rohit Dhakate
Christian Brommer
C. Böhm
Stephan Weiss
J. Steinbrener
36
5
0
22 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
41
0
0
14 Apr 2025
A Champion-level Vision-based Reinforcement Learning Agent for Competitive Racing in Gran Turismo 7
A Champion-level Vision-based Reinforcement Learning Agent for Competitive Racing in Gran Turismo 7
Hojoon Lee
Takuma Seno
Jun Jet Tai
K. Subramanian
Kenta Kawamoto
Peter Stone
Peter R. Wurman
34
0
0
12 Apr 2025
Offline and Distributional Reinforcement Learning for Wireless Communications
Offline and Distributional Reinforcement Learning for Wireless Communications
Eslam Eldeeb
Hirley Alves
OffRL
29
0
0
04 Apr 2025
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
Zelei Cheng
Xin-Qiang Cai
Yuting Tang
Pushi Zhang
Boming Yang
Masashi Sugiyama
Xinyu Xing
51
0
0
10 Mar 2025
Tractable Representations for Convergent Approximation of Distributional HJB Equations
Julie Alhosh
Harley Wiltzer
David Meger
36
0
0
07 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
38
0
0
02 Mar 2025
RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning
RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning
Adib Karimi
Mohammad Mehdi Ebadzadeh
OOD
50
0
0
27 Feb 2025
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Lei Zhao
Lin Cai
Wu-Sheng Lu
52
0
0
25 Feb 2025
IGN : Implicit Generative Networks
IGN : Implicit Generative Networks
Haozheng Luo
Tianyi Wu
Feiyu Han
Zhijun Yan
OffRL
37
1
0
24 Feb 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
41
0
0
22 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
43
0
0
21 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
44
0
0
08 Jan 2025
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Mehrdad Moghimi
Hyejin Ku
OffRL
48
0
0
03 Jan 2025
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting
Hemant Kumawat
Saibal Mukhopadhyay
82
1
0
19 Dec 2024
Hedging and Pricing Structured Products Featuring Multiple Underlying
  Assets
Hedging and Pricing Structured Products Featuring Multiple Underlying Assets
Anil Sharma
Freeman Chen
Jaesun Noh
Julio DeJesus
Mario Schlener
24
1
0
02 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning
  for Robustness under Diverse Data Corruptions
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAML
OffRL
53
1
0
01 Nov 2024
Q-learning for Quantile MDPs: A Decomposition, Performance, and
  Convergence Analysis
Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis
J. Hau
Erick Delage
Esther Derman
Mohammad Ghavamzadeh
Marek Petrik
31
1
0
31 Oct 2024
Action Gaps and Advantages in Continuous-Time Distributional
  Reinforcement Learning
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
Harley Wiltzer
Marc G. Bellemare
David Meger
Patrick Shafto
Yash Jhaveri
34
1
0
14 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement
  Learning
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
78
8
0
13 Oct 2024
Offline and Distributional Reinforcement Learning for Radio Resource Management
Offline and Distributional Reinforcement Learning for Radio Resource Management
Eslam Eldeeb
Hirley Alves
OffRL
33
2
0
25 Sep 2024
Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection
Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection
Matteo Zecchin
Sangwoo Park
Osvaldo Simeone
LM&MA
79
3
0
24 Sep 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
36
2
0
18 Sep 2024
Quantile Regression for Distributional Reward Models in RLHF
Quantile Regression for Distributional Reward Models in RLHF
Nicolai Dorka
37
17
0
16 Sep 2024
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling
Jesse van Remmerden
Z. Bukhsh
Yingqian Zhang
OffRL
OnRL
50
1
0
16 Sep 2024
Foundations of Multivariate Distributional Reinforcement Learning
Foundations of Multivariate Distributional Reinforcement Learning
Harley Wiltzer
Jesse Farebrother
Arthur Gretton
Mark Rowland
OffRL
43
2
0
31 Aug 2024
Generative Bayesian Computation for Maximum Expected Utility
Generative Bayesian Computation for Maximum Expected Utility
Nick Polson
Fabrizio Ruggeri
Vadim Sokolov
31
1
0
28 Aug 2024
Improving Thompson Sampling via Information Relaxation for Budgeted
  Multi-armed Bandits
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits
Woojin Jeong
Seungki Min
60
0
0
28 Aug 2024
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional
  Reinforcement Learning
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning
Parvin Malekzadeh
Zissis Poulos
Jacky Chen
Zeyu Wang
Konstantinos N. Plataniotis
36
1
0
22 Aug 2024
Lifelong Reinforcement Learning via Neuromodulation
Lifelong Reinforcement Learning via Neuromodulation
Sebastian Lee
Samuel Liebana Garcia
Claudia Clopath
Will Dabney
49
0
0
15 Aug 2024
Off-Policy Reinforcement Learning with High Dimensional Reward
Off-Policy Reinforcement Learning with High Dimensional Reward
Dong Neuck Lee
Michael R. Kosorok
OffRL
21
1
0
14 Aug 2024
Robust Deep Reinforcement Learning for Inverter-based Volt-Var Control
  in Partially Observable Distribution Networks
Robust Deep Reinforcement Learning for Inverter-based Volt-Var Control in Partially Observable Distribution Networks
Qiong Liu
Ye Guo
Tong Xu
OffRL
31
0
0
13 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
52
0
0
05 Aug 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho
Seung Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
72
0
0
31 Jul 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
On Policy Evaluation Algorithms in Distributional Reinforcement Learning
On Policy Evaluation Algorithms in Distributional Reinforcement Learning
Julian Gerstenberg
Ralph Neininger
Denis Spiegel
OffRL
23
0
0
19 Jul 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy
  Gradient Methods
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
25
0
0
18 Jul 2024
Conditional Quantile Estimation for Uncertain Watch Time in Short-Video Recommendation
Conditional Quantile Estimation for Uncertain Watch Time in Short-Video Recommendation
Chengzhi Lin
Shuchang Liu
Chuyuan Wang
Yongqi Liu
27
4
0
17 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
34
0
0
08 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
23
2
0
30 Jun 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
58
3
0
29 Jun 2024
A Super-human Vision-based Reinforcement Learning Agent for Autonomous
  Racing in Gran Turismo
A Super-human Vision-based Reinforcement Learning Agent for Autonomous Racing in Gran Turismo
Miguel Vasco
Takuma Seno
Kenta Kawamoto
K. Subramanian
Peter R. Wurman
Peter Stone
56
6
0
18 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
32
0
0
11 Jun 2024
Simplification of Risk Averse POMDPs with Performance Guarantees
Simplification of Risk Averse POMDPs with Performance Guarantees
Yaacov Pariente
Vadim Indelman
32
0
0
05 Jun 2024
Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and
  Posterior Value Optimisation in Finite-State MDPs
Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs
Filippo Valdettaro
A. Aldo Faisal
OffRL
40
0
0
04 Jun 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
36
1
0
29 May 2024
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
42
2
0
23 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer
  Crashes
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
22
6
0
07 May 2024
123456789
Next