Distributional Reinforcement Learning with Quantile Regression

27 October 2017
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos

Papers citing "Distributional Reinforcement Learning with Quantile Regression"

50 / 401 papers shown
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
52
1
0
07 May 2024
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
David Valencia
Henry Williams
Trevor Gee
Bruce A. MacDonald
Minas V. Liarokapis
OffRL
40
2
0
04 May 2024
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett
Nathan Kallus
M. Oprescu
Wen Sun
Kaiwen Wang
AAML
OffRL
55
1
0
29 Mar 2024
Safe and Robust Reinforcement Learning: Principles and Practice
Taku Yamagata
Raúl Santos-Rodríguez
OffRL
45
2
0
27 Mar 2024
Uncertainty-aware Distributional Offline Reinforcement Learning
Xiaocong Chen
Siyu Wang
Tong Yu
Lina Yao
OffRL
38
1
0
26 Mar 2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Yudong Luo
Yangchen Pan
Han Wang
Philip Torr
Pascal Poupart
47
3
0
17 Mar 2024
Neural-Kernel Conditional Mean Embeddings
Eiki Shimizu
Kenji Fukumizu
Dino Sejdinovic
43
3
0
16 Mar 2024
Identifying Optimal Launch Sites of High-Altitude Latex-Balloons using Bayesian Optimisation for the Task of Station-Keeping
Jack D. Saunders
Sajad Saeedi
Adam Hartshorne
Binbin Xu
Özgür Simsek
Alan Hunter
Wenbin Li
24
0
0
16 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
45
1
0
06 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
42
0
0
01 Mar 2024
Investigating the Histogram Loss in Regression
Ehsan Imani
Kai Luedemann
Sam Scholnick-Hughes
Esraa Elelimy
Martha White
UQCV
42
5
0
20 Feb 2024
A Distributional Analogue to the Successor Representation
Harley Wiltzer
Jesse Farebrother
Arthur Gretton
Yunhao Tang
André Barreto
Will Dabney
Marc G. Bellemare
Mark Rowland
48
5
0
13 Feb 2024
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins
Eslam Eldeeb
Houssem Sifaou
Osvaldo Simeone
M. Shehab
Hirley Alves
OffRL
50
3
0
13 Feb 2024
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
Mark Rowland
Wenliang Kevin Li
Rémi Munos
Clare Lyle
Yunhao Tang
Will Dabney
OOD
OffRL
30
1
0
12 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
88
12
0
11 Feb 2024
Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement Learning
Alex C. Stutts
Danilo Erricolo
Theja Tulabandhula
A. R. Trivedi
EDL
UQCV
46
0
0
11 Feb 2024
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen
Oier Mees
Aviral Kumar
Sergey Levine
VLM
LM&Ro
57
24
0
05 Feb 2024
Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking
Yujiao Zhu
Ming Chen
Sihua Wang
Ye Hu
Yuchen Liu
Changchuan Yin
18
5
0
22 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
45
1
0
17 Jan 2024
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
UD
25
2
0
05 Jan 2024
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning
Parvin Malekzadeh
Konstantinos N. Plataniotis
Zissis Poulos
Zeyu Wang
37
2
0
04 Jan 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
37
6
0
19 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
Risk-Aware Continuous Control with Neural Contextual Bandits
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Pérez Costa
21
3
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning
Wei Geng
Baidi Xiao
Rongpeng Li
Ning Wei
Dong Wang
Zhifeng Zhao
25
1
0
12 Dec 2023
Distributional Bellman Operators over Mean Embeddings
Wenliang Kevin Li
Grégoire Delétang
Matthew Aitchison
Marcus Hutter
Anian Ruoss
Arthur Gretton
Mark Rowland
OffRL
15
4
0
09 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLM
OffRL
OnRL
41
6
0
06 Dec 2023
Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series
Ying Liu
Peng Cui
Wenbo Hu
Richang Hong
DiffM
BDL
AI4TS
22
1
0
03 Dec 2023
Learning to Simulate: Generative Metamodeling via Quantile Regression
L. Hong
Yanxi Hou
Qingkai Zhang
Xiaowei Zhang
30
1
0
29 Nov 2023
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
18
0
0
29 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
19
2
0
17 Nov 2023
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OOD
OffRL
AI4CE
36
1
0
13 Nov 2023
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning
Junmin Zhong
Ruofan Wu
Jennie Si
OffRL
27
1
0
07 Nov 2023
RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization
Siqi Shen
Chennan Ma
Chao Li
Weiquan Liu
Yongquan Fu
Songzhu Mei
Xinwang Liu
Cheng-Yu Wang
23
10
0
03 Nov 2023
Beyond Average Return in Markov Decision Processes
Alexandre Marthe
Aurélien Garivier
Claire Vernade
38
5
0
31 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion
Taehyun Cho
Seung Han
Heesoo Lee
Kyungjae Lee
Jungwoo Lee
35
3
0
25 Oct 2023
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
54
4
0
20 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
46
15
0
19 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
98
12
0
12 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODD
OffRL
27
2
0
09 Oct 2023
DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption
Yupeng Wu
Wenjie Huang
Chin Pang Ho
OffRL
32
0
0
08 Oct 2023
Reward Dropout Improves Control: Bi-objective Perspective on Reinforced LM
Changhun Lee
Chiehyeon Lim
34
0
0
06 Oct 2023
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
46
4
0
05 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
36
14
0
05 Oct 2023
Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning
Lev Grossman
Brian Plancher
OffRL
17
0
0
03 Oct 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
20
3
0
30 Sep 2023
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang
Yang Peng
Jiadong Liang
Wenhao Yang
Zhihua Zhang
OffRL
39
1
0
29 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
37
9
0
25 Sep 2023