ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06887
  4. Cited By
A Distributional Perspective on Reinforcement Learning

A Distributional Perspective on Reinforcement Learning

21 July 2017
Marc G. Bellemare
Will Dabney
Rémi Munos
    OffRL
ArXivPDFHTML

Papers citing "A Distributional Perspective on Reinforcement Learning"

50 / 257 papers shown
Title
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Simo Alami C.
Rim Kaddah
Jesse Read
Marie-Paule Cani
51
0
0
07 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
24
0
0
29 Apr 2025
ReLU integral probability metric and its applications
ReLU integral probability metric and its applications
Yuha Park
Kunwoong Kim
Insung Kong
Yongdai Kim
48
0
0
26 Apr 2025
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Rakesh Nadig
Vamanan Arulchelvan
Rahul Bera
Taha Shahroodi
Gagandeep Singh
Mohammad Sadrosadati
Jisung Park
O. Mutlu
Onur Mutlu
68
0
0
26 Mar 2025
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
Zelei Cheng
Xin-Qiang Cai
Yuting Tang
Pushi Zhang
Boming Yang
Masashi Sugiyama
Xinyu Xing
49
0
0
10 Mar 2025
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Meng Feng
Viraj Parimi
B. Williams
77
1
0
25 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
IGN : Implicit Generative Networks
IGN : Implicit Generative Networks
Haozheng Luo
Tianyi Wu
Feiyu Han
Zhijun Yan
OffRL
34
1
0
24 Feb 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
39
0
0
22 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
44
0
0
08 Jan 2025
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Mehrdad Moghimi
Hyejin Ku
OffRL
43
0
0
03 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
41
0
0
27 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
142
2
0
02 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
50
3
0
01 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
62
7
0
19 Sep 2024
Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
Asen Nachkov
Danda Pani Paudel
Luc Van Gool
34
0
0
12 Sep 2024
Foundations of Multivariate Distributional Reinforcement Learning
Foundations of Multivariate Distributional Reinforcement Learning
Harley Wiltzer
Jesse Farebrother
Arthur Gretton
Mark Rowland
OffRL
43
2
0
31 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
95
3
0
20 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless
  Navigation of Terrestrial Mobile Robots
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
32
3
0
11 Aug 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho
Seung Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
72
0
0
31 Jul 2024
Functional Acceleration for Policy Mirror Descent
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
30
0
0
23 Jul 2024
Three Dogmas of Reinforcement Learning
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
Anna Harutyunyan
38
5
0
15 Jul 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
49
1
0
15 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
46
1
0
11 Jun 2024
Is Value Functions Estimation with Classification Plug-and-play for
  Offline Reinforcement Learning?
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
Denis Tarasov
Kirill Brilliantov
Dmitrii Kharlapenko
OffRL
37
2
0
10 Jun 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
34
1
0
29 May 2024
Bigger, Regularized, Optimistic: scaling for compute and
  sample-efficient continuous control
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Milo's
Marek Cygan
OffRL
47
16
0
25 May 2024
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
40
2
0
23 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement
  Learning
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer
  Crashes
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
19
6
0
07 May 2024
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
David Valencia
Henry Williams
Trevor Gee
Bruce A MacDonaland
Minas V. Liarokapis
Minas Liarokapis
OffRL
37
2
0
04 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep
  Reinforcement Learning
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
45
5
0
12 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
42
3
0
09 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with
  General Function Approximation
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
42
3
0
28 Feb 2024
Off-policy Distributional Q($λ$): Distributional RL without
  Importance Sampling
Off-policy Distributional Q(λλλ): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
15
1
0
08 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
12
0
08 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice
  via HyperAgent
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
26
5
0
05 Feb 2024
Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV)
  Trajectory Design for 3D UAV Tracking
Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking
Yujiao Zhu
Ming Chen
Sihua Wang
Ye Hu
Yuchen Liu
Changchuan Yin
13
5
0
22 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
35
1
0
17 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
dRG-MEC: Decentralized Reinforced Green Offloading for MEC-enabled Cloud
  Network
dRG-MEC: Decentralized Reinforced Green Offloading for MEC-enabled Cloud Network
Asad Aftab
Semeen Rehman
14
1
0
10 Jan 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Optimal simulation-based Bayesian decisions
Optimal simulation-based Bayesian decisions
Justin Alsing
Thomas D. P. Edwards
Benjamin Dan Wandelt
33
1
0
09 Nov 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
35
1
0
30 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
32
127
0
25 Oct 2023
Boosting Continuous Control with Consistency Policy
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional
  Reinforcement Learning
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
30
9
0
25 Sep 2023
123456
Next