ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.13578
  4. Cited By
Distributional Reinforcement Learning for Multi-Dimensional Reward
  Functions

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

26 October 2021
Pushi Zhang
Xiaoyu Chen
Li Zhao
Wei Xiong
Tao Qin
Tie-Yan Liu
    OffRL
ArXivPDFHTML

Papers citing "Distributional Reinforcement Learning for Multi-Dimensional Reward Functions"

8 / 8 papers shown
Title
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
Zelei Cheng
Xin-Qiang Cai
Yuting Tang
Pushi Zhang
Boming Yang
Masashi Sugiyama
Xinyu Xing
49
0
0
10 Mar 2025
Foundations of Multivariate Distributional Reinforcement Learning
Foundations of Multivariate Distributional Reinforcement Learning
Harley Wiltzer
Jesse Farebrother
Arthur Gretton
Mark Rowland
OffRL
43
2
0
31 Aug 2024
Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games
Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games
Zixuan Wu
Sean Ye
Manisha Natarajan
Matthew C. Gombolay
83
6
0
16 Mar 2024
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
32
1
0
28 May 2023
Distributional Offline Policy Evaluation with Predictive Error
  Guarantees
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
38
13
0
19 Feb 2023
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective
  Reinforcement Learning
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes
Mathieu Reymond
D. Roijers
Enda Howley
Patrick Mannion
24
4
0
23 Nov 2022
Bridging Distributional and Risk-sensitive Reinforcement Learning with
  Provable Regret Bounds
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
Hao Liang
Zhihui Luo
23
14
0
25 Oct 2022
Sample-based Uncertainty Quantification with a Single Deterministic
  Neural Network
Sample-based Uncertainty Quantification with a Single Deterministic Neural Network
T. Kanazawa
Chetan Gupta
UQCV
30
4
0
17 Sep 2022
1