ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07803
  4. Cited By
Dynamic Weights in Multi-Objective Deep Reinforcement Learning

Dynamic Weights in Multi-Objective Deep Reinforcement Learning

20 September 2018
Axel Abels
D. Roijers
Tom Lenaerts
A. Nowé
Denis Steckelmacher
    OffRL
ArXivPDFHTML

Papers citing "Dynamic Weights in Multi-Objective Deep Reinforcement Learning"

19 / 19 papers shown
Title
Constructing an Optimal Behavior Basis for the Option Keyboard
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
31
0
0
01 May 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
70
2
0
03 Jan 2025
How to Find the Exact Pareto Front for Multi-Objective MDPs?
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li
Peizhong Ju
Ness B. Shroff
238
0
0
21 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
48
1
0
03 Oct 2024
UCB-driven Utility Function Search for Multi-objective Reinforcement
  Learning
UCB-driven Utility Function Search for Multi-objective Reinforcement Learning
Yucheng Shi
Alexandros Agapitos
David Lynch
Giorgio Cruciata
Cengis Hasan
Hao Wang
Yayu Yao
Aleksandar Milenovic
44
0
0
01 May 2024
Unsupervised Discovery of Continuous Skills on a Sphere
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep
  Reinforcement Learning
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
34
0
0
15 Mar 2023
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective
  Reinforcement Learning
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes
Mathieu Reymond
D. Roijers
Enda Howley
Patrick Mannion
26
4
0
23 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
36
24
0
14 Nov 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
36
2
0
20 Oct 2022
Regularized Soft Actor-Critic for Behavior Transfer Learning
Regularized Soft Actor-Critic for Behavior Transfer Learning
Mingxi Tan
Andong Tian
Ludovic Denoyer
20
3
0
27 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary
  Reinforcement Learning towards Asynchronous Commercial Games
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
36
5
0
21 Sep 2022
Socially Fair Reinforcement Learning
Socially Fair Reinforcement Learning
Debmalya Mandal
Jiarui Gan
OffRL
30
13
0
26 Aug 2022
Exploring the Pareto front of multi-objective COVID-19 mitigation
  policies using reinforcement learning
Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning
Mathieu Reymond
Conor F. Hayes
L. Willem
Roxana Rădulescu
S. Abrams
...
Enda Howley
Patrick Mannion
N. Hens
Ann Nowé
Pieter J. K. Libin
24
8
0
11 Apr 2022
Scalar reward is not enough: A response to Silver, Singh, Precup and
  Sutton (2021)
Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)
Peter Vamplew
Benjamin J. Smith
Johan Källström
G. Ramos
Roxana Rădulescu
...
Fredrik Heintz
Patrick Mannion
Pieter J. K. Libin
Richard Dazeley
Cameron Foale
LRM
34
66
0
25 Nov 2021
Multi-Objective Graph Heuristic Search for Terrestrial Robot Design
Multi-Objective Graph Heuristic Search for Terrestrial Robot Design
Jie Xu
Andrew Spielberg
Allan Zhao
Daniela Rus
Wojciech Matusik
69
38
0
13 Jul 2021
Levels of explainable artificial intelligence for human-aligned
  conversational explanations
Levels of explainable artificial intelligence for human-aligned conversational explanations
Richard Dazeley
Peter Vamplew
Cameron Foale
Charlotte Young
Sunil Aryal
F. Cruz
30
90
0
07 Jul 2021
A Distributional View on Multi-Objective Policy Optimization
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
A Multi-Objective Deep Reinforcement Learning Framework
A Multi-Objective Deep Reinforcement Learning Framework
Thanh Thi Nguyen
Ngoc Duy Nguyen
Peter Vamplew
Saeid Nahavandi
Richard Dazeley
Chee Peng Lim
OffRL
15
108
0
08 Mar 2018
1