Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.15422
Cited By
Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)
25 November 2021
Peter Vamplew
Benjamin J. Smith
Johan Källström
G. Ramos
Roxana Rădulescu
D. Roijers
Conor F. Hayes
Fredrik Heintz
Patrick Mannion
Pieter J. K. Libin
Richard Dazeley
Cameron Foale
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)"
11 / 11 papers shown
Title
On Generalization Across Environments In Multi-Objective Reinforcement Learning
Jayden Teoh
Pradeep Varakantham
Peter Vamplew
OffRL
34
0
0
02 Mar 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
49
12
0
31 Dec 2024
Towards Aligning Language Models with Textual Feedback
Sauc Abadal Lloret
S. Dhuliawala
K. Murugesan
Mrinmaya Sachan
VLM
40
1
0
24 Jul 2024
Learning Roles with Emergent Social Value Orientations
Wenhao Li
Xiangfeng Wang
Bo Jin
J. Lu
H. Zha
18
3
0
31 Jan 2023
Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking
Dennis Gross
T. D. Simão
N. Jansen
G. Pérez
AAML
43
2
0
10 Dec 2022
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes
Mathieu Reymond
D. Roijers
Enda Howley
Patrick Mannion
19
4
0
23 Nov 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
16
10
0
02 Oct 2022
Multi-Objective Coordination Graphs for the Expected Scalarised Returns with Generative Flow Models
Conor F. Hayes
T. Verstraeten
D. Roijers
Enda Howley
Patrick Mannion
24
3
0
01 Jul 2022
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey
Richard Dazeley
Peter Vamplew
Francisco Cruz
32
59
0
20 Aug 2021
Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making
Conor F. Hayes
T. Verstraeten
D. Roijers
Enda Howley
Patrick Mannion
13
14
0
02 Jun 2021
1