ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.15422
16
65

Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)

25 November 2021
Peter Vamplew
Benjamin J. Smith
Johan Källström
G. Ramos
Roxana Rădulescu
D. Roijers
Conor F. Hayes
Fredrik Heintz
Patrick Mannion
Pieter J. K. Libin
Richard Dazeley
Cameron Foale
    LRM
ArXivPDFHTML
Abstract

The recent paper `"Reward is Enough" by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all intelligence, both natural and artificial. We contest the underlying assumption of Silver et al. that such reward can be scalar-valued. In this paper we explain why scalar rewards are insufficient to account for some aspects of both biological and computational intelligence, and argue in favour of explicitly multi-objective models of reward maximisation. Furthermore, we contend that even if scalar reward functions can trigger intelligent behaviour in specific cases, it is still undesirable to use this approach for the development of artificial general intelligence due to unacceptable risks of unsafe or unethical behaviour.

View on arXiv
Comments on this paper