ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.05949
31
104

Evaluating the Social Impact of Generative AI Systems in Systems and Society

9 June 2023
Irene Solaiman
Zeerak Talat
William Agnew
Lama Ahmad
Dylan K. Baker
Su Lin Blodgett
Canyu Chen
Hal Daumé
Jesse Dodge
Isabella Duan
Ellie Evans
Felix Friedrich
Avijit Ghosh
Usman Gohar
Sara Hooker
Yacine Jernite
Ria Kalluri
Alberto Lusoli
Alina Leidinger
Michelle Lin
Xiuzhu Lin
A. Luccioni
Jennifer Mickel
Margaret Mitchell
Jessica Newman
Apostol T. Vassilev
Marie-Therese Png
Shubham Singh
A. Strait
Lukas Struppek
Arjun Subramonian
    ELM
    EGVM
ArXivPDFHTML
Abstract

Generative AI systems across modalities, ranging from text (including code), image, audio, and video, have broad social impacts, but there is no official standard for means of evaluating those impacts or for which impacts should be evaluated. In this paper, we present a guide that moves toward a standard approach in evaluating a base generative AI system for any modality in two overarching categories: what can be evaluated in a base system independent of context and what can be evaluated in a societal context. Importantly, this refers to base systems that have no predetermined application or deployment context, including a model itself, as well as system components, such as training data. Our framework for a base system defines seven categories of social impact: bias, stereotypes, and representational harms; cultural values and sensitive content; disparate performance; privacy and data protection; financial costs; environmental costs; and data and content moderation labor costs. Suggested methods for evaluation apply to listed generative modalities and analyses of the limitations of existing evaluations serve as a starting point for necessary investment in future evaluations. We offer five overarching categories for what can be evaluated in a broader societal context, each with its own subcategories: trustworthiness and autonomy; inequality, marginalization, and violence; concentration of authority; labor and creativity; and ecosystem and environment. Each subcategory includes recommendations for mitigating harm.

View on arXiv
Comments on this paper