ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.05368
16
0

Speaking images. A novel framework for the automated self-description of artworks

28 May 2025
Valentine Bernasconi
Gustavo Marfia
    VGen
ArXiv (abs)PDFHTML
Main:12 Pages
4 Figures
Bibliography:4 Pages
1 Tables
Abstract

Recent breakthroughs in generative AI have opened the door to new research perspectives in the domain of art and cultural heritage, where a large number of artifacts have been digitized. There is a need for innovation to ease the access and highlight the content of digital collections. Such innovations develop into creative explorations of the digital image in relation to its malleability and contemporary interpretation, in confrontation to the original historical object. Based on the concept of the autonomous image, we propose a new framework towards the production of self-explaining cultural artifacts using open-source large-language, face detection, text-to-speech and audio-to-animation models. The goal is to start from a digitized artwork and to automatically assemble a short video of the latter where the main character animates to explain its content. The whole process questions cultural biases encapsulated in large-language models, the potential of digital images and deepfakes of artworks for educational purposes, along with concerns of the field of art history regarding such creative diversions.

View on arXiv
@article{bernasconi2025_2506.05368,
  title={ Speaking images. A novel framework for the automated self-description of artworks },
  author={ Valentine Bernasconi and Gustavo Marfia },
  journal={arXiv preprint arXiv:2506.05368},
  year={ 2025 }
}
Comments on this paper