ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.14660
7
0

EmoGist: Efficient In-Context Learning for Visual Emotion Understanding

20 May 2025
Ronald Seoh
Dan Goldwasser
    VLM
ArXivPDFHTML
Abstract

In this paper, we introduce EmoGist, a training-free, in-context learning method for performing visual emotion classification with LVLMs. The key intuition of our approach is that context-dependent definition of emotion labels could allow more accurate predictions of emotions, as the ways in which emotions manifest within images are highly context dependent and nuanced. EmoGist pre-generates multiple explanations of emotion labels, by analyzing the clusters of example images belonging to each category. At test time, we retrieve a version of explanation based on embedding similarity, and feed it to a fast VLM for classification. Through our experiments, we show that EmoGist allows up to 13 points improvement in micro F1 scores with the multi-label Memotion dataset, and up to 8 points in macro F1 in the multi-class FI dataset.

View on arXiv
@article{seoh2025_2505.14660,
  title={ EmoGist: Efficient In-Context Learning for Visual Emotion Understanding },
  author={ Ronald Seoh and Dan Goldwasser },
  journal={arXiv preprint arXiv:2505.14660},
  year={ 2025 }
}
Comments on this paper