arXiv:2307.00855

Review of Large Vision Models and Visual Prompt Engineering

3 July 2023
Jiaqi Wang
Zheng Liu
Lin Zhao
Zihao Wu
Chong Ma
Sigang Yu
Haixing Dai
Qiushi Yang
Yi-Hsueh Liu
Songyao Zhang
Enze Shi
Yi Pan
Tuo Zhang
Dajiang Zhu
Xiang Li
Xi Jiang
Bao Ge
Yixuan Yuan
Dinggang Shen
Tianming Liu
Shu Zhang
    VLM
    LRM
Abstract

Visual prompt engineering is a fundamental technology in the field of visual and image Artificial General Intelligence and a key component for achieving zero-shot capabilities. As large vision models develop, the importance of prompt engineering becomes increasingly evident, and designing suitable prompts for specific visual tasks has emerged as a meaningful research direction. This review summarizes the methods used in computer vision for large vision models and visual prompt engineering, and explores the latest advances in visual prompt engineering. We present influential large models in the visual domain and a range of prompt engineering methods applied to them. We hope this review provides a comprehensive and systematic description of prompt engineering methods based on large vision models and offers valuable insights for future researchers exploring this field.
