ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.00491
  4. Cited By
GalleryGPT: Analyzing Paintings with Large Multimodal Models

GalleryGPT: Analyzing Paintings with Large Multimodal Models

1 August 2024
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
    MLLM
ArXivPDFHTML

Papers citing "GalleryGPT: Analyzing Paintings with Large Multimodal Models"

9 / 9 papers shown
Title
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Tiancheng Gu
Kaicheng Yang
Ziyong Feng
Xingjun Wang
Yanzhao Zhang
Dingkun Long
Yingda Chen
Weidong Cai
Jiankang Deng
VLM
223
2
0
24 Apr 2025
Assesing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
Assesing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
Takaya Arita
Wenxian Zheng
Reiji Suzuki
Fuminori Akiba
26
0
0
17 Apr 2025
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
Zichen Liu
Kunlun Xu
Bing-Huang Su
Xu Zou
Yuxin Peng
Jiahuan Zhou
VLM
AI4TS
71
1
0
20 Mar 2025
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Yubo Wang
Jianting Tang
Chaohu Liu
Linli Xu
AAML
63
1
0
23 Feb 2025
ChartAdapter: Large Vision-Language Model for Chart Summarization
ChartAdapter: Large Vision-Language Model for Chart Summarization
Peixin Xu
Yujuan Ding
Wenqi Fan
30
2
0
31 Dec 2024
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large
  Language Models
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Wenhao Shi
Zhiqiang Hu
Yi Bin
Junhua Liu
Yang Yang
See-Kiong Ng
Lidong Bing
Roy Ka-Wei Lee
SyDa
MLLM
LRM
34
41
0
25 Jun 2024
Contextual Interaction via Primitive-based Adversarial Training For
  Compositional Zero-shot Learning
Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot Learning
Suyi Li
Chenyi Jiang
Shidong Wang
Yang Long
Zheng Zhang
Haofeng Zhang
CoGe
34
0
0
21 Jun 2024
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
372
12,081
0
04 Mar 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
392
4,171
0
28 Jan 2022
1