ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.12436
  4. Cited By
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
v1v2 (latest)

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

19 December 2023
Chaoyou Fu
Renrui Zhang
Zihan Wang
Yubo Huang
Zhengye Zhang
Longtian Qiu
Gaoxiang Ye
Yunhang Shen
Mengdan Zhang
Peixian Chen
Sirui Zhao
Shaohui Lin
Deqiang Jiang
Di Yin
Peng Gao
Ke Li
Hongsheng Li
Xing Sun
    LRMVLMMLLM
ArXiv (abs)PDFHTMLGithub (15342★)

Papers citing "A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise"

3 / 3 papers shown
Title
TurtleBench: A Visual Programming Benchmark in Turtle Geometry
TurtleBench: A Visual Programming Benchmark in Turtle Geometry
Sina Rismanchian
Yasaman Razeghi
Sameer Singh
Shayan Doroudi
115
2
0
31 Oct 2024
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Yi-Fan Zhang
Huanyu Zhang
Haochen Tian
Chaoyou Fu
Shuangqing Zhang
...
Qingsong Wen
Zhang Zhang
Liwen Wang
Rong Jin
Tieniu Tan
OffRL
108
49
0
23 Aug 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Hongsheng Li
Yu Qiao
Peng Gao
MLLM
191
113
0
08 Feb 2024
1