2405.20795
InsightSee: Advancing Multi-agent Vision-Language Models for Enhanced Visual Understanding
31 May 2024
Huaxiang Zhang
Yaojia Mu
Guo-Niu Zhu
Zhongxue Gan
Papers citing
"InsightSee: Advancing Multi-agent Vision-Language Models for Enhanced Visual Understanding"
4 / 4 papers shown
Title
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
Gen Luo
Yiyi Zhou
Tianhe Ren
Shen Chen
Xiaoshuai Sun
Rongrong Ji
VLM
MLLM
51
93
0
24 May 2023
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
69
103
0
23 Jul 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
283
3,458
0
29 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
582
9,009
0
28 Jan 2022