AssistGPT: A General Multi-modal Assistant that can Plan, Execute,
  Inspect, and Learn


    MLLM

Papers citing "AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn"

LLM With Tools: A Survey
Zhuocheng Shen
85
14
0
24 Sep 2024
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou
117
13
0
14 Jun 2024
