Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.04575
Cited By
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
8 January 2025
Yunxing Liu
Pengxiang Li
Zishu Wei
C. Xie
Xueyu Hu
Xinchen Xu
Shengyu Zhang
Xiaotian Han
Hongxia Yang
Fei Wu
LLMAG
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection"
7 / 7 papers shown
Title
EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation
Biao Yi
Xavier Hu
Yexin Chen
Shengyu Zhang
Hongxia Yang
Fan Wu
Fei Wu
LLMAG
223
0
0
08 May 2025
Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
Guanghao Zhou
Panjia Qiu
Chong Chen
Jie Wang
Zheming Yang
Jian Xu
Minghui Qiu
OffRL
LRM
58
1
0
30 Apr 2025
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Yuhang Liu
Pengxiang Li
C. Xie
Xavier Hu
Xiaotian Han
Shengyu Zhang
Hongxia Yang
Fei Wu
LLMAG
LM&Ro
LRM
AI4CE
72
3
0
19 Apr 2025
Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up
Ziming Cheng
Zhiyuan Huang
Junting Pan
Zhaohui Hou
Mingjie Zhan
45
0
0
31 Mar 2025
OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents
Pengzhou Cheng
Zheng Wu
Zongru Wu
Aston Zhang
Zhuosheng Zhang
Gongshen Liu
LLMAG
58
1
0
26 Feb 2025
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
C. Xie
Shuo Cai
Wenjun Wang
Pengxiang Li
Zhijie Sang
...
Xiaotian Han
Jianbo Yuan
Shengyu Zhang
Fei Wu
Hongxia Yang
LRM
51
1
0
17 Feb 2025
AppVLM: A Lightweight Vision Language Model for Online App Control
Georgios Papoudakis
Thomas Coste
Zhihao Wu
Jianye Hao
Jun Wang
Kun Shao
57
2
0
10 Feb 2025
1