92
0

UI-Evol: Automatic Knowledge Evolving for Computer Use Agents

Main:8 Pages
3 Figures
Bibliography:3 Pages
5 Tables
Appendix:4 Pages
Abstract

External knowledge has played a crucial role in the recent development of computer use agents. We identify a critical knowledge-execution gap: retrieved knowledge often fails to translate into effective real-world task execution. Our analysis shows even 90\% correct knowledge yields only 41\% execution success rate. To bridge this gap, we propose UI-Evol, a plug-and-play module for autonomous GUI knowledge evolution. UI-Evol consists of two stages: a Retrace Stage that extracts faithful objective action sequences from actual agent-environment interactions, and a Critique Stage that refines existing knowledge by comparing these sequences against external references. We conduct comprehensive experiments on the OSWorld benchmark with the state-of-the-art Agent S2. Our results demonstrate that UI-Evol not only significantly boosts task performance but also addresses a previously overlooked issue of high behavioral standard deviation in computer use agents, leading to superior performance on computer use tasks and substantially improved agent reliability.

View on arXiv
@article{zhang2025_2505.21964,
  title={ UI-Evol: Automatic Knowledge Evolving for Computer Use Agents },
  author={ Ziyun Zhang and Xinyi Liu and Xiaoyi Zhang and Jun Wang and Gang Chen and Yan Lu },
  journal={arXiv preprint arXiv:2505.21964},
  year={ 2025 }
}
Comments on this paper