ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.14205
10
0

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

17 June 2025
Jingxu Xie
Dylan Xu
Xuandong Zhao
Dawn Song
ArXiv (abs)PDFHTML
Main:11 Pages
7 Figures
Bibliography:3 Pages
17 Tables
Appendix:14 Pages
Abstract

We introduce AgentSynth, a scalable and cost-efficient pipeline for automatically synthesizing high-quality tasks and trajectory datasets for generalist computer-use agents. Leveraging information asymmetry, AgentSynth constructs subtasks that are simple during generation but significantly more challenging when composed into long-horizon tasks, enabling the creation of over 6,000 diverse and realistic tasks. Our pipeline begins with an LLM-based task proposer guided by a persona, followed by an execution agent that completes the task and logs the trajectory. This process is repeated iteratively to form a sequence of subtasks, which are then summarized by a separate agent into a composite task of controllable difficulty. A key strength of AgentSynth is its ability to precisely modulate task complexity by varying the number of subtasks. Empirical evaluations show that state-of-the-art LLM agents suffer a steep performance drop, from 18% success at difficulty level 1 to just 4% at level 6, highlighting the benchmark's difficulty and discriminative power. Moreover, our pipeline achieves a low average cost of \0.60pertrajectory,ordersofmagnitudecheaperthanhumanannotations.OurcodeanddataarepubliclyavailableatthishttpsURL0.60 per trajectory, orders of magnitude cheaper than human annotations. Our code and data are publicly available atthis https URL0.60pertrajectory,ordersofmagnitudecheaperthanhumanannotations.OurcodeanddataarepubliclyavailableatthishttpsURL

View on arXiv
@article{xie2025_2506.14205,
  title={ AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents },
  author={ Jingxu Xie and Dylan Xu and Xuandong Zhao and Dawn Song },
  journal={arXiv preprint arXiv:2506.14205},
  year={ 2025 }
}
Comments on this paper