arXiv:2510.16559 (versions: v1, v2, v3; v3 is the latest)

BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction

18 October 2025
Tian Xia
Tianrun Gao
Wenhao Deng
Long Wei
Xiaowei Qian
Yixian Jiang
Chenglei Yu
Tailin Wu
Main: 9 pages
Bibliography: 4 pages
Appendix: 20 pages
12 figures
3 tables
Abstract

Engineering construction automation aims to transform natural language specifications into physically viable structures, requiring complex integrated reasoning under strict physical constraints. While modern LLMs possess broad knowledge and strong reasoning capabilities that make them promising candidates for this domain, their construction competencies remain largely unevaluated. To address this gap, we introduce BuildArena, the first physics-aligned interactive benchmark designed for language-driven engineering construction. It contributes to the community in four aspects: (1) a highly customizable benchmarking framework for in-depth comparison and analysis of LLMs; (2) an extendable task design strategy spanning static and dynamic mechanics across multiple difficulty tiers; (3) a 3D Spatial Geometric Computation Library for supporting construction based on language instructions; and (4) a baseline LLM agentic workflow that effectively evaluates diverse model capabilities. On eight frontier LLMs, BuildArena comprehensively evaluates their capabilities for language-driven and physics-grounded construction automation. The project page is at this https URL.
