ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.17994
46
0

Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM

23 March 2025
Xin Xue
Haoyi Zhou
Tianyu Chen
Shuai Zhang
Yizhou Long
Jianxin Li
    AI4TS
ArXivPDFHTML
Abstract

Spatial-temporal sequence forecasting (STSF) is a long-standing research problem with widespread real-world applications. Neural architecture search (NAS), which automates the neural network design, has been shown effective in tackling the STSF problem. However, the existing NAS methods for STSF focus on generating architectures in a time-consuming data-driven fashion, which heavily limits their ability to use background knowledge and explore the complicated search trajectory. Large language models (LLMs) have shown remarkable ability in decision-making with comprehensive internal world knowledge, but how it could benefit NAS for STSF remains unexplored. In this paper, we propose a novel NAS method for STSF based on LLM. Instead of directly generate architectures with LLM, We inspire the LLM's capability with a multi-level enhancement mechanism. Specifically, on the step-level, we decompose the generation task into decision steps with powerful prompt engineering and inspire LLM to serve as instructor for architecture search based on its internal knowledge. On the instance-level, we utilize a one-step tuning framework to quickly evaluate the architecture instance and a memory bank to cumulate knowledge to improve LLM's search ability. On the task-level, we propose a two-stage architecture search, balancing the exploration stage and optimization stage, to reduce the possibility of being trapped in local optima. Extensive experimental results demonstrate that our method can achieve competitive effectiveness with superior efficiency against existing NAS methods for STSF.

View on arXiv
@article{xue2025_2503.17994,
  title={ Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM },
  author={ Xin Xue and Haoyi Zhou and Tianyu Chen and Shuai Zhang and Yizhou Long and Jianxin Li },
  journal={arXiv preprint arXiv:2503.17994},
  year={ 2025 }
}
Comments on this paper