ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.08524
33
0
v1v2 (latest)

Teaching Physical Awareness to LLMs through Sounds

10 June 2025
Weiguo Wang
Andy Nie
Wenrui Zhou
Yi Kai
Chengchen Hu
ArXiv (abs)PDFHTML
Abstract

Large Language Models (LLMs) have shown remarkable capabilities in text and multimodal processing, yet they fundamentally lack physical awareness--understanding of real-world physical phenomena. In this work, we present ACORN, a framework that teaches LLMs physical awareness through sound, focusing on fundamental physical phenomena like the Doppler effect, multipath effect, and spatial relationships. To overcome data scarcity, ACORN introduce a physics-based simulator combining real-world sound sources with controlled physical channels to generate diverse training data. Using this simulator, we build AQA-PHY, a comprehensive Audio Question-Answer dataset, and propose an audio encoder that processes both magnitude and phase information. By connecting our audio encoder to state-of-the-art LLMs, we demonstrate reasonable results in both simulated and real-world tasks, such as line-of-sight detection, Doppler effect estimation, and Direction-of-Arrival estimation, paving the way for enabling LLMs to understand physical world.

View on arXiv
@article{wang2025_2506.08524,
  title={ Teaching Physical Awareness to LLMs through Sounds },
  author={ Weiguo Wang and Andy Nie and Wenrui Zhou and Yi Kai and Chengchen Hu },
  journal={arXiv preprint arXiv:2506.08524},
  year={ 2025 }
}
Main:8 Pages
17 Figures
12 Tables
Appendix:12 Pages
Comments on this paper