Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems
- VLM

The integration of Large Language Models (LLMs) with computer vision is profoundly transforming perception tasks like image segmentation. For intelligent transportation systems (ITS), where accurate scene understanding is critical for safety and efficiency, this new paradigm offers unprecedented capabilities. This survey systematically reviews the emerging field of LLM-augmented image segmentation, focusing on its applications, challenges, and future directions within ITS. We provide a taxonomy of current approaches based on their prompting mechanisms and core architectures, and we highlight how these innovations can enhance road scene understanding for autonomous driving, traffic monitoring, and infrastructure maintenance. Finally, we identify key challenges, including real-time performance and safety-critical reliability, and outline a perspective centered on explainable, human-centric AI as a prerequisite for the successful deployment of this technology in next-generation transportation systems.
View on arXiv@article{akter2025_2506.14096, title={ Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems }, author={ Sanjeda Akter and Ibne Farabi Shihab and Anuj Sharma }, journal={arXiv preprint arXiv:2506.14096}, year={ 2025 } }