arXiv:2507.02074, v2 (latest)

Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges

2 July 2025
Sanjeda Akter
Ibne Farabi Shihab
Anuj Sharma
    VLM
Main: 20 pages, 8 figures, 5 tables. Bibliography: 4 pages.
Abstract

Crash detection from video feeds is a critical problem in intelligent transportation systems. Recent developments in large language models (LLMs) and vision-language models (VLMs) have transformed how we process, reason about, and summarize multimodal information. This paper surveys recent methods leveraging LLMs for crash detection from video data. We present a structured taxonomy of fusion strategies, summarize key datasets, analyze model architectures, compare performance benchmarks, and discuss ongoing challenges and opportunities. Our review provides a foundation for future research in this fast-growing intersection of video understanding and foundation models.
