ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.19378
198
1
v1v2 (latest)

Libra: Leveraging Temporal Images for Biomedical Radiology Analysis

28 November 2024
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
    MedIm
ArXiv (abs)PDFHTML
Abstract

Radiology report generation (RRG) is a challenging task, as it requires a thorough understanding of medical images, integration of multiple temporal inputs, and accurate report generation. Effective interpretation of medical images, such as chest X-rays (CXRs), demands sophisticated visual-language reasoning to map visual findings to structured reports. Recent studies have shown that multimodal large language models (MLLMs) can acquire multimodal capabilities by aligning with pre-trained vision encoders. However, current approaches predominantly focus on single-image analysis or utilise rule-based symbolic processing to handle multiple images, thereby overlooking the essential temporal information derived from comparing current images with prior ones. To overcome this critical limitation, we introduce Libra, a temporal-aware MLLM tailored for CXR report generation using temporal images. Libra integrates a radiology-specific image encoder with a MLLM and utilises a novel Temporal Alignment Connector to capture and synthesise temporal information of images across different time points with unprecedented precision. Extensive experiments show that Libra achieves new state-of-the-art performance among the same parameter scale MLLMs for RRG tasks on the MIMIC-CXR. Specifically, Libra improves the RadCliQ metric by 12.9% and makes substantial gains across all lexical metrics compared to previous models.

View on arXiv
@article{zhang2025_2411.19378,
  title={ Libra: Leveraging Temporal Images for Biomedical Radiology Analysis },
  author={ Xi Zhang and Zaiqiao Meng and Jake Lever and Edmond S. L. Ho },
  journal={arXiv preprint arXiv:2411.19378},
  year={ 2025 }
}
Comments on this paper