Large Language Model-based Nonnegative Matrix Factorization For Cardiorespiratory Sound Separation

9 February 2025

Abstract

This study represents the first integration of large language models (LLMs) with non-negative matrix factorization (NMF), marking a novel advancement in the source separation field. The LLM is employed in two unique ways: enhancing the separation results by providing detailed insights for disease prediction and operating in a feedback loop to optimize a fundamental frequency penalty added to the NMF cost function. We tested the algorithm on two datasets: 100 synthesized mixtures of real measurements, and 210 recordings of heart and lung sounds from a clinical manikin including both individual and mixed sounds, captured using a digital stethoscope. The approach consistently outperformed existing methods, demonstrating its potential to significantly enhance medical sound analysis for disease diagnostics.

View on arXiv

@article{torabi2025_2502.05757,
  title={ Large Language Model-based Nonnegative Matrix Factorization For Cardiorespiratory Sound Separation },
  author={ Yasaman Torabi and Shahram Shirani and James P. Reilly },
  journal={arXiv preprint arXiv:2502.05757},
  year={ 2025 }
}

Comments on this paper