ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.10328
105
0

Towards Scalable SOAP Note Generation: A Weakly Supervised Multimodal Framework

12 June 2025
Sadia Kamal
Tim Oates
Joy Wan
ArXiv (abs)PDFHTML
Main:4 Pages
4 Figures
Bibliography:2 Pages
4 Tables
Appendix:4 Pages
Abstract

Skin carcinoma is the most prevalent form of cancer globally, accounting for over 8billioninannualhealthcareexpenditures.Inclinicalsettings,physiciansdocumentpatientvisitsusingdetailedSOAP(Subjective,Objective,Assessment,andPlan)notes.However,manuallygeneratingthesenotesislabor−intensiveandcontributestoclinicianburnout.Inthiswork,weproposeaweaklysupervisedmultimodalframeworktogenerateclinicallystructuredSOAPnotesfromlimitedinputs,includinglesionimagesandsparseclinicaltext.Ourapproachreducesrelianceonmanualannotations,enablingscalable,clinicallygroundeddocumentationwhilealleviatingclinicianburdenandreducingtheneedforlargeannotateddata.OurmethodachievesperformancecomparabletoGPT−4o,Claude,andDeepSeekJanusProacrosskeyclinicalrelevancemetrics.Toevaluateclinicalquality,weintroducetwonovelmetricsMedConceptEvalandClinicalCoherenceScore(CCS)whichassesssemanticalignmentwithexpertmedicalconceptsandinputfeatures,respectively.8 billion in annual healthcare expenditures. In clinical settings, physicians document patient visits using detailed SOAP (Subjective, Objective, Assessment, and Plan) notes. However, manually generating these notes is labor-intensive and contributes to clinician burnout. In this work, we propose a weakly supervised multimodal framework to generate clinically structured SOAP notes from limited inputs, including lesion images and sparse clinical text. Our approach reduces reliance on manual annotations, enabling scalable, clinically grounded documentation while alleviating clinician burden and reducing the need for large annotated data. Our method achieves performance comparable to GPT-4o, Claude, and DeepSeek Janus Pro across key clinical relevance metrics. To evaluate clinical quality, we introduce two novel metrics MedConceptEval and Clinical Coherence Score (CCS) which assess semantic alignment with expert medical concepts and input features, respectively.8billioninannualhealthcareexpenditures.Inclinicalsettings,physiciansdocumentpatientvisitsusingdetailedSOAP(Subjective,Objective,Assessment,andPlan)notes.However,manuallygeneratingthesenotesislabor−intensiveandcontributestoclinicianburnout.Inthiswork,weproposeaweaklysupervisedmultimodalframeworktogenerateclinicallystructuredSOAPnotesfromlimitedinputs,includinglesionimagesandsparseclinicaltext.Ourapproachreducesrelianceonmanualannotations,enablingscalable,clinicallygroundeddocumentationwhilealleviatingclinicianburdenandreducingtheneedforlargeannotateddata.OurmethodachievesperformancecomparabletoGPT−4o,Claude,andDeepSeekJanusProacrosskeyclinicalrelevancemetrics.Toevaluateclinicalquality,weintroducetwonovelmetricsMedConceptEvalandClinicalCoherenceScore(CCS)whichassesssemanticalignmentwithexpertmedicalconceptsandinputfeatures,respectively.

View on arXiv
@article{kamal2025_2506.10328,
  title={ Towards Scalable SOAP Note Generation: A Weakly Supervised Multimodal Framework },
  author={ Sadia Kamal and Tim Oates and Joy Wan },
  journal={arXiv preprint arXiv:2506.10328},
  year={ 2025 }
}
Comments on this paper