

Performance of Large Language Models in Supporting Medical Diagnosis and Treatment

14 April 2025
Diogo Sousa
Guilherme Barbosa
Catarina Rocha
Dulce Oliveira
    LM&MA
    ELM
    AI4MH
Abstract

The integration of Large Language Models (LLMs) into healthcare holds significant potential to enhance diagnostic accuracy and support medical treatment planning. These AI-driven systems can analyze vast datasets, assisting clinicians in identifying diseases, recommending treatments, and predicting patient outcomes. This study evaluates the performance of a range of contemporary LLMs, including both open-source and closed-source models, on the 2024 Portuguese National Exam for medical specialty access (PNA), a standardized medical knowledge assessment. Our results highlight considerable variation in accuracy and cost-effectiveness, with several models demonstrating performance exceeding human benchmarks for medical students on this specific task. We identify leading models based on a combined score of accuracy and cost, discuss the implications of reasoning methodologies like Chain-of-Thought, and underscore the potential for LLMs to function as valuable complementary tools aiding medical professionals in complex clinical decision-making.

@article{sousa2025_2504.10405,
  title={Performance of Large Language Models in Supporting Medical Diagnosis and Treatment},
  author={Diogo Sousa and Guilherme Barbosa and Catarina Rocha and Dulce Oliveira},
  journal={arXiv preprint arXiv:2504.10405},
  year={2025}
}