ResearchTrend.AI


Instruction Tuning and CoT Prompting for Contextual Medical QA with LLMs

13 June 2025
Chenqian Le
Ziheng Gong
Chihang Wang
Haowei Ni
Panfeng Li
Xupeng Chen
Main: 4 Pages
2 Figures
Bibliography: 2 Pages
1 Table
Abstract

Large language models (LLMs) have shown great potential in medical question answering (MedQA), yet adapting them to biomedical reasoning remains challenging due to domain-specific complexity and limited supervision. In this work, we study how prompt design and lightweight fine-tuning affect the performance of open-source LLMs on PubMedQA, a benchmark for multiple-choice biomedical questions. We focus on two widely used prompting strategies, standard instruction prompts and Chain-of-Thought (CoT) prompts, and apply QLoRA for parameter-efficient instruction tuning. Across multiple model families and sizes, our experiments show that CoT prompting alone can improve reasoning in zero-shot settings, while instruction tuning significantly boosts accuracy. However, fine-tuning on CoT prompts does not universally enhance performance and may even degrade it for certain larger models. These findings suggest that reasoning-aware prompts are useful, but their benefits are model- and scale-dependent. Our study offers practical insights into combining prompt engineering with efficient fine-tuning for medical QA applications.
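
To make the contrast between the two prompting strategies concrete, here is a minimal Python sketch of how a standard instruction prompt and a CoT prompt might be built for one PubMedQA-style item. The template wording, the yes/no/maybe label set, and the helper names `build_prompt` and `parse_label` are illustrative assumptions; the abstract does not give the authors' actual templates.

```python
import re
from typing import Optional

# Assumed label set for a PubMedQA-style item; not taken from the abstract.
LABELS = ("yes", "no", "maybe")


def build_prompt(question: str, context: str, use_cot: bool = False) -> str:
    """Build a standard instruction prompt or a CoT prompt (hypothetical wording)."""
    header = (
        "You are a biomedical expert. Read the context and answer the "
        "question with one of: " + ", ".join(LABELS) + ".\n\n"
    )
    body = f"Context: {context}\nQuestion: {question}\n"
    if use_cot:
        # CoT variant: ask for reasoning first, then a final labeled answer.
        tail = (
            "Let's think step by step, then give the final answer on a new "
            "line as 'Answer: <label>'.\n"
        )
    else:
        # Standard variant: ask for the label directly.
        tail = "Answer:"
    return header + body + tail


def parse_label(completion: str) -> Optional[str]:
    """Return the last 'Answer: <label>' found in a completion, or None."""
    matches = re.findall(r"answer:\s*(yes|no|maybe)\b", completion.lower())
    return matches[-1] if matches else None


if __name__ == "__main__":
    q = "Does drug X reduce symptom Y?"  # placeholder question, not real data
    c = "A placeholder abstract describing a trial of drug X."
    print(build_prompt(q, c, use_cot=False))
    print(build_prompt(q, c, use_cot=True))
```

The same question and context feed both variants, so any accuracy difference in a comparison like this can be attributed to the prompt style rather than to the input. Taking the last matching label in `parse_label` is a design choice that lets the CoT output mention candidate answers during its reasoning without confusing the final extraction.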

@article{le2025_2506.12182,
  title={Instruction Tuning and CoT Prompting for Contextual Medical QA with LLMs},
  author={Chenqian Le and Ziheng Gong and Chihang Wang and Haowei Ni and Panfeng Li and Xupeng Chen},
  journal={arXiv preprint arXiv:2506.12182},
  year={2025}
}