Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications

Large language models hold promise for addressing medical challenges such as medical diagnosis reasoning, research knowledge acquisition, clinical decision-making, and consumer health inquiries. However, they often generate hallucinations due to limited medical knowledge. Incorporating external knowledge is therefore critical, which in turn necessitates multi-source knowledge acquisition. We address this challenge by framing it as a source planning problem: formulating context-appropriate queries tailored to the attributes of diverse sources. Existing approaches either overlook source planning or fail to achieve it effectively because of misalignment between the model's expectations of the sources and their actual content. To bridge this gap, we present MedOmniKB, a repository comprising multigenre and multi-structured medical knowledge sources. Leveraging these sources, we propose the Source Planning Optimisation method, which enhances multi-source utilisation. Our approach enables an expert model to explore and evaluate candidate plans while training a smaller model to learn source alignment. Experimental results demonstrate that our method substantially improves multi-source planning performance, enabling the optimised small model to achieve state-of-the-art results in leveraging diverse medical knowledge sources.
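To make the source planning step concrete, the sketch below shows one way a planner could map a medical question to per-source queries. It is a minimal illustration under stated assumptions, not the authors' implementation: the source labels, the plan format, and the stubbed planner call are all hypothetical, and a real system would invoke the trained planning model where the stub stands.

```python
# Hypothetical sketch of source planning: given a question and a set of
# knowledge sources (illustrative stand-ins for MedOmniKB genres), ask a
# planner model to emit one query tailored to each relevant source.
from dataclasses import dataclass


@dataclass
class SourceQuery:
    source: str  # e.g., "textbook", "knowledge_graph", "pubmed" (assumed labels)
    query: str   # query rephrased to match that source's structure


def planner(prompt: str) -> str:
    """Stub for the planning model; a real system would call the trained LLM here."""
    return ("knowledge_graph: metformin -interacts_with-> ?\n"
            "pubmed: metformin drug interactions in type 2 diabetes")


def plan_sources(question: str, sources: list[str]) -> list[SourceQuery]:
    """Produce a plan: one context-appropriate query per relevant source."""
    prompt = (f"Question: {question}\n"
              f"Available sources: {', '.join(sources)}\n"
              "For each relevant source, write one query suited to its format, "
              "one per line as 'source: query'.")
    plan = []
    for line in planner(prompt).splitlines():
        source, _, query = line.partition(":")
        if source.strip() in sources:  # keep only queries aimed at known sources
            plan.append(SourceQuery(source.strip(), query.strip()))
    return plan


if __name__ == "__main__":
    available = ["textbook", "knowledge_graph", "pubmed"]
    for sq in plan_sources("What interacts with metformin?", available):
        print(sq)
```

In this framing, the expert model would score such plans against the retrieved content, and the smaller model would be trained on the preferred plans so its queries stay aligned with what each source actually contains.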
@article{chen2025_2501.02460,
  title={Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications},
  author={Zhe Chen and Yusheng Liao and Shuyang Jiang and Pingjie Wang and Yiqiu Guo and Yanfeng Wang and Yu Wang},
  journal={arXiv preprint arXiv:2501.02460},
  year={2025}
}