ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.00915
  4. Cited By
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
v1v2v3 (latest)

BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs

10 January 2025
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
Robert Tinn
Sam Preston
Rajesh N. Rao
Mu-Hsin Wei
Naveen Valluri
Cliff Wong
Andrea Tupini
Yu Wang
Matt Mazzola
Swadheen Shukla
Lars Liden
Jianfeng Gao
Angela Crabtree
B. Piening
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
    LM&MAMedIm
ArXiv (abs)PDFHTML

Papers citing "BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs"

50 / 185 papers shown
Title
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Alejandro Lozano
Min Woo Sun
James Burgess
Liangyu Chen
Jeffrey Nirschl
...
Xiaohan Wang
Yuhui Zhang
Alfred Seunghoon Song
Robert Tibshirani
Serena Yeung-Levy
LM&MAVLMMedIm
142
10
0
13 Jan 2025
AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning
AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning
Muhammad Awais
Ali Husain Salem Abdulla Alharthi
Amandeep Kumar
Hisham Cholakkal
Rao Muhammad Anwer
VLM
89
5
0
10 Jan 2025
RadGPT: Constructing 3D Image-Text Tumor Datasets
RadGPT: Constructing 3D Image-Text Tumor Datasets
P. R. Bassi
Mehmet Can Yavuz
Kang Wang
Xiaoxi Chen
Wenxuan Li
S. Decherchi
Andrea Cavalli
Yang Yang
Alan Yuille
Zongwei Zhou
LM&MAMedIm
113
2
0
08 Jan 2025
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification
K. E. Khoury
Maxime Zanella
Benoît Gérin
Tiffanie Godelaine
Benoît Macq
Saïd Mahmoudi
Christophe De Vleeschouwer
Ismail Ben Ayed
VLM
124
1
0
08 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
85
5
0
08 Jan 2025
Adaptive Concept Bottleneck for Foundation Models Under Distribution
  Shifts
Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
Jihye Choi
Jayaram Raghuram
Yixuan Li
Somesh Jha
148
5
0
18 Dec 2024
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary
  Learning Framework for Abnormality Detection and Report Generation
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation
Jinghan Sun
Dong-mei Wei
Zhe Xu
Donghuan Lu
Hong Liu
Hong Wang
Sotirios A. Tsaftaris
Jingyu Sun
Yefeng Zheng
Liansheng Wang
MedIm
155
0
0
18 Dec 2024
ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models
ACE-M3M^3M3: Automatic Capability Evaluator for Multimodal Medical Models
Xiechi Zhang
Shunfan Zheng
Linlin Wang
Gerard de Melo
Zhu Cao
Xiaoling Wang
Liang He
ELM
143
0
0
16 Dec 2024
Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
MedIm
157
1
0
28 Nov 2024
LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology
  Report Generation
LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation
Steven Song
Anirudh Subramanyam
Irene Madejski
Robert L. Grossman
MedImVLM
156
0
0
25 Nov 2024
Abnormality-Driven Representation Learning for Radiology Imaging
Abnormality-Driven Representation Learning for Radiology Imaging
M. Ligero
Tim Lenz
Georg Wolflein
Omar S. M. El Nahhas
Daniel Truhn
Jakob Nikolas Kather
MedIm
124
0
0
25 Nov 2024
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
Bo Liu
K. Zou
Liming Zhan
Zexin Lu
Xiaoyu Dong
Yidi Chen
Chengqiang Xie
Jiannong Cao
Xiao-Ming Wu
Huazhu Fu
180
2
0
25 Nov 2024
FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing
  Interpretability in Chest X-Ray Report Generation
FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation
Trong-Thang Pham
Ngoc-Vuong Ho
Nhat-Tan Bui
T. Phan
Patel Brijesh
...
Gianfranco Doretto
Anh Nguyen
Carol C. Wu
Hien Nguyen
Ngan Le
165
4
0
23 Nov 2024
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
Taha Koleilat
Hojat Asgariandehkordi
H. Rivaz
Yiming Xiao
VLM
167
1
0
21 Nov 2024
Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are
  We There Yet?
Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are We There Yet?
Aldo Marzullo
Marta Bianca Maria Ranzini
MedImUQCVVLM
51
0
0
14 Nov 2024
Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in
  Alzheimer's Disease
Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer's Disease
Francesco Chiumento
Mingming Liu
LM&MA
66
0
0
12 Nov 2024
TexLiverNet: Leveraging Medical Knowledge and Spatial-Frequency
  Perception for Enhanced Liver Tumor Segmentation
TexLiverNet: Leveraging Medical Knowledge and Spatial-Frequency Perception for Enhanced Liver Tumor Segmentation
Xiaoyan Jiang
Zhi Zhou
HaiLing Wang
Guozhong Wang
ZhiJun Fang
MedIm
75
0
0
07 Nov 2024
Medical Adaptation of Large Language and Vision-Language Models: Are We
  Making Progress?
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?
Daniel P. Jeong
Saurabh Garg
Zachary Chase Lipton
Michael Oberst
LM&MAVLMELM
55
13
0
06 Nov 2024
Large Language Model Benchmarks in Medical Tasks
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Yize Zhang
Caitlyn Heqi Yin
Cheng Fei
...
Ziqian Bi
Pohsun Feng
Keyu Chen
Junyu Liu
Qian Niu
LM&MAAI4MH
98
9
0
28 Oct 2024
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering
Zhilin Zhang
Jie Wang
Zhanghao Qin
Ruiqi Zhu
Xiaoliang Gong
MedIm
179
0
0
28 Oct 2024
R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest
R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest
Xupeng Chen
Zhixin Lai
Kangrui Ruan
Shichu Chen
Jiaxiang Liu
Zuozhu Liu
102
3
0
27 Oct 2024
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced
  diffusion model
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model
Peng Huang
Bowen Guo
Shuyu Liang
Junhu Fu
Yuanyuan Wang
Yi Guo
DiffMMedIm
55
1
0
26 Oct 2024
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound
Xuechen Guo
Wenhao Chai
Shi-Yan Li
Gaoang Wang
59
9
0
19 Oct 2024
DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving
  Federated Low-rank Adaptation
DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation
Meilu Zhu
Axiu Mao
Jun Liu
Yixuan Yuan
79
3
0
16 Oct 2024
EchoPrime: A Multi-Video View-Informed Vision-Language Model for
  Comprehensive Echocardiography Interpretation
EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation
Milos Vukadinovic
Xiu Tang
N. Yuan
Paul Cheng
Debiao Li
Susan Cheng
Bryan He
David Ouyang
39
11
0
13 Oct 2024
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
Andrew Hoopes
V. Butoi
John Guttag
Adrian V. Dalca
MedImLM&MA
79
2
0
10 Oct 2024
MedImageInsight: An Open-Source Embedding Model for General Domain
  Medical Imaging
MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Noel C. F. Codella
Ying Jin
Shrey Jain
Yu Gu
Ho Hin Lee
...
Lei Li
Thomas Lin
Ivan Tarapov
M. Lungren
Mu-Hsin Wei
LM&MAVLMMedIm
78
9
0
09 Oct 2024
MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA
  with LLM and MLLM Integration
MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Lai Wei
Wenkai Wang
Xiaoyu Shen
Yu Xie
Zhihao Fan
Xiaojin Zhang
Zhongyu Wei
Wei Chen
61
6
0
06 Oct 2024
AgriCLIP: Adapting CLIP for Agriculture and Livestock via
  Domain-Specialized Cross-Model Alignment
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment
Umair Nawaz
Muhammad Awais
Hanan Gani
Muzammal Naseer
Fahad Khan
Salman Khan
Rao Muhammad Anwer
VLMCLIP
70
3
0
02 Oct 2024
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling
Jihai Zhang
Xiaoye Qu
Tong Zhu
Yu Cheng
82
9
0
28 Sep 2024
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context
  Information in Multi-Turn Multimodal Medical Dialogue
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Zhangpu Li
Changhong Zou
Suxue Ma
Zhicheng Yang
Chen Du
...
Xingzhi Sun
Jing Xiao
Kai Zhang
Mei Han
Mei Han
LM&MA
77
1
0
26 Sep 2024
Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography
Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography
Yuexi Du
John Onofrey
Nicha Dvornek
VLM
87
2
0
26 Sep 2024
Towards General Text-guided Image Synthesis for Customized Multimodal
  Brain MRI Generation
Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation
Yulin Wang
Honglin Xiong
Kaicong Sun
Shuwei Bai
Ling Dai
Zhongxiang Ding
Jiameng Liu
Qian Wang
Qian Liu
Dinggang Shen
MedImDiffM
69
2
0
25 Sep 2024
MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
Mohammad Shahab Sepehri
Zalan Fabian
Maryam Soltanolkotabi
Mahdi Soltanolkotabi
MedIm
112
6
0
23 Sep 2024
From Text to Multimodality: Exploring the Evolution and Impact of Large
  Language Models in Medical Practice
From Text to Multimodality: Exploring the Evolution and Impact of Large Language Models in Medical Practice
Qian Niu
Keyu Chen
Ming Li
Pohsun Feng
Ziqian Bi
...
Junyu Liu
Benji Peng
Tianyang Wang
Yunze Wang
Silin Chen
LM&MA
67
7
0
14 Sep 2024
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval
Amirreza Mahbod
Nematollah Saeidi
Sepideh Hatamikia
Ramona Woitek
VLMMedIm
99
3
0
14 Sep 2024
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic
  image analysis
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis
Danli Shi
Weiyi Zhang
Jiancheng Yang
Siyu Huang
Xiaolan Chen
...
Kai Jin
Shan Lin
Shunming Liu
Qing Zhang
M. He
VLMMedIm
64
7
0
10 Sep 2024
FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive
  Differentiation of Normal and Abnormal Attributes
FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes
Kai Shu
Yuzhuo Jia
Ziyang Zhang
Jiechao Gao
MedIm
80
0
0
06 Sep 2024
Democratizing MLLMs in Healthcare: TinyLLaVA-Med for Efficient
  Healthcare Diagnostics in Resource-Constrained Settings
Democratizing MLLMs in Healthcare: TinyLLaVA-Med for Efficient Healthcare Diagnostics in Resource-Constrained Settings
Aya El Mir
Lukelo Thadei Luoga
Boyuan Chen
Muhammad Abdullah Hanif
Mohamed Bennai
58
2
0
02 Sep 2024
EEG-Language Modeling for Pathology Detection
EEG-Language Modeling for Pathology Detection
Sam Gijsen
Kerstin Ritter
105
2
0
02 Sep 2024
A Survey for Large Language Models in Biomedicine
A Survey for Large Language Models in Biomedicine
Chong Wang
Mengyao Li
Junjun He
Zhongruo Wang
Erfan Darzi
...
Yi Yu
Pietro Liò
Tianyun Wang
Yu Guang Wang
Yiqing Shen
LM&MA
108
13
0
29 Aug 2024
A New Era in Computational Pathology: A Survey on Foundation and
  Vision-Language Models
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models
Dibaloke Chanda
Milan Aryal
Nasim Yahya Soltani
Masoud Ganji
AI4CEVLM
124
7
0
23 Aug 2024
Has Multimodal Learning Delivered Universal Intelligence in Healthcare?
  A Comprehensive Survey
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey
Qika Lin
Yifan Zhu
Xin Mei
Ling Huang
Jingying Ma
Kai He
Zhen Peng
Min Zhang
Mengling Feng
100
22
0
23 Aug 2024
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image
  Understanding
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding
Dawei Dai
Yuanhui Zhang
Long Xu
Qianlan Yang
Xiaojing Shen
Shuyin Xia
Guoyin Wang
LM&MAVLM
119
11
0
18 Aug 2024
TextCAVs: Debugging vision models using text
TextCAVs: Debugging vision models using text
A. Nicolson
Yarin Gal
J. A. Noble
CoGe
66
1
0
16 Aug 2024
Navigating Data Scarcity using Foundation Models: A Benchmark of
  Few-Shot and Zero-Shot Learning Approaches in Medical Imaging
Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging
S. Woerner
Christian F. Baumgartner
VLMMedIm
53
0
0
15 Aug 2024
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured
  Surgical Video Learning
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning
Jiajie Li
Garrett C Skinner
Gene Yang
Brian R Quaranto
Steven D. Schwaitzberg
Peter C W Kim
Jinjun Xiong
86
11
0
15 Aug 2024
PathInsight: Instruction Tuning of Multimodal Datasets and Models for
  Intelligence Assisted Diagnosis in Histopathology
PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Xiaomin Wu
Rui Xu
Pengchen Wei
Wenkang Qin
Peixiang Huang
Ziheng Li
Lin Luo
LM&MA
62
6
0
13 Aug 2024
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Yunfei Xie
Ce Zhou
Lang Gao
Juncheng Wu
Xianhang Li
...
Sheng Liu
Lei Xing
James Zou
Cihang Xie
Yuyin Zhou
LM&MAMedIm
147
32
0
06 Aug 2024
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
Danfeng Guo
Sumitaka Honji
LRM
124
2
0
31 Jul 2024
Previous
1234
Next