arXiv:2303.00915, v3 (latest)
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
10 January 2025
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
Robert Tinn
Sam Preston
Rajesh N. Rao
Mu-Hsin Wei
Naveen Valluri
Cliff Wong
Andrea Tupini
Yu Wang
Matt Mazzola
Swadheen Shukla
Lars Liden
Jianfeng Gao
Angela Crabtree
B. Piening
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
Papers citing
"BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs"
50 / 185 papers shown
Title
Recent Advances in Medical Image Classification
Loan Dao
Ngoc Quoc Ly
57
3
0
04 Jun 2025
A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Shengyuan Liu
Boyun Zheng
Wenting Chen
Zhihao Peng
Zhenfei Yin
Jing Shao
Jiancong Hu
Yixuan Yuan
ELM
61
0
0
29 May 2025
Towards Scalable Language-Image Pre-training for 3D Medical Imaging
Chenhui Zhao
Yiwei Lyu
Asadur Chowdury
Edward Harake
A. Kondepudi
Akshay Rao
X. Hou
Honglak Lee
Todd C. Hollon
LM&MA
MedIm
16
0
0
28 May 2025
Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning
Cheng Peng
Kai Zhang
Mengxian Lyu
Hongfang Liu
Lichao Sun
Yonghui Wu
LM&MA
MedIm
VLM
267
0
0
23 May 2025
Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery
Yanbo Zhang
S. Khan
Adnan Mahmud
Huck Yang
Alexander Lavin
...
James A. Evans
Alan R. Bundy
Jannis Brugger
Jesper Tegner
Hector Zenil
LM&MA
67
1
0
22 May 2025
On the Robustness of Medical Vision-Language Models: Are they Truly Generalizable?
Raza Imam
Rufael Marew
Mohammad Yaqub
AAML
VLM
61
0
0
21 May 2025
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
Ta Duc Huy
Duy Anh Huynh
Yutong Xie
Yuankai Qi
Qi Chen
...
Anton van den Hengel
Zhibin Liao
Minh-Son To
Johan Verjans
Vu Minh Hieu Phan
79
0
0
21 May 2025
Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning
Xiaoyu Yang
Jie Lu
En Yu
44
1
0
19 May 2025
MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
Siyuan Yan
Xiaochen Li
Ming Hu
Yiwen Jiang
Zhen Yu
Zongyuan Ge
MedIm
VLM
63
0
0
14 May 2025
Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
Yili He
Yan Zhu
Peiyao Fu
Ruijie Yang
Tianyi Chen
Zhihua Wang
Quanlin Li
Pinghong Zhou
Xiaoyu Yang
Shuo Wang
MedIm
VLM
58
0
0
14 May 2025
BioVFM-21M: Benchmarking and Scaling Self-Supervised Vision Foundation Models for Biomedical Image Analysis
Jiarun Liu
Hong-Yu Zhou
Weijian Huang
Hao Yang
Dongning Song
Tao Tan
Yong Liang
Shanshan Wang
MedIm
58
0
0
14 May 2025
Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Pengfei Guo
Can Zhao
Dong Yang
Yufan He
V. Nath
...
Zongwei Zhou
Benjamin D. Simon
Stephanie Harmon
Baris Turkbey
Daguang Xu
DiffM
MedIm
84
0
0
07 May 2025
Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding
Trilok Padhi
R. Kaur
Adam D. Cobb
Manoj Acharya
Anirban Roy
Colin Samplawski
Brian Matejek
Alexander M. Berenbeim
Nathaniel D. Bastian
Susmit Jha
64
0
0
30 Apr 2025
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Shahad Albastaki
Anabia Sohail
I. I. Ganapathi
B. Alawode
Asim Khan
Sajid Javed
Naoufel Werghi
Mohammed Bennamoun
Arif Mahmood
133
0
0
26 Apr 2025
Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis
Frank Li
Hari M. Trivedi
Bardia Khosravi
Theo Dapamede
Mohammadreza Chavoshi
...
Rohan Isaac
Aawez Mansuri
Janice Newsome
S. Purkayastha
J. Gichoya
LM&MA
69
0
0
22 Apr 2025
Causal Disentanglement for Robust Long-tail Medical Image Generation
Weizhi Nie
Zichun Zhang
Weijie Wang
Bruno Lepri
Anan Liu
Nicu Seb
DiffM
MedIm
OOD
CML
194
0
0
20 Apr 2025
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Yang Yue
Yulin Wang
Haojun Jiang
Pan Liu
S. Song
Gao Huang
VGen
107
0
0
17 Apr 2025
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong-Son Hy
114
2
0
14 Apr 2025
Leveraging LLMs for Multimodal Retrieval-Augmented Radiology Report Generation via Key Phrase Extraction
Kyoyun Choi
Byungmu Yoon
Soobum Kim
Jonggwon Park
123
1
0
10 Apr 2025
AVP-AP: Self-supervised Automatic View Positioning in 3D cardiac CT via Atlas Prompting
Xiaolin Fan
Yansen Wang
Yuanhang Zhang
Mingkun Bao
Bosen Jia
Dong Lu
Yifan Gu
Jian Cheng
Haogang Zhu
180
0
0
08 Apr 2025
A Lightweight Large Vision-language Model for Multimodal Medical Images
Belal Alsinglawi
Chris McCarthy
Sara Webb
Christopher Fluke
Navid Toosy Saidy
LM&MA
71
0
0
08 Apr 2025
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
Junjie Wu
Jiangtao Xie
Zhaolin Zhang
Qilong Wang
Q. Hu
P. Li
Sen Xu
VLM
79
0
0
02 Apr 2025
Vision Language Models versus Machine Learning Models Performance on Polyp Detection and Classification in Colonoscopy Images
Mohammad Amin Khalafi
Seyed Amir Ahmad Safavi-Naini
Ameneh Salehi
Nariman Naderi
Dorsa Alijanzadeh
...
Nicholas P Tatonetti
Nicholas Hoerter
Girish Nadkarni
Hamid Asadzadeh Aghdaei
Ali Soroush
83
0
0
27 Mar 2025
A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI
Alejandro Lozano
Min Woo Sun
James Burgess
Jeffrey Nirschl
Christopher Polzak
...
Xiaohan Wang
Alfred Seunghoon Song
Chiang Chia-Chun
Robert Tibshirani
Serena Yeung-Levy
LM&MA
161
2
0
26 Mar 2025
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow
Ziyue Wang
Junde Wu
Linghan Cai
Chang Han Low
Xihong Yang
Qiaxuan Li
Yueming Jin
LRM
121
2
0
21 Mar 2025
UMIT: Unifying Medical Imaging Tasks via Vision-Language Models
Haiyang Yu
Siyang Yi
Ke Niu
Minghan Zhuo
Bin Li
LM&MA
80
4
0
20 Mar 2025
Advancing Medical Representation Learning Through High-Quality Data
Negin Baghbanzadeh
Adibvafa Fallahpour
Yasaman Parhizkar
Franklin Ogidi
Shuvendu Roy
...
Vahid Reza Khazaie
Michael Colacci
Ali Etemad
Arash Afkanpour
Elham Dolatabadi
LM&MA
151
1
0
18 Mar 2025
VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis
Z. T. Wang
Renjiao Yi
Xin Wen
Chenyang Zhu
K. Xu
DiffM
MedIm
95
0
0
17 Mar 2025
How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark
Roba Al Majzoub
H. Malik
Muzammal Naseer
Zaigham Zaheer
Tariq Mahmood
Salman Khan
Fahad A Khan
VLM
144
1
0
17 Mar 2025
Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification
Nathaniel Lesperance
S. Ratnasingham
Graham W. Taylor
VLM
126
0
0
13 Mar 2025
On the Limitations of Vision-Language Models in Understanding Image Transforms
Ahmad Mustafa Anis
Hasnain Ali
Saquib Sarfraz
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
190
0
0
12 Mar 2025
Multi-Modal Foundation Models for Computational Pathology: A Survey
Dong Li
Guihong Wan
Xintao Wu
Xinyu Wu
Xiaohui Chen
Yi He
Christine G. Lian
Peter K. Sorger
Yevgeniy R. Semenov
Chen Zhao
MedIm
102
0
0
12 Mar 2025
Towards a Multimodal MRI-Based Foundation Model for Multi-Level Feature Exploration in Segmentation, Molecular Subtyping, and Grading of Glioma
Somayeh Farahani
Marjaneh Hejazi
A. Di Ieva
Emad Fatemizadeh
Sidong Liu
83
0
0
10 Mar 2025
Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment
Xing Xie
Jiawei Liu
Ziyue Lin
Huijie Fan
Zhi Han
Yandong Tang
Liangqiong Qu
109
0
0
10 Mar 2025
Towards Universal Text-driven CT Image Segmentation
Yuheng Li
Yuxiang Lai
Maria Thor
Deborah Marshall
Zachary Buchwald
D. Yu
Xiaofeng Yang
MedIm
VLM
96
3
0
08 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
109
1
0
06 Mar 2025
A Shared Encoder Approach to Multimodal Representation Learning
Shuvendu Roy
Franklin Ogidi
Ali Etemad
Elham Dolatabadi
Arash Afkanpour
58
0
0
03 Mar 2025
Confounder-Aware Medical Data Selection for Fine-Tuning Pretrained Vision Models
Anyang Ji
Qingbo Kang
Wei Xu
Changfan Wang
Kang Li
Qicheng Lao
64
0
0
02 Mar 2025
LesionDiffusion: Towards Text-controlled General Lesion Synthesis
Henrui Tian
Wenhui Lei
Linrui Dai
Hanyu Chen
Xiaofan Zhang
DiffM
MedIm
69
0
0
02 Mar 2025
Delving into Out-of-Distribution Detection with Medical Vision-Language Models
Lie Ju
Sijin Zhou
Yukun Zhou
Huimin Lu
Zhuoting Zhu
P. Keane
Zongyuan Ge
VLM
84
0
0
02 Mar 2025
Towards Statistical Factuality Guarantee for Large Vision-Language Models
Zechao Li
Chao Yan
Nicholas J. Jackson
Wendi Cui
B. Li
Jiaxin Zhang
Bradley Malin
122
0
0
27 Feb 2025
Repurposing the scientific literature with vision-language models
Anton Alyakin
Jaden Stryker
Daniel Alber
Karl L. Sangwon
Brandon Duderstadt
...
Laura Snyder
Eric Leuthardt
Douglas Kondziolka
E. Oermann
Eric Karl Oermann
150
0
0
26 Feb 2025
FedBM: Stealing Knowledge from Pre-trained Language Models for Heterogeneous Federated Learning
Meilu Zhu
Qiushi Yang
Zhifan Gao
Yixuan Yuan
Jun Liu
FedML
109
0
0
24 Feb 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
112
2
0
24 Feb 2025
Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives
Dilermando Queiroz
Anderson Carlos
André Anjos
Lilian Berton
111
0
0
24 Feb 2025
Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification
Xiangyu Sun
Xiaoguang Zou
Yuanquan Wu
Guotai Wang
Shanghang Zhang
MedIm
VLM
98
0
0
31 Jan 2025
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity
Zahra Gharaee
Scott C. Lowe
ZeMing Gong
Pablo Millán Arias
Nicholas Pellegrino
...
Lila Kari
Dirk Steinke
Graham W. Taylor
Paul Fieguth
Angel X. Chang
107
11
0
28 Jan 2025
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Jiajie Li
Brian R Quaranto
Chenhui Xu
Ishan Mishra
Ruiyang Qin
Dancheng Liu
Peter C W Kim
Jinjun Xiong
145
0
0
25 Jan 2025
CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification
Cristiano Patrício
Isabel Rio-Torto
J. S. Cardoso
Luís F. Teixeira
João C. Neves
VLM
487
1
0
21 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
274
26
0
17 Jan 2025