Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.00747
Cited By
v1
v2 (latest)
Contrastive Learning of Medical Visual Representations from Paired Images and Text
2 October 2020
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Contrastive Learning of Medical Visual Representations from Paired Images and Text"
50 / 459 papers shown
Title
Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on Radiology
Nafiz Sadman
Farhana Zulkernine
Benjamin Kwan
27
0
0
17 Jun 2025
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration
Jun Wang
Lixing Zhu
Xiaohan Yu
A. Bhalerao
Yulan He
122
0
0
12 Jun 2025
3D-RAD: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
Xiaotang Gai
Jiaxiang Liu
Yichen Li
Zijie Meng
Jian Wu
Zuozhu Liu
VGen
20
0
0
11 Jun 2025
Foundation Models in Medical Imaging -- A Review and Outlook
Vivien van Veldhuizen
Vanessa Botha
C. Lu
Melis Erdal Cesur
Kevin Groot Lipman
...
Cees Snoek
Lodewyk Wessels
Ritse Mann
Eric Marcus
Jonas Teuwen
MedIm
VLM
AI4CE
70
0
0
10 Jun 2025
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models
Chenyu Lian
Hong-Yu Zhou
Dongyun Liang
J. Qin
L. Wang
MedIm
VLM
36
0
0
10 Jun 2025
Full Conformal Adaptation of Medical Vision-Language Models
Julio Silva-Rodríguez
Leo Fillioux
P. Cournède
Maria Vakalopoulou
Stergios Christodoulidis
Ismail Ben Ayed
Jose Dolz
VLM
70
0
0
06 Jun 2025
Recent Advances in Medical Image Classification
Loan Dao
Ngoc Quoc Ly
74
3
0
04 Jun 2025
Enhancing Biomedical Multi-modal Representation Learning with Multi-scale Pre-training and Perturbed Report Discrimination
Xinliu Zhong
Kayhan Batmanghelich
Li Sun
66
1
0
02 Jun 2025
Leveraging CLIP Encoder for Multimodal Emotion Recognition
Yehun Song
Sunyoung Cho
VLM
42
0
0
01 Jun 2025
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning
Jinquan Guan
Qi Chen
Lizhou Liang
Yuhang Liu
Vu Minh Hieu Phan
Minh-Son To
Jian Chen
Yutong Xie
LM&MA
LRM
46
0
0
29 May 2025
Bringing CLIP to the Clinic: Dynamic Soft Labels and Negation-Aware Learning for Medical Analysis
Hanbin Ko
Chang-Min Park
33
0
0
28 May 2025
Towards Scalable Language-Image Pre-training for 3D Medical Imaging
Chenhui Zhao
Yiwei Lyu
Asadur Chowdury
Edward Harake
A. Kondepudi
Akshay Rao
X. Hou
Honglak Lee
Todd C. Hollon
LM&MA
MedIm
41
0
0
28 May 2025
Learning Shared Representations from Unpaired Data
Amitai Yacobi
Nir Ben-Ari
Ronen Talmon
Uri Shaham
SSL
80
0
0
23 May 2025
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
Bryan Wong
Jong Woo Kim
Huazhu Fu
Mun Yi
VLM
226
0
0
23 May 2025
Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
Yili He
Yan Zhu
Peiyao Fu
Ruijie Yang
Tianyi Chen
Zhihua Wang
Quanlin Li
Pinghong Zhou
Xiaoyu Yang
Shuo Wang
MedIm
VLM
60
0
0
14 May 2025
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Amara Tariq
Rimita Lahiri
Charles Kahn
Imon Banerjee
64
0
0
12 May 2025
A Vision-Language Model for Focal Liver Lesion Classification
Song Jian
Hu Yuchang
Wang Hui
Chen Yen-Wei
VLM
MedIm
50
0
0
06 May 2025
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
Dung Nguyen
Minh Khoi Ho
Huy Ta
T. Nguyen
Qi Chen
...
Zhibin Liao
Minh-Son To
Johan Verjans
Phi Le Nguyen
Vu Minh Hieu Phan
91
0
0
30 Apr 2025
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
Jindong Li
Yongqian Li
Yali Fu
Jiahong Liu
Yixin Liu
Menglin Yang
Irwin King
VLM
86
0
0
19 Apr 2025
ProgRoCC: A Progressive Approach to Rough Crowd Counting
Shengqin Jiang
Linfei Li
Haokui Zhang
Qingshan Liu
Amin Beheshti
Jian Yang
Anton van den Hengel
Quan Z. Sheng
Yuankai Qi
117
0
0
18 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
329
9
0
17 Apr 2025
FedEPA: Enhancing Personalization and Modality Alignment in Multimodal Federated Learning
Yu Zhang
Qingfeng Du
Jiaqi Lv
92
0
0
16 Apr 2025
On the Value of Cross-Modal Misalignment in Multimodal Representation Learning
Yichao Cai
Yuhang Liu
Erdun Gao
Tianjiao Jiang
Zhen Zhang
Anton van den Hengel
Javen Qinfeng Shi
149
0
0
14 Apr 2025
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability
Jonggwon Park
Soobum Kim
Byungmu Yoon
Kyoyun Choi
MedIm
105
0
0
10 Apr 2025
A Lightweight Large Vision-language Model for Multimodal Medical Images
Belal Alsinglawi
Chris McCarthy
Sara Webb
Christopher Fluke
Navid Toosy Saidy
LM&MA
90
0
0
08 Apr 2025
A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?
Julio Silva-Rodríguez
Jose Dolz
Ismail ben Ayed
VLM
MedIm
88
0
0
07 Apr 2025
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
Junjie Wu
Jiangtao Xie
Zhaolin Zhang
Qilong Wang
Q. Hu
P. Li
Sen Xu
VLM
92
0
0
02 Apr 2025
STPNet: Scale-aware Text Prompt Network for Medical Image Segmentation
Dandan Shan
Zihan Li
Yunxiang Li
Qingde Li
Jie Tian
Qingqi Hong
MedIm
73
0
0
02 Apr 2025
iMedImage Technical Report
Ran Wei
ZhiXiong Lan
Qing Yan
Ning Song
Ming Lv
LongQing Ye
106
0
0
27 Mar 2025
Keyword-Oriented Multimodal Modeling for Euphemism Identification
Yuxue Hu
Junsong Li
Meixuan Chen
Dongyu Su
Tongguan Wang
Ying Sha
59
0
0
27 Mar 2025
CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention
Yaxiong Chen
Minghong Wei
Zixuan Zheng
Jingliang Hu
Yilei Shi
Shengwu Xiong
Xiao Xiang Zhu
Lichao Mou
MedIm
80
1
0
20 Mar 2025
A Causality-Inspired Model for Intima-Media Thickening Assessment in Ultrasound Videos
Shuo Gao
Jingyang Zhang
Jun Xue
Meng Yang
Yiran Chen
Guangquan Zhou
CML
88
0
0
16 Mar 2025
Modeling Variants of Prompts for Vision-Language Models
Ao Li
Zongfang Liu
Xinhua Li
Jinghui Zhang
Pengwei Wang
Hu Wang
VLM
78
0
0
13 Mar 2025
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
M. Rahaman
Ewan K. A. Millar
Erik H. W. Meijering
VLM
115
0
0
13 Mar 2025
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
Bangyan Li
Wenxuan Huang
Yunhang Shen
Yansen Wang
Shaohui Lin
...
Ling You
Yinqi Zhang
Ke Li
Xing Sun
Yan Sun
93
2
0
10 Mar 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
87
0
0
10 Mar 2025
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Junhyun Park
Chanyu Moon
Donghwan Lee
Kyungsu Kim
Minho Hwang
VLM
MedIm
136
0
0
03 Mar 2025
A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data
Elham Ghelichkhan
Tolga Tasdizen
67
0
0
02 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
179
2
0
02 Mar 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
121
2
0
24 Feb 2025
Object-centric Binding in Contrastive Language-Image Pretraining
Rim Assouel
Pietro Astolfi
Florian Bordes
M. Drozdzal
Adriana Romero Soriano
OCL
VLM
CoGe
161
3
0
19 Feb 2025
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Lukas Buess
Matthias Keicher
Nassir Navab
Andreas Maier
Soroosh Tayebi Arasteh
LM&MA
326
2
0
13 Feb 2025
Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning
Georgios Margaritis
Periklis Petridis
Dimitris Bertsimas
142
0
0
04 Feb 2025
VisTA: Vision-Text Alignment Model with Contrastive Learning using Multimodal Data for Evidence-Driven, Reliable, and Explainable Alzheimer's Disease Diagnosis
Duy-Cat Can
Linh D. Dang
Quang-Huy Tang
Dang Minh Ly
Huong Ha
Guillaume Blanc
Oliver Y. Chén
Binh T. Nguyen
120
1
0
03 Feb 2025
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models
Jakob Krogh Petersen
Valdemar Licht
Mads Nielsen
Asbjørn Munk
VLM
90
0
0
23 Jan 2025
MedFILIP: Medical Fine-grained Language-Image Pre-training
Xinjie Liang
Xiangyu Li
Fanding Li
Jie Jiang
Qing Dong
Wei Wang
Kaidi Wang
Suyu Dong
Gongning Luo
Shuo Li
LM&MA
VLM
MedIm
170
4
0
18 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
284
27
0
17 Jan 2025
MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis
Sadia Kamal
Tim Oates
MedIm
67
0
0
12 Jan 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
244
235
0
10 Jan 2025
Gaussian Masked Autoencoders
Jathushan Rajasegaran
Xinlei Chen
Rulilong Li
Christoph Feichtenhofer
Jitendra Malik
Shiry Ginosar
3DGS
70
1
0
06 Jan 2025
1
2
3
4
...
8
9
10
Next