ResearchTrend.AI
MedCLIP: Contrastive Learning from Unpaired Medical Images and Text

18 October 2022
Zifeng Wang
Zhenbang Wu
Dinesh Agarwal
Jimeng Sun
    CLIP
    VLM
    MedIm
arXiv: 2210.10163

Papers citing "MedCLIP: Contrastive Learning from Unpaired Medical Images and Text"

50 / 244 papers shown
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
Wenchuan Zhang
Penghao Zhang
Jingru Guo
Tao Cheng
Jie Chen
Shuwan Zhang
Zhang Zhang
Yuhao Yi
Hong Bu
AI4TS
LRM
12
0
0
16 May 2025
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
Yuncheng Guo
Xiaodong Gu
OffRL
VLM
27
0
0
15 May 2025
Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
Yili He
Yan Zhu
Peiyao Fu
Ruijie Yang
Tianyi Chen
Zhihua Wang
Quanlin Li
Pinghong Zhou
Xiaoyu Yang
Shuo Wang
MedIm
VLM
28
0
0
14 May 2025
UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System
Yitao Zhu
Yuan Yin
Zhenrong Shen
Zihao Zhao
Haiyu Song
Sheng Wang
D. Shen
Qian Wang
MedIm
33
0
0
14 May 2025
A Multimodal Multi-Agent Framework for Radiology Report Generation
Ziruo Yi
Ting Xiao
Mark V. Albert
MedIm
26
0
0
14 May 2025
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Amara Tariq
Rimita Lahiri
Charles Kahn
Imon Banerjee
26
0
0
12 May 2025
CheXLearner: Text-Guided Fine-Grained Representation Learning for Progression Detection
Yunhong Wang
Junwen Duan
Xinyu Li
Jianxin Wang
MedIm
34
0
0
11 May 2025
A Vision-Language Model for Focal Liver Lesion Classification
Song Jian
Hu Yuchang
Wang Hui
Chen Yen-Wei
VLM
MedIm
43
0
0
06 May 2025
CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models
Hasan Md Tusfiqur Alam
Devansh Srivastav
Abdulrahman Mohamed Selim
Md Abdul Kadir
Md Moktadiurl Hoque Shuvo
Daniel Sonntag
MedIm
45
0
0
29 Apr 2025
Causal Disentanglement for Robust Long-tail Medical Image Generation
Weizhi Nie
Zichun Zhang
Weijie Wang
Bruno Lepri
Anan Liu
Nicu Seb
DiffM
MedIm
OOD
CML
50
0
0
20 Apr 2025
Post-pre-training for Modality Alignment in Vision-Language Foundation Models
Shinýa Yamaguchi
Dewei Feng
Sekitoshi Kanai
Kazuki Adachi
Daiki Chijiwa
VLM
34
0
0
17 Apr 2025
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation
Jingkun Chen
Haoran Duan
Xiao Zhang
Boyan Gao
T. Tan
Vicente Grau
Jungong Han
19
0
0
15 Apr 2025
DualPrompt-MedCap: A Dual-Prompt Enhanced Approach for Medical Image Captioning
Yining Zhao
Ali Braytee
Mukesh Prasad
VLM
MedIm
32
0
0
13 Apr 2025
REMEMBER: Retrieval-based Explainable Multimodal Evidence-guided Modeling for Brain Evaluation and Reasoning in Zero- and Few-shot Neurodegenerative Diagnosis
Duy-Cat Can
Quang-Huy Tang
Huong Ha
Binh T. Nguyen
Oliver Y. Chén
28
0
0
12 Apr 2025
Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging
Siyuan Dai
Kai Ye
Guodong Liu
Haoteng Tang
Liang Zhan
MedIm
38
0
0
09 Apr 2025
A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?
Julio Silva-Rodríguez
Jose Dolz
Ismail ben Ayed
VLM
MedIm
38
0
0
07 Apr 2025
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation
Chengxi Zeng
Yuxuan Jiang
Fan Zhang
A. Gambaruto
T. Burghardt
MedIm
48
0
0
03 Apr 2025
Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images
Nusrat Munia
Abdullah-Al-Zubaer Imran
LM&MA
MedIm
32
0
0
02 Apr 2025
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
Junjie Wu
Jiangtao Xie
Zhaolin Zhang
Qilong Wang
Q. Hu
P. Li
Sen Xu
VLM
43
0
0
02 Apr 2025
AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs
Diwei Wang
Cédric Bobenrieth
Hyewon Seo
LRM
44
0
0
23 Mar 2025
TGV: Tabular Data-Guided Learning of Visual Cardiac Representations
Marta Hasny
Maxime Di Folco
Keno Bressem
Julia A. Schnabel
36
0
0
19 Mar 2025
RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT
Yuheng Li
Mingzhe Hu
Richard L. J. Qiu
Maria Thor
Andre Williams
Deborah Marshall
Xiaofeng Yang
MedIm
69
0
0
18 Mar 2025
Prototype-Based Image Prompting for Weakly Supervised Histopathological Image Segmentation
Qingchen Tang
Lei Fan
M. Pagnucco
Yang Song
VLM
48
0
0
15 Mar 2025
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
M. Rahaman
Ewan K. A. Millar
Erik H. W. Meijering
VLM
64
0
0
13 Mar 2025
MMRL: Multi-Modal Representation Learning for Vision-Language Models
Yuncheng Guo
Xiaodong Gu
VLM
OffRL
141
1
0
11 Mar 2025
Towards Universal Text-driven CT Image Segmentation
Yuheng Li
Yuxiang Lai
Maria Thor
Deborah Marshall
Zachary Buchwald
D. Yu
Xiaofeng Yang
MedIm
VLM
59
2
0
08 Mar 2025
Semantic Alignment of Unimodal Medical Text and Vision Representations
Maxime Di Folco
E. Chan
Marta Hasny
Cosmin I. Bercea
Julia A. Schnabel
68
0
0
06 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
56
0
0
06 Mar 2025
Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing
Ryan Banks
Vishal Thengane
María Eugenia Guerrero
Nelly Maria García-Madueño
Yunpeng Li
Hongying Tang
A. Chaurasia
54
0
0
05 Mar 2025
X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning
Jianzhong You
Yuan Gao
Sangwook Kim
Chris McIntosh
69
1
0
04 Mar 2025
Primus: Enforcing Attention Usage for 3D Medical Image Segmentation
Tassilo Wald
Saikat Roy
Fabian Isensee
Constantin Ulrich
Sebastian Ziegler
D. Trofimova
Raphael Stock
Michael Baumgartner
Gregor Köhler
Klaus H. Maier-Hein
ViT
MedIm
42
1
0
03 Mar 2025
Abn-BLIP: Abnormality-aligned Bootstrapping Language-Image Pre-training for Pulmonary Embolism Diagnosis and Report Generation from CTPA
Z. Zhong
Yuli Wang
Lulu Bi
Zhuoqi Ma
S. H. Ahn
...
Webster Stayman
Todd M. Kolb
I. Kamel
Harrison X. Bai
Zhicheng Jiao
LM&MA
63
0
0
03 Mar 2025
Delving into Out-of-Distribution Detection with Medical Vision-Language Models
Lie Ju
Sijin Zhou
Yukun Zhou
Huimin Lu
Zhuoting Zhu
P. Keane
Zongyuan Ge
VLM
42
0
0
02 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
56
1
0
02 Mar 2025
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
Tianyi Wang
Jianan Fan
Dingxin Zhang
Dongnan Liu
Yong-quan Xia
Heng Huang
Weidong Cai
39
0
0
01 Mar 2025
Progressive Local Alignment for Medical Multimodal Pre-training
Huimin Yan
Xian Yang
Liang Bai
Jiye Liang
72
0
0
25 Feb 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
89
2
0
24 Feb 2025
MedForge: Building Medical Foundation Models Like Open Source Software Development
Zheling Tan
Kexin Ding
Jin Gao
Mu Zhou
Dimitris N. Metaxas
Shaoting Zhang
Dequan Wang
AI4CE
45
1
0
22 Feb 2025
Fair-MoE: Fairness-Oriented Mixture of Experts in Vision-Language Models
Peiran Wang
Linjie Tong
Jiaxiang Liu
Zuozhu Liu
VLM
MoE
46
0
0
10 Feb 2025
VisTA: Vision-Text Alignment Model with Contrastive Learning using Multimodal Data for Evidence-Driven, Reliable, and Explainable Alzheimer's Disease Diagnosis
Duy-Cat Can
Linh D. Dang
Quang-Huy Tang
Dang Minh Ly
Huong Ha
Guillaume Blanc
Oliver Y. Chén
Binh T. Nguyen
66
1
0
03 Feb 2025
Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification
Xiangyu Sun
Xiaoguang Zou
Yuanquan Wu
Guotai Wang
S. Zhang
MedIm
VLM
68
0
0
31 Jan 2025
Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models
Jing Zhang
Xiaowei Yu
Yanjun Lyu
Lu Zhang
Tong Chen
Chao-Yang Cao
Yan Zhuang
Minheng Chen
Tianming Liu
D. Zhu
32
2
0
28 Jan 2025
MedFILIP: Medical Fine-grained Language-Image Pre-training
Xinjie Liang
Xiangyu Li
Fanding Li
Jie Jiang
Qing Dong
Wei Wang
Kaidi Wang
Suyu Dong
Gongning Luo
Shuo Li
LM&MA
VLM
MedIm
64
3
0
18 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
18
0
17 Jan 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
52
0
0
13 Jan 2025
RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment
Difei Gu
Yunhe Gao
Yang Zhou
Mu Zhou
Dimitris N. Metaxas
LM&MA
47
2
0
13 Jan 2025
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis
Andrzej D. Dobrzycki
Ana M. Bernardos
Luca Bergesio
Andrzej Pomirski
Daniel Sáez-Trigueros
3DH
38
3
0
13 Jan 2025
MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis
Sadia Kamal
Tim Oates
MedIm
41
0
0
12 Jan 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
154
205
0
10 Jan 2025
Deep Learning for Ophthalmology: The State-of-the-Art and Future Trends
Duy M. Nguyen
Hasan Md Tusfiqur Alam
T. Nguyen
Devansh Srivastav
H. Profitlich
Ngan Le
Daniel Sonntag
36
2
0
07 Jan 2025