ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.00747
  4. Cited By
Contrastive Learning of Medical Visual Representations from Paired
  Images and Text

Contrastive Learning of Medical Visual Representations from Paired Images and Text

2 October 2020
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
    MedIm
ArXivPDFHTML

Papers citing "Contrastive Learning of Medical Visual Representations from Paired Images and Text"

50 / 445 papers shown
Title
Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
Yili He
Yan Zhu
Peiyao Fu
Ruijie Yang
Tianyi Chen
Zhihua Wang
Quanlin Li
Pinghong Zhou
Xiaoyu Yang
Shuo Wang
MedIm
VLM
31
0
0
14 May 2025
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Amara Tariq
Rimita Lahiri
Charles Kahn
Imon Banerjee
31
0
0
12 May 2025
A Vision-Language Model for Focal Liver Lesion Classification
A Vision-Language Model for Focal Liver Lesion Classification
Song Jian
Hu Yuchang
Wang Hui
Chen Yen-Wei
VLM
MedIm
46
0
0
06 May 2025
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
Dung Tien Nguyen
Minh Khoi Ho
Huy Ta
T. Nguyen
Qi Chen
...
Zhibin Liao
Minh Nguyen Nhat To
Johan W. Verjans
Phi Le Nguyen
Vu Minh Hieu Phan
29
0
0
30 Apr 2025
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
Jindong Li
Yong Li
Yali Fu
Jiahong Liu
Yixin Liu
Menglin Yang
Irwin King
VLM
41
0
0
19 Apr 2025
ProgRoCC: A Progressive Approach to Rough Crowd Counting
ProgRoCC: A Progressive Approach to Rough Crowd Counting
Shengqin Jiang
Linfei Li
Haokui Zhang
Qingshan Liu
Amin Beheshti
Jian Yang
Anton van den Hengel
Quan Z. Sheng
Yuankai Qi
30
0
0
18 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
103
1
0
17 Apr 2025
FedEPA: Enhancing Personalization and Modality Alignment in Multimodal Federated Learning
FedEPA: Enhancing Personalization and Modality Alignment in Multimodal Federated Learning
Yu Zhang
Qingfeng Du
Jiaqi Lv
31
0
0
16 Apr 2025
On the Value of Cross-Modal Misalignment in Multimodal Representation Learning
On the Value of Cross-Modal Misalignment in Multimodal Representation Learning
Yichao Cai
Yuhang Liu
Erdun Gao
Tianjiao Jiang
Zhen Zhang
Anton van den Hengel
Javen Qinfeng Shi
62
0
0
14 Apr 2025
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability
Jonggwon Park
Soobum Kim
Byungmu Yoon
Kyoyun Choi
MedIm
38
0
0
10 Apr 2025
A Lightweight Large Vision-language Model for Multimodal Medical Images
A Lightweight Large Vision-language Model for Multimodal Medical Images
Belal Alsinglawi
Chris McCarthy
Sara Webb
Christopher Fluke
Navid Toosy Saidy
LM&MA
47
0
0
08 Apr 2025
A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?
A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?
Julio Silva-Rodríguez
Jose Dolz
Ismail ben Ayed
VLM
MedIm
38
0
0
07 Apr 2025
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
Junjie Wu
Jiangtao Xie
Zhaolin Zhang
Qilong Wang
Q. Hu
P. Li
Sen Xu
VLM
50
0
0
02 Apr 2025
iMedImage Technical Report
iMedImage Technical Report
Ran Wei
ZhiXiong Lan
Qing Yan
Ning Song
Ming Lv
LongQing Ye
54
0
0
27 Mar 2025
Keyword-Oriented Multimodal Modeling for Euphemism Identification
Keyword-Oriented Multimodal Modeling for Euphemism Identification
Yuxue Hu
Junsong Li
Meixuan Chen
Dongyu Su
Tongguan Wang
Ying Sha
33
0
0
27 Mar 2025
CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention
CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention
Yaxiong Chen
Minghong Wei
Zixuan Zheng
Jingliang Hu
Yilei Shi
Shengwu Xiong
Xiao Xiang Zhu
Lichao Mou
MedIm
48
0
0
20 Mar 2025
A Causality-Inspired Model for Intima-Media Thickening Assessment in Ultrasound Videos
A Causality-Inspired Model for Intima-Media Thickening Assessment in Ultrasound Videos
Shuo Gao
Jingyang Zhang
Jun Xue
Meng Yang
Yiran Chen
Guangquan Zhou
CML
71
0
0
16 Mar 2025
Modeling Variants of Prompts for Vision-Language Models
Ao Li
Zongfang Liu
Xinhua Li
Jinghui Zhang
Pengwei Wang
Hu Wang
VLM
56
0
0
13 Mar 2025
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
M. Rahaman
Ewan K. A. Millar
Erik H. W. Meijering
VLM
64
0
0
13 Mar 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
57
0
0
10 Mar 2025
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
Bangyan Li
Wenxuan Huang
Yunhang Shen
Yansen Wang
Shaohui Lin
...
Ling You
Yinqi Zhang
Ke Li
Xing Sun
Y. Sun
61
2
0
10 Mar 2025
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Junhyun Park
Chanyu Moon
Donghwan Lee
Kyungsu Kim
Minho Hwang
VLM
MedIm
59
0
0
03 Mar 2025
A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data
Elham Ghelichkhan
Tolga Tasdizen
38
0
0
02 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
56
1
0
02 Mar 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
89
2
0
24 Feb 2025
Object-centric Binding in Contrastive Language-Image Pretraining
Object-centric Binding in Contrastive Language-Image Pretraining
Rim Assouel
Pietro Astolfi
Florian Bordes
M. Drozdzal
Adriana Romero Soriano
OCL
VLM
CoGe
103
0
0
19 Feb 2025
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Lukas Buess
Matthias Keicher
Nassir Navab
Andreas Maier
Soroosh Tayebi Arasteh
LM&MA
129
0
0
13 Feb 2025
Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning
Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning
Georgios Margaritis
Periklis Petridis
Dimitris Bertsimas
66
0
0
04 Feb 2025
VisTA: Vision-Text Alignment Model with Contrastive Learning using Multimodal Data for Evidence-Driven, Reliable, and Explainable Alzheimer's Disease Diagnosis
VisTA: Vision-Text Alignment Model with Contrastive Learning using Multimodal Data for Evidence-Driven, Reliable, and Explainable Alzheimer's Disease Diagnosis
Duy-Cat Can
Linh D. Dang
Quang-Huy Tang
Dang Minh Ly
Huong Ha
Guillaume Blanc
Oliver Y. Chén
Binh T. Nguyen
69
1
0
03 Feb 2025
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models
Jakob Krogh Petersen
Valdemar Licht
Mads Nielsen
Asbjørn Munk
VLM
35
0
0
23 Jan 2025
MedFILIP: Medical Fine-grained Language-Image Pre-training
MedFILIP: Medical Fine-grained Language-Image Pre-training
Xinjie Liang
Xiangyu Li
Fanding Li
Jie Jiang
Qing Dong
Wei Wang
Kaidi Wang
Suyu Dong
Gongning Luo
Shuo Li
LM&MA
VLM
MedIm
66
3
0
18 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
18
0
17 Jan 2025
MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis
MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis
Sadia Kamal
Tim Oates
MedIm
44
0
0
12 Jan 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
154
205
0
10 Jan 2025
Gaussian Masked Autoencoders
Gaussian Masked Autoencoders
Jathushan Rajasegaran
Xinlei Chen
Rulilong Li
Christoph Feichtenhofer
Jitendra Malik
Shiry Ginosar
3DGS
45
1
0
06 Jan 2025
I0T: Embedding Standardization Method Towards Zero Modality Gap
I0T: Embedding Standardization Method Towards Zero Modality Gap
Na Min An
Eunki Kim
James Thorne
Hyunjung Shim
VLM
72
1
0
18 Dec 2024
Language-guided Medical Image Segmentation with Target-informed Multi-level Contrastive Alignments
Language-guided Medical Image Segmentation with Target-informed Multi-level Contrastive Alignments
Mingjian Li
Mingyuan Meng
Shuchang Ye
David Dagan Feng
Lei Bi
Jinman Kim
84
0
0
18 Dec 2024
Optimized two-stage AI-based Neural Decoding for Enhanced Visual
  Stimulus Reconstruction from fMRI Data
Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data
Lorenzo Veronese
Andrea Moglia
Luca Mainardi
Pietro Cerveri
DiffM
66
0
0
17 Dec 2024
NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical
  Text
NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical Text
Prajwal Kailas
Max Homilius
Rahul C. Deo
Calum A. MacRae
104
0
0
16 Dec 2024
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Andreas Koukounas
Georgios Mastrapas
Bo Wang
Mohammad Kalim Akram
Sedigheh Eslami
Michael Gunther
Isabelle Mohr
Saba Sturua
Scott Martens
Nan Wang
VLM
110
7
0
11 Dec 2024
DiffCLIP: Few-shot Language-driven Multimodal Classifier
DiffCLIP: Few-shot Language-driven Multimodal Classifier
Jiaqing Zhang
Mingxiang Cao
Xue Yang
Kai Jiang
Yunsong Li
VLM
82
0
0
10 Dec 2024
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance
Chu Myaet Thwal
Ye Lin Tun
Minh N. H. Nguyen
Eui-nam Huh
Choong Seon Hong
VLM
74
0
0
05 Dec 2024
Medical Multimodal Foundation Models in Clinical Diagnosis and
  Treatment: Applications, Challenges, and Future Directions
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
76
2
0
03 Dec 2024
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
88
0
0
30 Nov 2024
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
134
4
0
28 Nov 2024
Semantic Shield: Defending Vision-Language Models Against Backdooring
  and Poisoning via Fine-grained Knowledge Alignment
Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment
Alvi Md Ishmam
Christopher Thomas
AAML
121
3
0
23 Nov 2024
Uni-Mlip: Unified Self-supervision for Medical Vision Language
  Pre-training
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
Ameera Bawazir
Kebin Wu
Wenbin Li
CLIP
77
1
0
20 Nov 2024
TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
Ranmin Wang
Limin Zhuang
Hongkun Chen
Boyan Xu
Ruichu Cai
44
0
0
18 Nov 2024
HIST-AID: Leveraging Historical Patient Reports for Enhanced Multi-Modal
  Automatic Diagnosis
HIST-AID: Leveraging Historical Patient Reports for Enhanced Multi-Modal Automatic Diagnosis
Haoxu Huang
Cem M. Deniz
K. Cho
S. Chopra
Divyam Madaan
37
1
0
16 Nov 2024
Cross- and Intra-image Prototypical Learning for Multi-label Disease Diagnosis and Interpretation
Cross- and Intra-image Prototypical Learning for Multi-label Disease Diagnosis and Interpretation
Chong Wang
Fengbei Liu
Yuanhong Chen
Helen Frazer
Gustavo Carneiro
32
2
0
07 Nov 2024
123456789
Next