ResearchTrend.AI

Contrastive Learning of Medical Visual Representations from Paired Images and Text

2 October 2020
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
    MedIm

Papers citing "Contrastive Learning of Medical Visual Representations from Paired Images and Text"

50 / 445 papers shown
Title
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Miaosen Zhang
Yixuan Wei
Zhen Xing
Yifei Ma
Zuxuan Wu
...
Zheng-Wei Zhang
Qi Dai
Chong Luo
Xin Geng
Baining Guo
VLM
51
1
0
13 Jun 2024
Zoom and Shift are All You Need
Jiahao Qin
46
2
0
13 Jun 2024
Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
38
3
0
31 May 2024
Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training
Aisha Urooj Khan
John W. Garrett
Tyler Bradshaw
Lonie R. Salkowski
Jiwoong Jeong
Amara Tariq
Imon Banerjee
VLM
32
1
0
30 May 2024
Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training
Jinxia Yang
Bing-Huang Su
Wayne Xin Zhao
Ji-Rong Wen
40
3
0
30 May 2024
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats
Pierre J. Chambon
Jean-Benoit Delbrouck
Thomas Sounack
Shih-Cheng Huang
Zhihong Chen
Maya Varma
Steven QH Truong
Chu The Chuong
Curtis P. Langlotz
LM&MA
45
12
0
29 May 2024
Topological Perspectives on Optimal Multimodal Embedding Spaces
Abdul Aziz
Abdul Rahim
BDL
45
0
0
29 May 2024
It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap
Abrar Fahim
Alex Murphy
Alona Fyshe
VLM
43
4
0
28 May 2024
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals
Rahul Thapa
Bryan He
Magnus Ruud Kjær
Hyatt E. Moore IV
Gauri Ganjoo
Emmanuel Mignot
James Zou
24
12
0
28 May 2024
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale
ZeMing Gong
Austin T. Wang
Joakim Bruslund Haurum
Scott C. Lowe
Graham W. Taylor
Angel X. Chang
39
5
0
27 May 2024
RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports
Jiawei Du
Jia Guo
Weihang Zhang
Shengzhu Yang
Hanruo Liu
Huiqi Li
Ningli Wang
MedIm
VLM
33
6
0
23 May 2024
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography
Shantanu Ghosh
Clare B. Poynton
Shyam Visweswaran
Kayhan Batmanghelich
VLM
37
10
0
20 May 2024
Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis
A. Englebert
Anne-Sophie Collin
O. Cornu
Christophe De Vleeschouwer
34
1
0
14 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CE
MedIm
LM&MA
71
5
0
10 May 2024
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Yaoqin Ye
Junjie Zhang
Hongwei Shi
MedIm
VLM
49
0
0
10 May 2024
EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning
Jingfeng Yao
Xinggang Wang
Yuehao Song
Huangxuan Zhao
Jun Ma
Yajie Chen
Wenyu Liu
Bo Wang
ViT
39
5
0
08 May 2024
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
Zehan Wang
Ziang Zhang
Xize Cheng
Rongjie Huang
Luping Liu
...
Haifeng Huang
Yang Zhao
Tao Jin
Peng Gao
Zhou Zhao
37
9
0
08 May 2024
AAPL: Adding Attributes to Prompt Learning for Vision-Language Models
Gahyeon Kim
Sohee Kim
Seokju Lee
VLM
33
5
0
25 Apr 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Muller
Georgios Kaissis
Daniel Rueckert
35
5
0
24 Apr 2024
CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios
Jingyang Lin
Yingda Xia
Jianpeng Zhang
Ke Yan
Le Lu
Jiebo Luo
Ling Zhang
VLM
LM&MA
MedIm
45
3
0
23 Apr 2024
A review of deep learning-based information fusion techniques for multimodal medical image classification
Yi-Hsuan Li
Mostafa EL HABIB DAHO
Pierre-Henri Conze
Rachid Zeghlache
Hugo Le Boité
R. Tadayoni
B. Cochener
M. Lamard
G. Quellec
38
31
0
23 Apr 2024
Machine Learning Techniques for MRI Data Processing at Expanding Scale
Taro Langner
35
0
0
22 Apr 2024
MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale
Xiaotang Gai
Chenyi Zhou
Jiaxiang Liu
Yang Feng
Jian Wu
Zuo-Qiang Liu
MedIm
36
6
0
18 Apr 2024
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology
Xiao Zhou
Xiaoman Zhang
Chaoyi Wu
Ya Zhang
Weidi Xie
Yanfeng Wang
VLM
35
7
0
15 Apr 2024
Global Contrastive Training for Multimodal Electronic Health Records with Language Supervision
Yingbo Ma
Suraj Kolla
Zhenhong Hu
Dhruv Kaliraman
Victoria Nolan
...
Jeremy A. Balch
Tyler J. Loftus
Parisa Rashidi
A. Bihorac
B. Shickel
AI4TS
33
1
0
10 Apr 2024
Unified Multi-modal Diagnostic Framework with Reconstruction Pre-training and Heterogeneity-combat Tuning
Yupei Zhang
Li Pan
Qiushi Yang
Tan Li
Zhen Chen
31
1
0
09 Apr 2024
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models
Weiwei Cao
Jianpeng Zhang
Yingda Xia
Tony C. W. Mok
Zi Li
X. Ye
Le Lu
Jian Zheng
Yuxing Tang
Ling Zhang
31
1
0
07 Apr 2024
DeViDe: Faceted medical knowledge for improved medical vision-language pre-training
Haozhe Luo
Ziyu Zhou
Corentin Royer
Anjany Sekuboyina
Bjoern H. Menze
VLM
ViT
MedIm
48
7
0
04 Apr 2024
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions
Yuting He
Fuxiang Huang
Xinrui Jiang
Yuxiang Nie
Minghao Wang
Jiguang Wang
Hao Chen
LM&MA
AI4CE
76
27
0
04 Apr 2024
Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Xiaoshuang Huang
Hongxiang Li
Meng Cao
Long Chen
Chenyu You
Dong An
VLM
41
5
0
03 Apr 2024
Continual Learning for Smart City: A Survey
Li Yang
Zhipeng Luo
Shi-sheng Zhang
Fei Teng
Tian-Jie Li
HAI
30
8
0
01 Apr 2024
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization
Anna Kukleva
Fadime Sener
Edoardo Remelli
Bugra Tekin
Eric Sauser
Bernt Schiele
Shugao Ma
VLM
EgoV
45
1
0
28 Mar 2024
Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models
Anees Ur Rehman Hashmi
Dwarikanath Mahapatra
Mohammad Yaqub
VLM
MedIm
23
2
0
27 Mar 2024
Residual-based Language Models are Free Boosters for Biomedical Imaging
Zhixin Lai
Jing Wu
Suiyao Chen
Yucheng Zhou
N. Hovakimyan
MedIm
41
29
0
26 Mar 2024
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning
Chong Ma
Hanqi Jiang
Wenting Chen
Yiwei Li
Zihao Wu
...
Dajiang Zhu
Tuo Zhang
Dinggang Shen
Tianming Liu
Xiang Li
23
0
0
19 Mar 2024
N-Modal Contrastive Losses with Applications to Social Media Data in Trimodal Space
William Theisen
Walter J. Scheirer
34
1
0
18 Mar 2024
Model Reprogramming Outperforms Fine-tuning on Out-of-distribution Data in Text-Image Encoders
Andrew Geng
Pin-Yu Chen
OODD
19
0
0
16 Mar 2024
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study
Chenguang Wang
Ruoxi Jia
Xin Liu
Dawn Song
VLM
29
7
0
15 Mar 2024
Improving Medical Multi-modal Contrastive Learning with Expert Annotations
Yogesh Kumar
Pekka Marttinen
MedIm
VLM
31
10
0
15 Mar 2024
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
Zhixiu Lu
Hailong Li
N. Parikh
Jonathan R. Dillman
Lili He
MedIm
VLM
40
0
0
15 Mar 2024
MeDSLIP: Medical Dual-Stream Language-Image Pre-training with Pathology-Anatomy Semantic Alignment
Wenrui Fan
M. N. I. Suvon
Shuo Zhou
Xianyuan Liu
S. Alabed
V. Osmani
Andrew J Swift
Chen Chen
Haiping Lu
MedIm
VLM
LM&MA
59
3
0
15 Mar 2024
Anatomical Structure-Guided Medical Vision-Language Pre-training
Qingqiu Li
Xiaohan Yan
Jilan Xu
Runtian Yuan
Yuejie Zhang
Rui Feng
Quanli Shen
Xiaobo Zhang
Shujun Wang
42
5
0
14 Mar 2024
CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow
Chenbin Pan
Burhaneddin Yaman
Senem Velipasalar
Liu Ren
57
10
0
13 Mar 2024
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework
Vu Minh Hieu Phan
Yutong Xie
Yuankai Qi
Lingqiao Liu
Liyang Liu
Bowen Zhang
Zhibin Liao
Qi Wu
Minh Nguyen Nhat To
Johan W. Verjans
70
11
0
12 Mar 2024
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
Che Liu
Zhongwei Wan
Ouyang Cheng
Anand Shah
Wenjia Bai
Rossella Arcucci
42
29
0
11 Mar 2024
MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder
Lei Li
Tianfang Zhang
Xinglin Zhang
Jiaqi Liu
Bingqi Ma
Yan-chun Luo
Tao Chen
MedIm
40
0
0
07 Mar 2024
Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples
Philipp J. Rösch
Norbert Oswald
Michaela Geierhos
Jindrich Libovický
44
3
0
05 Mar 2024
Time Weaver: A Conditional Time Series Generation Model
Sai Shankar Narasimhan
Shubhankar Agarwal
Oguzhan Akcin
Sujay Sanghavi
Sandeep P. Chinchali
DiffM
MedIm
33
20
0
05 Mar 2024
Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control
Thong Nguyen
Mariya Hendriksen
Andrew Yates
Maarten de Rijke
48
7
0
27 Feb 2024
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification
Haoran Lai
Qingsong Yao
Zihang Jiang
Rongsheng Wang
Zhiyang He
Xiaodong Tao
S. Kevin Zhou
MedIm
46
12
0
27 Feb 2024