ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.21862
  4. Cited By
Towards Scalable Language-Image Pre-training for 3D Medical Imaging

Towards Scalable Language-Image Pre-training for 3D Medical Imaging

28 May 2025
Chenhui Zhao
Yiwei Lyu
Asadur Chowdury
Edward Harake
A. Kondepudi
Akshay Rao
X. Hou
Honglak Lee
Todd C. Hollon
    LM&MAMedIm
ArXiv (abs)PDFHTML

Papers citing "Towards Scalable Language-Image Pre-training for 3D Medical Imaging"

33 / 33 papers shown
Title
3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography
3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography
Weicheng Zhu
Haoxu Huang
Huanze Tang
Rushabh Musthyala
Boyang Yu
...
Seena Dehkharghani
Jennifer A. Frontera
Arjun V. Masurkar
Kara Melmed
N. Razavian
MedIm
61
5
0
04 Feb 2025
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding
Zhongyi Shui
Jianpeng Zhang
Weiwei Cao
Shuaiqiang Wang
Ruizhe Guo
...
Lin Yang
X. Ye
Tingbo Liang
Qi Zhang
Ling Zhang
LM&MAVLM
55
4
0
24 Jan 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MAMedIm
220
232
0
10 Jan 2025
Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis
Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis
Zihang Jiang
Zihang Jiang
Qingsong Yao
Rongsheng Wang
Zhiyang He
Xiaodong Tao
Wei Wei
Weifu Lv
S. Kevin Zhou
42
3
0
08 Jan 2025
HyperSpace: Hypernetworks for spacing-adaptive image segmentation
HyperSpace: Hypernetworks for spacing-adaptive image segmentation
Samuel Joutard
Maximilian Pietsch
Raphael Prevost
64
4
0
04 Jul 2024
Advancing Multimodal Medical Capabilities of Gemini
Advancing Multimodal Medical Capabilities of Gemini
Lin Yang
Shawn Xu
Andrew Sellergren
Timo Kohlberger
Yuchen Zhou
...
David Steiner
Rory Pilgrim
Christopher J. Kelly
Shekoofeh Azizi
Daniel Golden
MedIm
72
66
0
06 May 2024
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from
  X-ray Expert Models
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models
Weiwei Cao
Jianpeng Zhang
Yingda Xia
Tony C. W. Mok
Zi Li
X. Ye
Le Lu
Jian Zheng
Yuxing Tang
Ling Zhang
56
4
0
07 Apr 2024
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language
  Models
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
Fan Bai
Yuxin Du
Tiejun Huang
Max Q.-H. Meng
Bo Zhao
66
42
0
31 Mar 2024
CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training
CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training
Kihyun You
Jawook Gu
Jiyeon Ham
Beomhee Park
Jiho Kim
Eun K. Hong
Woonhyuk Baek
Byungseok Roh
CLIPVLM
61
63
0
20 Oct 2023
Qwen Technical Report
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
264
1,895
0
28 Sep 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
91
184
0
01 Jun 2023
The University of California San Francisco Brain Metastases Stereotactic
  Radiosurgery (UCSF-BMSR) MRI Dataset
The University of California San Francisco Brain Metastases Stereotactic Radiosurgery (UCSF-BMSR) MRI Dataset
J. Rudie
Rachit Saluja
R. Weiss
Pierre Nedelec
Evan Calabrese
...
S. Braunstein
Christopher P. Hess
A. Rauschecker
L. Sugrue
J. Villanueva-Meyer
AI4CE
23
15
0
14 Apr 2023
Adapting Pre-trained Vision Transformers from 2D to 3D through Weight
  Inflation Improves Medical Image Segmentation
Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation
Yuhui Zhang
Shihua Huang
Zhengping Zhou
M. Lungren
Serena Yeung
ViTMedIm
44
10
0
08 Feb 2023
Learning to Exploit Temporal Structure for Biomedical Vision-Language
  Processing
Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing
Shruthi Bannur
Stephanie L. Hyland
Qianchu Liu
Fernando Pérez-García
Maximilian Ilse
...
Maria T. A. Wetscherek
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
74
126
0
11 Jan 2023
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in
  Radiology
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology
Chaoyi Wu
Xiaoman Zhang
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MAVLM
75
120
0
05 Jan 2023
MONAI: An open-source framework for deep learning in healthcare
MONAI: An open-source framework for deep learning in healthcare
M. Jorge Cardoso
Wenqi Li
Richard Brown
Nic Ma
E. Kerfoot
...
Klaus H. Maier-Hein
S. Aylward
Prerna Dogra
Sebastien Ourselin
Andrew Feng
126
505
0
04 Nov 2022
Multi-Granularity Cross-modal Alignment for Generalized Medical Visual
  Representation Learning
Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning
Fuying Wang
Yuyin Zhou
Shujun Wang
V. Vardhanabhuti
Lequan Yu
91
147
0
12 Oct 2022
Making the Most of Text Semantics to Improve Biomedical Vision--Language
  Processing
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing
Benedikt Boecking
Naoto Usuyama
Shruthi Bannur
Daniel Coelho De Castro
Anton Schwaighofer
...
Tristan Naumann
A. Nori
Javier Alvarez-Valle
Hoifung Poon
Ozan Oktay
71
243
0
21 Apr 2022
What Makes Transfer Learning Work For Medical Images: Feature Reuse &
  Other Factors
What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors
Christos Matsoukas
Johan Fredin Haslum
Moein Sorkhei
Magnus P Soderberg
Kevin Smith
VLMOODMedIm
115
88
0
02 Mar 2022
Joint Learning of Localized Representations from Medical Images and
  Reports
Joint Learning of Localized Representations from Medical Images and Reports
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
183
85
0
06 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and
  Detection
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chaoxia Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
153
690
0
02 Dec 2021
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViTTPM
467
7,814
0
11 Nov 2021
The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation
  and Radiogenomic Classification
The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification
Ujjwal Baid
S. Ghodasara
S. Mohan
Michel Bilello
Evan Calabrese
...
M. Weber
A. Mahajan
Bjoern Menze
Adam Flanders
Spyridon Bakas
143
643
0
05 Jul 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
135
1,265
0
22 Apr 2021
Generic Attention-model Explainability for Interpreting Bi-Modal and
  Encoder-Decoder Transformers
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
Hila Chefer
Shir Gur
Lior Wolf
ViT
64
325
0
29 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
463
21,564
0
25 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
967
29,731
0
26 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
667
41,369
0
22 Oct 2020
Contrastive Learning of Medical Visual Representations from Paired
  Images and Text
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
137
766
0
02 Oct 2020
Domain-Specific Language Model Pretraining for Biomedical Natural
  Language Processing
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu Gu
Robert Tinn
Hao Cheng
Michael R. Lucas
Naoto Usuyama
Xiaodong Liu
Tristan Naumann
Jianfeng Gao
Hoifung Poon
LM&MAAI4CE
85
1,781
0
31 Jul 2020
Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
  Chest Computed Tomography Volumes
Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes
R. Draelos
D. Dov
Maciej A. Mazurowski
J. Lo
Ricardo Henao
Geoffrey D. Rubin
Lawrence Carin
74
71
0
12 Feb 2020
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and
  Expert Comparison
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison
Jeremy Irvin
Pranav Rajpurkar
M. Ko
Yifan Yu
Silviana Ciurea-Ilcus
...
D. Larson
C. Langlotz
Bhavik Patel
M. Lungren
A. Ng
112
2,602
0
21 Jan 2019
SGDR: Stochastic Gradient Descent with Warm Restarts
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
341
8,169
0
13 Aug 2016
1