Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.19578
Cited By
PathAlign: A vision-language model for whole slide images in histopathology
27 June 2024
Faruk Ahmed
Andrew Sellergren
Lin Yang
Shawn Xu
Boris Babenko
Abbi Ward
Niels Olson
Arash Mohtashamian
Yossi Matias
Greg S. Corrado
Quang Duong
D. Webster
S. Shetty
Daniel Golden
Yun-Hui Liu
David F. Steiner
Ellery Wulczyn
LM&MA
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PathAlign: A vision-language model for whole slide images in histopathology"
33 / 33 papers shown
Title
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
Tianyi Wang
Jianan Fan
Dingxin Zhang
Dongnan Liu
Yong-quan Xia
Heng Huang
Weidong Cai
134
0
0
01 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
227
232
0
10 Jan 2025
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
Ying Chen
Guoan Wang
Yuanfeng Ji
Yanjun Li
Jin Ye
Tianbin Li
Bin Zhang
Nana Pei
Rongshan Yu
Yu Qiao
VLM
LM&MA
98
5
0
15 Oct 2024
Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology
Pei Liu
Luping Ji
Jiaxiang Gou
Bo Fu
Mao Ye
173
2
0
14 Sep 2024
Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology
Andrew H. Song
Richard J. Chen
Tong Ding
Drew F. K. Williamson
Guillaume Jaume
Faisal Mahmood
MedIm
90
32
0
19 May 2024
Transcriptomics-guided Slide Representation Learning in Computational Pathology
Guillaume Jaume
Lukas Oldenburg
Anurag J. Vaidya
Richard J. Chen
Drew F. K. Williamson
Thomas Peeters
Andrew H. Song
Faisal Mahmood
100
29
0
19 May 2024
PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
George Shaikovski
Adam Casson
Kristen Severson
Eric Zimmermann
Yi Kan Wang
...
Peter Hamilton
William A. Moye
Eugene Vorontsov
Siqi Liu
Thomas J. Fuchs
MedIm
65
34
0
16 May 2024
PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning
Qifeng Zhou
Wenliang Zhong
Yuzhi Guo
Michael Xiao
Hehuan Ma
Junzhou Huang
66
11
0
13 Mar 2024
A self-supervised framework for learning whole slide representations
X. Hou
Cheng Jiang
A. Kondepudi
Yiwei Lyu
Asadur Chowdury
Honglak Lee
Todd C. Hollon
MedIm
56
5
0
09 Feb 2024
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
Yuxuan Sun
Hao Wu
Chenglu Zhu
Sunyi Zheng
Qizi Chen
...
Mengyue Zheng
Jingxiong Li
Xinheng Lyu
Tao Lin
Lin Yang
LM&MA
76
18
0
29 Jan 2024
Domain-specific optimization and diverse evaluation of self-supervised models for histopathology
Jeremy Lai
Faruk Ahmed
Supriya Vijay
Tiam Jaroensri
Jessica Loo
...
Jonathan Krause
Yun-Hui Liu
Po-Hsuan Cameron Chen
Ellery Wulczyn
David F. Steiner
61
7
0
20 Oct 2023
ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders
Shawn Xu
Ling Yang
Christopher J. Kelly
M. Sieniek
Timo Kohlberger
...
Shruthi Prabhakara
Daniel Golden
Rory Pilgrim
Krish Eswaran
Andrew Sellergren
LM&MA
MedIm
69
55
0
02 Aug 2023
Text-guided Foundation Model Adaptation for Pathological Image Classification
Yunkun Zhang
Jinglei Gao
Mu Zhou
Xiaosong Wang
Yu Qiao
Shaoting Zhang
Dequan Wang
MedIm
55
49
0
27 Jul 2023
Towards a Visual-Language Foundation Model for Computational Pathology
Ming Y. Lu
Bowen Chen
Drew F. K. Williamson
Richard J. Chen
Ivy Liang
...
Andrew Zhang
L. Le
Georg Gerber
Anil V. Parwani
Faisal Mahmood
VLM
MedIm
83
46
0
24 Jul 2023
Quilt-1M: One Million Image-Text Pairs for Histopathology
Wisdom O. Ikezogwo
M. S. Seyfioglu
Fatemeh Ghezloo
Dylan Stefan Chan Geva
Fatwir Sheikh Mohammed
Pavan Kumar Anand
Ranjay Krishna
Linda G. Shapiro
CLIP
VLM
286
125
0
20 Jun 2023
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
Ming Y. Lu
Bowen Chen
Andrew Zhang
Drew F. K. Williamson
Richard J. Chen
Tong Ding
L. Le
Yung-Sung Chuang
Faisal Mahmood
VLM
MedIm
185
102
0
13 Jun 2023
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology
Yuxuan Sun
Chenglu Zhu
S. Zheng
Kai Zhang
Xiaoxuan Yu
Zhongyi Shui
Yunlong Zhang
Honglin Li
Lin Yang
LM&MA
MedIm
118
49
0
24 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
233
1,204
0
17 May 2023
PLIP: Language-Image Pre-training for Person Representation Learning
Jia-li Zuo
Jiahao Hong
Feng Zhang
Changqian Yu
Hanyu Zhou
Changxin Gao
Nong Sang
Jingdong Wang
VLM
MLLM
89
38
0
15 May 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
569
4,910
0
17 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
429
4,641
0
30 Jan 2023
RandStainNA: Learning Stain-Agnostic Features from Histology Slides by Bridging Stain Augmentation and Normalization
Yiqing Shen
Yulin Luo
Dinggang Shen
Jing Ke
OOD
55
43
0
25 Jun 2022
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
Richard J. Chen
Chengkuan Chen
Yicong Li
Tiffany Y. Chen
A. Trister
Rahul G. Krishnan
Faisal Mahmood
ViT
MedIm
99
426
0
06 Jun 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLM
CLIP
OffRL
167
1,307
0
04 May 2022
Masked Siamese Networks for Label-Efficient Learning
Mahmoud Assran
Mathilde Caron
Ishan Misra
Piotr Bojanowski
Florian Bordes
Pascal Vincent
Armand Joulin
Michael G. Rabbat
Nicolas Ballas
SSL
105
322
0
14 Apr 2022
Inference of captions from histopathological patches
M. Tsuneki
F. Kanavati
73
32
0
07 Feb 2022
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Andreas Steiner
Alexander Kolesnikov
Xiaohua Zhai
Ross Wightman
Jakob Uszkoreit
Lucas Beyer
ViT
116
635
0
18 Jun 2021
Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles
Jevgenij Gamper
Nasir M. Rajpoot
55
64
0
08 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
967
29,810
0
26 Feb 2021
Overcoming the limitations of patch-based learning to detect cancer in whole slide images
Ozan Ciga
Tony Xu
S. Nofech-Mozes
S. Noy
F. Lu
Anne L. Martel
59
41
0
01 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
670
41,430
0
22 Oct 2020
Universal Sentence Encoder
Daniel Cer
Yinfei Yang
Sheng-yi Kong
Nan Hua
Nicole Limtiaco
...
Steve Yuan
Chris Tar
Yun-hsuan Sung
B. Strope
R. Kurzweil
439
1,907
0
29 Mar 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
730
132,363
0
12 Jun 2017
1