Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.08967
Cited By
PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning
13 March 2024
Qifeng Zhou
Wenliang Zhong
Yuzhi Guo
Michael Xiao
Hehuan Ma
Junzhou Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning"
8 / 8 papers shown
Title
Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Yuan Zhang
Xinfeng Zhang
Xiaoming Qi Xinyu Wu
Feng Chen
Guanyu Yang
Huazhu Fu
MedIm
LM&MA
AI4CE
22
0
0
16 May 2025
CLIP-IT: CLIP-based Pairing for Histology Images Classification
Banafsheh Karimian
Giulia Avanzato
Soufian Belharbi
Luke McCaffrey
Mohammadhadi Shateri
Eric Granger
VLM
51
0
0
22 Apr 2025
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
Vishwesh Ramanathan
Tony Xu
Pushpak Pati
Faruk Ahmed
Maged Goubran
Anne L. Martel
48
0
0
21 Mar 2025
PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation
Faruk Ahmed
Lin Yang
Tiam Jaroensri
Andrew Sellergren
Yossi Matias
...
Shruthi Prabhakara
Yun-Hui Liu
Daniel Golden
Ellery Wulczyn
David F. Steiner
VLM
47
1
0
14 Feb 2025
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology
Fatemeh Ghezloo
M. S. Seyfioglu
Rustin Soraki
Wisdom O. Ikezogwo
Beibin Li
Tejoram Vivekanandan
J. Elmore
Ranjay Krishna
Linda G. Shapiro
100
4
0
13 Feb 2025
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models
Dibaloke Chanda
Milan Aryal
Nasim Yahya Soltani
Masoud Ganji
AI4CE
VLM
44
7
0
23 Aug 2024
PathAlign: A vision-language model for whole slide images in histopathology
Faruk Ahmed
Andrew Sellergren
Lin Yang
Shawn Xu
Boris Babenko
...
S. Shetty
Daniel Golden
Yun-Hui Liu
David F. Steiner
Ellery Wulczyn
LM&MA
VLM
36
15
0
27 Jun 2024
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
290
4,261
0
30 Jan 2023
1