Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.05561
Cited By
Using Multiple Instance Learning to Build Multimodal Representations
11 December 2022
Peiqi Wang
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Using Multiple Instance Learning to Build Multimodal Representations"
17 / 17 papers shown
Title
Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography
Yuexi Du
John Onofrey
Nicha Dvornek
VLM
75
2
0
26 Sep 2024
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing
Benedikt Boecking
Naoto Usuyama
Shruthi Bannur
Daniel Coelho De Castro
Anton Schwaighofer
...
Tristan Naumann
A. Nori
Javier Alvarez-Valle
Hoifung Poon
Ozan Oktay
60
240
0
21 Apr 2022
Joint Learning of Localized Representations from Medical Images and Reports
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
168
83
0
06 Dec 2021
Multimodal Representation Learning via Maximization of Local Mutual Information
Ruizhi Liao
Daniel Moyer
Miriam Cha
Keegan Quigley
Seth Berkowitz
Steven Horng
Polina Golland
W. Wells
SSL
45
42
0
08 Mar 2021
Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning
Bin Li
Yin Li
K. Eliceiri
70
615
0
17 Nov 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
118
755
0
02 Oct 2020
Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment
Geeticka Chauhan
Ruizhi Liao
W. Wells
Jacob Andreas
Xin Wang
Seth Berkowitz
Steven Horng
Peter Szolovits
Polina Golland
MedIm
63
53
0
22 Aug 2020
Contrastive Learning for Weakly Supervised Phrase Grounding
Tanmay Gupta
Arash Vahdat
Gal Chechik
Xiaodong Yang
Jan Kautz
Derek Hoiem
ObjD
SSL
107
141
0
17 Jun 2020
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
130
1,948
0
09 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
217
3,667
0
06 Aug 2019
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
74
1,151
0
21 Mar 2018
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
265
8,888
0
21 Nov 2017
Classifying and Segmenting Microscopy Images Using Convolutional Multiple Instance Learning
Oren Z. Kraus
Lei Jimmy Ba
B. Frey
180
391
0
17 Nov 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
95
5,578
0
07 Dec 2014
From Captions to Visual Concepts and Back
Hao Fang
Saurabh Gupta
F. Iandola
R. Srivastava
Li Deng
...
Xiaodong He
Margaret Mitchell
John C. Platt
C. L. Zitnick
Geoffrey Zweig
VLM
85
1,309
0
18 Nov 2014
Deep Fragment Embeddings for Bidirectional Image Sentence Mapping
A. Karpathy
Armand Joulin
Li Fei-Fei
VLM
85
936
0
22 Jun 2014
Multi-Instance Multi-Label Learning
Zhi Zhou
Min-Ling Zhang
Sheng-Jun Huang
Yu-Feng Li
95
416
0
24 Aug 2008
1