Using Multiple Instance Learning to Build Multimodal Representations

Using Multiple Instance Learning to Build Multimodal Representations

11 December 2022

Papers citing "Using Multiple Instance Learning to Build Multimodal Representations"

17 / 17 papers shown

Title
Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography Yuexi Du John Onofrey Nicha Dvornek VLM 75 2 0 26 Sep 2024
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing Benedikt Boecking Naoto Usuyama Shruthi Bannur Daniel Coelho De Castro Anton Schwaighofer ... Tristan Naumann A. Nori Javier Alvarez-Valle Hoifung Poon Ozan Oktay 60 240 0 21 Apr 2022
Joint Learning of Localized Representations from Medical Images and Reports Philipp Muller Georgios Kaissis Cong Zou Daniel Munich 168 83 0 06 Dec 2021
Multimodal Representation Learning via Maximization of Local Mutual Information Ruizhi Liao Daniel Moyer Miriam Cha Keegan Quigley Seth Berkowitz Steven Horng Polina Golland W. Wells SSL 45 42 0 08 Mar 2021
Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning Bin Li Yin Li K. Eliceiri 70 615 0 17 Nov 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text Yuhao Zhang Hang Jiang Yasuhide Miura Christopher D. Manning C. Langlotz MedIm 118 755 0 02 Oct 2020
Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment Geeticka Chauhan Ruizhi Liao W. Wells Jacob Andreas Xin Wang Seth Berkowitz Steven Horng Peter Szolovits Polina Golland MedIm 63 53 0 22 Aug 2020
Contrastive Learning for Weakly Supervised Phrase Grounding Tanmay Gupta Arash Vahdat Gal Chechik Xiaodong Yang Jan Kautz Derek Hoiem ObjD SSL 107 141 0 17 Jun 2020
VisualBERT: A Simple and Performant Baseline for Vision and Language Liunian Harold Li Mark Yatskar Da Yin Cho-Jui Hsieh Kai-Wei Chang VLM 130 1,948 0 09 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks Jiasen Lu Dhruv Batra Devi Parikh Stefan Lee SSL VLM 217 3,667 0 06 Aug 2019
Stacked Cross Attention for Image-Text Matching Kuang-Huei Lee Xi Chen G. Hua Houdong Hu Xiaodong He 74 1,151 0 21 Mar 2018
Non-local Neural Networks Xinyu Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 265 8,888 0 21 Nov 2017
Classifying and Segmenting Microscopy Images Using Convolutional Multiple Instance Learning Oren Z. Kraus Lei Jimmy Ba B. Frey 180 391 0 17 Nov 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions A. Karpathy Li Fei-Fei 95 5,578 0 07 Dec 2014
From Captions to Visual Concepts and Back Hao Fang Saurabh Gupta F. Iandola R. Srivastava Li Deng ... Xiaodong He Margaret Mitchell John C. Platt C. L. Zitnick Geoffrey Zweig VLM 85 1,309 0 18 Nov 2014
Deep Fragment Embeddings for Bidirectional Image Sentence Mapping A. Karpathy Armand Joulin Li Fei-Fei VLM 85 936 0 22 Jun 2014
Multi-Instance Multi-Label Learning Zhi Zhou Min-Ling Zhang Sheng-Jun Huang Yu-Feng Li 95 416 0 24 Aug 2008