2103.04537
Multimodal Representation Learning via Maximization of Local Mutual Information
8 March 2021
Ruizhi Liao
Daniel Moyer
Miriam Cha
Keegan Quigley
Seth Berkowitz
Steven Horng
Polina Golland
W. Wells
SSL
Papers citing "Multimodal Representation Learning via Maximization of Local Mutual Information"
25 / 25 papers shown
CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation
Junchen Fu
Yongxin Ni
J. Jose
Ioannis Arapakis
Kaiwen Zheng
Y. Li
Xuri Ge
34
0
0
14 Apr 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
57
0
0
10 Mar 2025
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing
Jingyi Yang
Xun Lin
Zitong Yu
Li Zhang
X. Liu
Hui Li
Xiaochen Yuan
Xiaochun Cao
CVBM
41
0
0
01 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
154
205
0
10 Jan 2025
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
76
2
0
03 Dec 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Muller
Georgios Kaissis
Daniel Rueckert
35
5
0
24 Apr 2024
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
31
43
0
12 Dec 2023
Medical Vision Language Pretraining: A survey
Prashant Shrestha
Sanskar Amgain
Bidur Khanal
Cristian A. Linte
Binod Bhattarai
VLM
34
14
0
11 Dec 2023
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
40
34
0
12 Oct 2023
Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation
Md Golam Moula Mehedi Hasan
Nasser M. Nasrabadi
CVBM
18
2
0
13 Aug 2023
A scoping review on multimodal deep learning in biomedical images and texts
Zhaoyi Sun
Mingquan Lin
Qingqing Zhu
Qianqian Xie
Fei-Yue Wang
Zhiyong Lu
Yifan Peng
31
18
0
14 Jul 2023
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jinwoo Shin
VLM
CLIP
44
21
0
23 May 2023
Sample-Specific Debiasing for Better Image-Text Models
Peiqi Wang
Yingcheng Liu
Ching-Yun Ko
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
MedIm
22
1
0
25 Apr 2023
Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime
Rhydian Windsor
A. Jamaludin
T. Kadir
Andrew Zisserman
VLM
27
11
0
30 Mar 2023
Advancing Radiograph Representation Learning with Masked Record Modeling
Hong-Yu Zhou
Chenyu Lian
Lian-cheng Wang
Yizhou Yu
MedIm
38
55
0
30 Jan 2023
Using Multiple Instance Learning to Build Multimodal Representations
Peiqi Wang
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
24
6
0
11 Dec 2022
The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Philip Muller
Georgios Kaissis
Daniel Rueckert
MedIm
24
7
0
14 Nov 2022
That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data
Denis Jered McInerney
Geoffrey S. Young
Jan-Willem van de Meent
Byron C. Wallace
18
0
0
12 Oct 2022
RadTex: Learning Efficient Radiograph Representations from Text Reports
Keegan Quigley
Miriam Cha
Ruizhi Liao
Geeticka Chauhan
Steven Horng
Seth Berkowitz
Polina Golland
MedIm
25
3
0
05 Aug 2022
Context-sensitive neocortical neurons transform the effectiveness and efficiency of neural information processing
Ahsan Adeel
Mario Franco
Mohsin Raza
K. Ahmed
31
9
0
15 Jul 2022
Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing
Benedikt Boecking
Naoto Usuyama
Shruthi Bannur
Daniel Coelho De Castro
Anton Schwaighofer
...
Tristan Naumann
A. Nori
Javier Alvarez-Valle
Hoifung Poon
Ozan Oktay
21
231
0
21 Apr 2022
FlexR: Few-shot Classification with Language Embeddings for Structured Reporting of Chest X-rays
Matthias Keicher
Kamilia Zaripova
Tobias Czempiel
Kristina Mach
Ashkan Khakzar
Nassir Navab
MedIm
30
11
0
29 Mar 2022
Joint Learning of Localized Representations from Medical Images and Reports
Philip Muller
Georgios Kaissis
Congyu Zou
Daniel Rueckert
140
81
0
06 Dec 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
37
163
0
11 Oct 2021
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
32
731
0
02 Oct 2020