ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.04537
  4. Cited By
Multimodal Representation Learning via Maximization of Local Mutual
  Information

Multimodal Representation Learning via Maximization of Local Mutual Information

8 March 2021
Ruizhi Liao
Daniel Moyer
Miriam Cha
Keegan Quigley
Seth Berkowitz
Steven Horng
Polina Golland
W. Wells
    SSL
ArXivPDFHTML

Papers citing "Multimodal Representation Learning via Maximization of Local Mutual Information"

25 / 25 papers shown
Title
CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation
CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation
Junchen Fu
Yongxin Ni
J. Jose
Ioannis Arapakis
Kaiwen Zheng
Y. Li
Xuri Ge
34
0
0
14 Apr 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
57
0
0
10 Mar 2025
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing
Jingyi Yang
Xun Lin
Zitong Yu
Li Zhang
X. Liu
Hui Li
Xiaochen Yuan
Xiaochun Cao
CVBM
41
0
0
01 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
154
205
0
10 Jan 2025
Medical Multimodal Foundation Models in Clinical Diagnosis and
  Treatment: Applications, Challenges, and Future Directions
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
76
2
0
03 Dec 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Muller
Georgios Kaissis
Daniel Rueckert
35
5
0
24 Apr 2024
CLIP in Medical Imaging: A Comprehensive Survey
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
28
43
0
12 Dec 2023
Medical Vision Language Pretraining: A survey
Medical Vision Language Pretraining: A survey
Prashant Shrestha
Sanskar Amgain
Bidur Khanal
Cristian A. Linte
Binod Bhattarai
VLM
34
14
0
11 Dec 2023
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
40
34
0
12 Oct 2023
Improving Face Recognition from Caption Supervision with Multi-Granular
  Contextual Feature Aggregation
Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation
Md Golam Moula Mehedi Hasan
Nasser M. Nasrabadi
CVBM
16
2
0
13 Aug 2023
A scoping review on multimodal deep learning in biomedical images and
  texts
A scoping review on multimodal deep learning in biomedical images and texts
Zhaoyi Sun
Mingquan Lin
Qingqing Zhu
Qianqian Xie
Fei-Yue Wang
Zhiyong Lu
Yifan Peng
31
18
0
14 Jul 2023
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist
  Captions
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jinwoo Shin
VLM
CLIP
44
21
0
23 May 2023
Sample-Specific Debiasing for Better Image-Text Models
Sample-Specific Debiasing for Better Image-Text Models
Peiqi Wang
Yingcheng Liu
Ching-Yun Ko
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
MedIm
22
1
0
25 Apr 2023
Vision-Language Modelling For Radiological Imaging and Reports In The
  Low Data Regime
Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime
Rhydian Windsor
A. Jamaludin
T. Kadir
Andrew Zisserman
VLM
27
11
0
30 Mar 2023
Advancing Radiograph Representation Learning with Masked Record Modeling
Advancing Radiograph Representation Learning with Masked Record Modeling
Hong-Yu Zhou
Chenyu Lian
Lian-cheng Wang
Yizhou Yu
MedIm
38
55
0
30 Jan 2023
Using Multiple Instance Learning to Build Multimodal Representations
Using Multiple Instance Learning to Build Multimodal Representations
Peiqi Wang
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
24
6
0
11 Dec 2022
The Role of Local Alignment and Uniformity in Image-Text Contrastive
  Learning on Medical Images
The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Philip Muller
Georgios Kaissis
Daniel Rueckert
MedIm
24
7
0
14 Nov 2022
That's the Wrong Lung! Evaluating and Improving the Interpretability of
  Unsupervised Multimodal Encoders for Medical Data
That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data
Denis Jered McInerney
Geoffrey S. Young
Jan-Willem van de Meent
Byron C. Wallace
18
0
0
12 Oct 2022
RadTex: Learning Efficient Radiograph Representations from Text Reports
RadTex: Learning Efficient Radiograph Representations from Text Reports
Keegan Quigley
Miriam Cha
Ruizhi Liao
Geeticka Chauhan
Steven Horng
Seth Berkowitz
Polina Golland
MedIm
25
3
0
05 Aug 2022
Context-sensitive neocortical neurons transform the effectiveness and
  efficiency of neural information processing
Context-sensitive neocortical neurons transform the effectiveness and efficiency of neural information processing
Ahsan Adeel
Mario Franco
Mohsin Raza
K. Ahmed
31
9
0
15 Jul 2022
Making the Most of Text Semantics to Improve Biomedical Vision--Language
  Processing
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing
Benedikt Boecking
Naoto Usuyama
Shruthi Bannur
Daniel Coelho De Castro
Anton Schwaighofer
...
Tristan Naumann
A. Nori
Javier Alvarez-Valle
Hoifung Poon
Ozan Oktay
21
231
0
21 Apr 2022
FlexR: Few-shot Classification with Language Embeddings for Structured
  Reporting of Chest X-rays
FlexR: Few-shot Classification with Language Embeddings for Structured Reporting of Chest X-rays
Matthias Keicher
Kamilia Zaripova
Tobias Czempiel
Kristina Mach
Ashkan Khakzar
Nassir Navab
MedIm
27
11
0
29 Mar 2022
Joint Learning of Localized Representations from Medical Images and
  Reports
Joint Learning of Localized Representations from Medical Images and Reports
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
137
81
0
06 Dec 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
37
163
0
11 Oct 2021
Contrastive Learning of Medical Visual Representations from Paired
  Images and Text
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
32
731
0
02 Oct 2020
1