Multimodal Representation Learning via Maximization of Local Mutual Information

8 March 2021

Papers citing "Multimodal Representation Learning via Maximization of Local Mutual Information"

25 / 25 papers shown

Title
CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation Junchen Fu Yongxin Ni J. Jose Ioannis Arapakis Kaiwen Zheng Y. Li Xuri Ge 34 0 0 14 Apr 2025
Anatomy-Aware Conditional Image-Text Retrieval Meng Zheng Jiajin Zhang Benjamin Planche Zhongpai Gao Terrence Chen Ziyan Wu MedIm 57 0 0 10 Mar 2025
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing Jingyi Yang Xun Lin Zitong Yu Li Zhang X. Liu Hui Li Xiaochen Yuan Xiaochun Cao CVBM 41 0 0 01 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs Sheng Zhang Yanbo Xu Naoto Usuyama Hanwen Xu J. Bagga ... Carlo Bifulco M. Lungren Tristan Naumann Sheng Wang Hoifung Poon LM&MA MedIm 154 205 0 10 Jan 2025
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions Kai Sun Siyan Xue F. Sun Haoran Sun Yu-Juan Luo ... Xinzhou Wang Lei Yang Shuo Jin Jun Yan Jiahong Dong AI4CE 76 2 0 03 Dec 2024
ChEX: Interactive Localization and Region Description in Chest X-rays Philip Muller Georgios Kaissis Daniel Rueckert 35 5 0 24 Apr 2024
CLIP in Medical Imaging: A Comprehensive Survey Zihao Zhao Yuxiao Liu Han Wu Yonghao Li Sheng Wang L. Teng Disheng Liu Zhiming Cui Qian Wang Dinggang Shen CLIP MedIm LM&MA VLM 31 43 0 12 Dec 2023
Medical Vision Language Pretraining: A survey Prashant Shrestha Sanskar Amgain Bidur Khanal Cristian A. Linte Binod Bhattarai VLM 34 14 0 11 Dec 2023
Multimodal Variational Auto-encoder based Audio-Visual Segmentation Yuxin Mao Jing Zhang Mochu Xiang Yiran Zhong Yuchao Dai 40 34 0 12 Oct 2023
Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation Md Golam Moula Mehedi Hasan Nasser M. Nasrabadi CVBM 18 2 0 13 Aug 2023
A scoping review on multimodal deep learning in biomedical images and texts Zhaoyi Sun Mingquan Lin Qingqing Zhu Qianqian Xie Fei-Yue Wang Zhiyong Lu Yifan Peng 31 18 0 14 Jul 2023
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions Sangwoo Mo Minkyu Kim Kyungmin Lee Jinwoo Shin VLM CLIP 44 21 0 23 May 2023
Sample-Specific Debiasing for Better Image-Text Models Peiqi Wang Yingcheng Liu Ching-Yun Ko W. Wells Seth Berkowitz Steven Horng Polina Golland SSL MedIm 22 1 0 25 Apr 2023
Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime Rhydian Windsor A. Jamaludin T. Kadir Andrew Zisserman VLM 27 11 0 30 Mar 2023
Advancing Radiograph Representation Learning with Masked Record Modeling Hong-Yu Zhou Chenyu Lian Lian-cheng Wang Yizhou Yu MedIm 38 55 0 30 Jan 2023
Using Multiple Instance Learning to Build Multimodal Representations Peiqi Wang W. Wells Seth Berkowitz Steven Horng Polina Golland SSL 24 6 0 11 Dec 2022
The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images Philip Muller Georgios Kaissis Daniel Rueckert MedIm 24 7 0 14 Nov 2022
That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data Denis Jered McInerney Geoffrey S. Young Jan-Willem van de Meent Byron C. Wallace 18 0 0 12 Oct 2022
RadTex: Learning Efficient Radiograph Representations from Text Reports Keegan Quigley Miriam Cha Ruizhi Liao Geeticka Chauhan Steven Horng Seth Berkowitz Polina Golland MedIm 25 3 0 05 Aug 2022
Context-sensitive neocortical neurons transform the effectiveness and efficiency of neural information processing Ahsan Adeel Mario Franco Mohsin Raza K. Ahmed 31 9 0 15 Jul 2022
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing Benedikt Boecking Naoto Usuyama Shruthi Bannur Daniel Coelho De Castro Anton Schwaighofer ... Tristan Naumann A. Nori Javier Alvarez-Valle Hoifung Poon Ozan Oktay 21 231 0 21 Apr 2022
FlexR: Few-shot Classification with Language Embeddings for Structured Reporting of Chest X-rays Matthias Keicher Kamilia Zaripova Tobias Czempiel Kristina Mach Ashkan Khakzar Nassir Navab MedIm 30 11 0 29 Mar 2022
Joint Learning of Localized Representations from Medical Images and Reports Philipp Muller Georgios Kaissis Cong Zou Daniel Munich 140 81 0 06 Dec 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey Benyou Wang Qianqian Xie Jiahuan Pei Zhihong Chen Prayag Tiwari Zhao Li Jie Fu LM&MA AI4CE 37 163 0 11 Oct 2021
Contrastive Learning of Medical Visual Representations from Paired Images and Text Yuhao Zhang Hang Jiang Yasuhide Miura Christopher D. Manning C. Langlotz MedIm 32 731 0 02 Oct 2020