ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.01523
  4. Cited By
A Comparison of Pre-trained Vision-and-Language Models for Multimodal
  Representation Learning across Medical Images and Reports

A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports

3 September 2020
Yikuan Li
Hanyin Wang
Yuan Luo
ArXivPDFHTML

Papers citing "A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports"

14 / 14 papers shown
Title
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching
  for Text Guided Medical Image Segmentation
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation
Wenting Chen
Jie Liu
Yixuan Yuan
VLM
39
3
0
20 May 2023
Towards Unifying Medical Vision-and-Language Pre-training via Soft
  Prompts
Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
Zhihong Chen
Shizhe Diao
Benyou Wang
Guanbin Li
Xiang Wan
MedIm
22
29
0
17 Feb 2023
A Comparative Study of Pretrained Language Models for Long Clinical Text
A Comparative Study of Pretrained Language Models for Long Clinical Text
Yikuan Li
R. M. Wehbe
F. Ahmad
Hanyin Wang
Yuan Luo
LM&MA
ELM
VLM
MedIm
26
79
0
27 Jan 2023
AD-BERT: Using Pre-trained contextualized embeddings to Predict the
  Progression from Mild Cognitive Impairment to Alzheimer's Disease
AD-BERT: Using Pre-trained contextualized embeddings to Predict the Progression from Mild Cognitive Impairment to Alzheimer's Disease
Chengsheng Mao
Jie Xu
Luke Rasmussen
Yikuan Li
P. Adekkanattu
...
R. Vassar
Guoqian Jiang
Fei Wang
Jyotishman Pathak
Yuan Luo
19
6
0
07 Nov 2022
The Ability of Image-Language Explainable Models to Resemble Domain
  Expertise
The Ability of Image-Language Explainable Models to Resemble Domain Expertise
P. Werner
Anna Zapaishchykova
Ujjwal Ratan
48
2
0
19 Sep 2022
MedFuse: Multi-modal fusion with clinical time-series data and chest
  X-ray images
MedFuse: Multi-modal fusion with clinical time-series data and chest X-ray images
Nasir Hayat
Krzysztof J. Geras
Farah E. Shamout
MedIm
24
40
0
14 Jul 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
60
527
0
13 Jun 2022
Multimodal Machine Learning in Precision Health
Multimodal Machine Learning in Precision Health
Adrienne S. Kline
Hanyin Wang
Yikuan Li
Saya Dennis
M. Hutch
Zhenxing Xu
Fei Wang
F. Cheng
Yuan Luo
24
15
0
10 Apr 2022
Multi-Modal Learning Using Physicians Diagnostics for Optical Coherence
  Tomography Classification
Multi-Modal Learning Using Physicians Diagnostics for Optical Coherence Tomography Classification
Yash-yee Logan
Kiran Kokilepersaud
Gukyeong Kwon
Ghassan Al-Regib
C. Wykoff
Hannah J. Yu
19
8
0
20 Mar 2022
Indication as Prior Knowledge for Multimodal Disease Classification in
  Chest Radiographs with Transformers
Indication as Prior Knowledge for Multimodal Disease Classification in Chest Radiographs with Transformers
Grzegorz Jacenków
Alison Q. OÑeil
Sotirios A. Tsaftaris
ViT
MedIm
17
23
0
12 Feb 2022
Machine Learning for Multimodal Electronic Health Records-based
  Research: Challenges and Perspectives
Machine Learning for Multimodal Electronic Health Records-based Research: Challenges and Perspectives
Ziyi Liu
Jiaqi Zhang
Yongshuai Hou
Xinran Zhang
Ge Li
Yang Xiang
19
14
0
09 Nov 2021
Improving Joint Learning of Chest X-Ray and Radiology Report by Word
  Region Alignment
Improving Joint Learning of Chest X-Ray and Radiology Report by Word Region Alignment
Zhanghexuan Ji
Mohammad Abuzar Shaikh
Dana Moukheiber
S. Srihari
Yifan Peng
Mingchen Gao
SSL
14
20
0
04 Sep 2021
Aggregated Residual Transformations for Deep Neural Networks
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,220
0
16 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
152
1,464
0
06 Jun 2016
1