ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11867
  4. Cited By
Overcoming Data Limitation in Medical Visual Question Answering

Overcoming Data Limitation in Medical Visual Question Answering

26 September 2019
Binh Duc Nguyen
Thanh-Toan Do
Binh X. Nguyen
Tuong Khanh Long Do
Erman Tjiputra
Quang-Dieu Tran
    MedIm
ArXivPDFHTML

Papers citing "Overcoming Data Limitation in Medical Visual Question Answering"

50 / 65 papers shown
Title
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks
Wenqi Zeng
Yuqi Sun
Chenxi Ma
Weimin Tan
Bo Yan
LM&MA
VLM
55
0
0
09 May 2025
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Zibo Xu
Qiang Li
Weizhi Nie
Weijie Wang
Anan Liu
CML
MedIm
49
0
0
05 May 2025
AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care
AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care
Md Asaduzzaman Jabin
Hanqi Jiang
Yuchen Li
Patrick Kaggwa
Eugene Douglass
Juliet N. Sekandi
Tianming Liu
LM&MA
76
0
0
01 May 2025
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong-Son Hy
31
1
0
14 Apr 2025
DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels
DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels
Erjian Guo
Zhen Zhao
Zicheng Wang
Tong Chen
Yunyi Liu
Luping Zhou
DiffM
MedIm
72
0
0
24 Mar 2025
MedCoT: Medical Chain of Thought via Hierarchical Expert
MedCoT: Medical Chain of Thought via Hierarchical Expert
Jiaxiang Liu
Yuan Wang
Jiawei Du
Qiufeng Wang
Zuozhu Liu
LRM
93
9
0
18 Dec 2024
Uni-Mlip: Unified Self-supervision for Medical Vision Language
  Pre-training
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
Ameera Bawazir
Kebin Wu
Wenbin Li
CLIP
77
1
0
20 Nov 2024
Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry
Wenjun Hou
Yi Cheng
Kaishuai Xu
Yan Hu
Wenjie Li
Jiang-Dong Liu
40
0
0
17 Nov 2024
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering
Zhilin Zhang
Jie Wang
Zhanghao Qin
Ruiqi Zhu
Xiaoliang Gong
MedIm
57
0
0
28 Oct 2024
R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest
R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest
Xupeng Chen
Zhixin Lai
Kangrui Ruan
Shichu Chen
Jiaxiang Liu
Zuozhu Liu
43
1
0
27 Oct 2024
Which Client is Reliable?: A Reliable and Personalized Prompt-based
  Federated Learning for Medical Image Question Answering
Which Client is Reliable?: A Reliable and Personalized Prompt-based Federated Learning for Medical Image Question Answering
He Zhu
Ren Togo
Takahiro Ogawa
Miki Haseyama
MedIm
31
0
0
23 Oct 2024
Has Multimodal Learning Delivered Universal Intelligence in Healthcare?
  A Comprehensive Survey
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey
Qika Lin
Yifan Zhu
Xin Mei
Ling Huang
Jingying Ma
Kai He
Zhen Peng
Min Zhang
Mengling Feng
49
19
0
23 Aug 2024
Tri-VQA: Triangular Reasoning Medical Visual Question Answering for
  Multi-Attribute Analysis
Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis
Lin Fan
Xun Gong
Cenyang Zheng
Yafei Ou
25
0
0
21 Jun 2024
Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical
  Visual Language Pre-trained Models
Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models
Jiawei Chen
Dingkang Yang
Yue Jiang
Mingcheng Li
Jinjie Wei
Xiaolu Hou
Lihua Zhang
56
6
0
25 Apr 2024
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
Tiancheng Gu
Kaicheng Yang
Dongnan Liu
Weidong Cai
MedIm
41
2
0
19 Apr 2024
MedThink: Explaining Medical Visual Question Answering via Multimodal
  Decision-Making Rationale
MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale
Xiaotang Gai
Chenyi Zhou
Jiaxiang Liu
Yang Feng
Jian Wu
Zuo-Qiang Liu
MedIm
36
6
0
18 Apr 2024
Unified Multi-modal Diagnostic Framework with Reconstruction
  Pre-training and Heterogeneity-combat Tuning
Unified Multi-modal Diagnostic Framework with Reconstruction Pre-training and Heterogeneity-combat Tuning
Yupei Zhang
Li Pan
Qiushi Yang
Tan Li
Zhen Chen
33
1
0
09 Apr 2024
Enhancing Generalization in Medical Visual Question Answering Tasks via
  Gradient-Guided Model Perturbation
Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
Gang Liu
Hongyang Li
Zerui He
Shenjun Zhong
MedIm
25
0
0
05 Mar 2024
Prompt-based Personalized Federated Learning for Medical Visual Question
  Answering
Prompt-based Personalized Federated Learning for Medical Visual Question Answering
He Zhu
Ren Togo
Takahiro Ogawa
Miki Haseyama
MedIm
24
5
0
15 Feb 2024
MultiMedEval: A Benchmark and a Toolkit for Evaluating Medical
  Vision-Language Models
MultiMedEval: A Benchmark and a Toolkit for Evaluating Medical Vision-Language Models
Corentin Royer
Bjoern H. Menze
Anjany Sekuboyina
VLM
38
10
0
14 Feb 2024
MISS: A Generative Pretraining and Finetuning Approach for Med-VQA
MISS: A Generative Pretraining and Finetuning Approach for Med-VQA
Jiawei Chen
Dingkang Yang
Yue Jiang
Yuxuan Lei
Lihua Zhang
LM&MA
MedIm
18
13
0
10 Jan 2024
UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic
  Cross-modal Learnable Prompts
UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Chenlu Zhan
Yufei Zhang
Yu Lin
Gaoang Wang
Hongwei Wang
VLM
MedIm
40
5
0
18 Dec 2023
BESTMVQA: A Benchmark Evaluation System for Medical Visual Question
  Answering
BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering
Xiaojie Hong
Zixin Song
Liangzhi Li
Xiaoli Wang
Feiyan Liu
28
1
0
13 Dec 2023
A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical
  Image Analysis
A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis
Yingshu Li
Yunyi Liu
Zhanyu Wang
Xinyu Liang
Lei Wang
Lingqiao Liu
Leyang Cui
Zhaopeng Tu
Longyue Wang
Luping Zhou
ELM
LM&MA
40
38
0
31 Oct 2023
Causal Reasoning through Two Layers of Cognition for Improving
  Generalization in Visual Question Answering
Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering
Trang Nguyen
Naoaki Okazaki
LRM
43
0
0
09 Oct 2023
A scoping review on multimodal deep learning in biomedical images and
  texts
A scoping review on multimodal deep learning in biomedical images and texts
Zhaoyi Sun
Mingquan Lin
Qingqing Zhu
Qianqian Xie
Fei-Yue Wang
Zhiyong Lu
Yifan Peng
36
18
0
14 Jul 2023
Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology
  Reporting
Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting
Chantal Pellegrini
Matthias Keicher
Ege Özsoy
Nassir Navab
37
13
0
11 Jul 2023
Masked Vision and Language Pre-training with Unimodal and Multimodal
  Contrastive Losses for Medical Visual Question Answering
Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Pengfei Li
Gang Liu
Jinlong He
Zixu Zhao
Shenjun Zhong
13
34
0
11 Jul 2023
Localized Questions in Medical Visual Question Answering
Localized Questions in Medical Visual Question Answering
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
24
8
0
03 Jul 2023
Multimodal Prompt Retrieval for Generative Visual Question Answering
Multimodal Prompt Retrieval for Generative Visual Question Answering
Timothy Ossowski
Junjie Hu
23
1
0
30 Jun 2023
Multi-modal Pre-training for Medical Vision-language Understanding and
  Generation: An Empirical Study with A New Benchmark
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark
Li Xu
Bo Liu
Ameer Hamza Khan
Lu Fan
Xiao-Ming Wu
LM&MA
35
9
0
10 Jun 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained
  Transformer for Vision, Language, and Multimodal Tasks
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA
MedIm
37
157
0
26 May 2023
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
Xiaoman Zhang
Chaoyi Wu
Ziheng Zhao
Weixiong Lin
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MA
48
153
0
17 May 2023
Multi-task Paired Masking with Alignment Modeling for Medical
  Vision-Language Pre-training
Multi-task Paired Masking with Alignment Modeling for Medical Vision-Language Pre-training
Kecheng Zhang
Jing Zhang
Jun Yu
Han Jiang
Jianping Fan
Qing-An Huang
Weidong Han
MedIm
38
29
0
13 May 2023
Towards Medical Artificial General Intelligence via Knowledge-Enhanced
  Multimodal Pretraining
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Bingqian Lin
Zicong Chen
Mingjie Li
Haokun Lin
Hang Xu
...
Ling-Hao Chen
Xiaojun Chang
Yi Yang
L. Xing
Xiaodan Liang
LM&MA
MedIm
AI4CE
40
14
0
26 Apr 2023
Q2ATransformer: Improving Medical VQA via an Answer Querying Decoder
Q2ATransformer: Improving Medical VQA via an Answer Querying Decoder
Yunyi Liu
Zhanyu Wang
Dong Xu
Luping Zhou
ViT
MedIm
19
35
0
04 Apr 2023
Logical Implications for Visual Question Answering Consistency
Logical Implications for Visual Question Answering Consistency
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
18
9
0
16 Mar 2023
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical
  Documents
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents
Weixiong Lin
Ziheng Zhao
Xiaoman Zhang
Chaoyi Wu
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MA
VLM
MedIm
31
144
0
13 Mar 2023
Open-Ended Medical Visual Question Answering Through Prefix Tuning of
  Language Models
Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models
Tom van Sonsbeek
Mohammad Mahdi Derakhshani
Ivona Najdenkoska
Cees G. M. Snoek
M. Worring
LM&MA
16
51
0
10 Mar 2023
RAMM: Retrieval-augmented Biomedical Visual Question Answering with
  Multi-modal Pre-training
RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training
Zheng Yuan
Qiao Jin
Chuanqi Tan
Zhengyun Zhao
Hongyi Yuan
Fei Huang
Songfang Huang
52
27
0
01 Mar 2023
Medical visual question answering using joint self-supervised learning
Medical visual question answering using joint self-supervised learning
Yuan Zhou
Jing Mei
Yiqin Yu
T. Syeda-Mahmood
MedIm
38
1
0
25 Feb 2023
MetaLDC: Meta Learning of Low-Dimensional Computing Classifiers for Fast
  On-Device Adaption
MetaLDC: Meta Learning of Low-Dimensional Computing Classifiers for Fast On-Device Adaption
Yejia Liu
Shijin Duan
Xiaolin Xu
Shaolei Ren
38
4
0
23 Feb 2023
Towards Unifying Medical Vision-and-Language Pre-training via Soft
  Prompts
Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
Zhihong Chen
Shizhe Diao
Benyou Wang
Guanbin Li
Xiang Wan
MedIm
27
29
0
17 Feb 2023
UnICLAM:Contrastive Representation Learning with Adversarial Masking for
  Unified and Interpretable Medical Vision Question Answering
UnICLAM:Contrastive Representation Learning with Adversarial Masking for Unified and Interpretable Medical Vision Question Answering
Chenlu Zhan
Peng Peng
Hongsen Wang
Tao Chen
Hongwei Wang
MedIm
25
3
0
21 Dec 2022
Self-supervised vision-language pretraining for Medical visual question
  answering
Self-supervised vision-language pretraining for Medical visual question answering
Pengfei Li
Gang Liu
Lin Tan
Jinying Liao
Shenjun Zhong
MedIm
21
33
0
24 Nov 2022
A Dual-Attention Learning Network with Word and Sentence Embedding for
  Medical Visual Question Answering
A Dual-Attention Learning Network with Word and Sentence Embedding for Medical Visual Question Answering
Xiaofei Huang
Hongfang Gong
MedIm
66
12
0
01 Oct 2022
RepsNet: Combining Vision with Language for Automated Medical Reports
RepsNet: Combining Vision with Language for Automated Medical Reports
A. Tanwani
Joelle Barral
Daniel Freedman
MedIm
44
20
0
27 Sep 2022
Align, Reason and Learn: Enhancing Medical Vision-and-Language
  Pre-training with Knowledge
Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge
Zhihong Chen
Guanbin Li
Xiang Wan
127
65
0
15 Sep 2022
Multi-Modal Masked Autoencoders for Medical Vision-and-Language
  Pre-Training
Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training
Zhihong Chen
Yu Du
Jinpeng Hu
Yang Liu
Guanbin Li
Xiang Wan
Tsung-Hui Chang
91
111
0
15 Sep 2022
Consistency-preserving Visual Question Answering in Medical Imaging
Consistency-preserving Visual Question Answering in Medical Imaging
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
MedIm
27
12
0
27 Jun 2022
12
Next