ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.03147
  4. Cited By
Improving Visual Question Answering Models through Robustness Analysis
  and In-Context Learning with a Chain of Basic Questions

Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions

6 April 2023
Jia-Hong Huang
Modar Alfadly
Guohao Li
M. Worring
    OOD
    AAML
ArXivPDFHTML

Papers citing "Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions"

4 / 4 papers shown
Title
Contextualized Keyword Representations for Multi-modal Retinal Image
  Captioning
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang
Ting-Wei Wu
M. Worring
MedIm
60
26
0
26 Apr 2021
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video
  Summarization
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Jia-Hong Huang
L. Murn
M. Mrak
M. Worring
ViT
94
37
0
26 Apr 2021
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Long Chen
Xin Yan
Jun Xiao
Hanwang Zhang
Shiliang Pu
Yueting Zhuang
OOD
AAML
154
290
0
14 Mar 2020
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
167
1,464
0
06 Jun 2016
1