Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.11544
Cited By
Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives
25 April 2022
Shaoning Xiao
Long Chen
Kaifeng Gao
Zhao Wang
Yi Yang
Zhimeng Zhang
Jun Xiao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives"
3 / 3 papers shown
Title
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
165
100
0
29 Apr 2021
Counterfactual Samples Synthesizing for Robust Visual Question Answering
Long Chen
Xin Yan
Jun Xiao
Hanwang Zhang
Shiliang Pu
Yueting Zhuang
OOD
AAML
154
290
0
14 Mar 2020
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
167
1,464
0
06 Jun 2016
1