1902.00038
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
31 January 2019
H. Ben-younes
Rémi Cadène
Nicolas Thome
Matthieu Cord
Papers citing "BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection"
12 / 12 papers shown
Title
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
75
18
0
22 Mar 2024
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
Haopeng Li
Andong Deng
Qiuhong Ke
Jun Liu
Hossein Rahmani
Yulan Guo
Mohammed Bennamoun
Chen Chen
88
17
0
03 Jan 2024
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
56
203
0
26 Jul 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
Micael Carvalho
Rémi Cadène
David Picard
Laure Soulier
Nicolas Thome
Matthieu Cord
43
180
0
30 Apr 2018
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation
Ruichi Yu
Ang Li
Vlad I. Morariu
L. Davis
52
312
0
28 Jul 2017
Deformable Part-based Fully Convolutional Network for Object Detection
Taylor Mordan
Nicolas Thome
Matthieu Cord
Gilles Hénaff
ObjD
46
15
0
19 Jul 2017
Detecting Visual Relationships with Deep Relational Networks
Bo Dai
Yuqi Zhang
Dahua Lin
GNN
92
501
0
11 Apr 2017
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang
Lisa Lee
Eric Xing
74
252
0
08 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang
Zawlin Kyaw
Shih-Fu Chang
Tat-Seng Chua
ViT
230
560
0
27 Feb 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
328
3,235
0
02 Dec 2016
Visual Relationship Detection with Language Priors
Cewu Lu
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
VLM
73
1,139
0
31 Jul 2016
Neural Module Networks
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
131
1,072
0
09 Nov 2015