BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and
Visual Relationship Detection

BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection

31 January 2019

Papers citing "BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection"

12 / 12 papers shown

Title
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery Guan-Feng Wang Long Bai Wan Jun Nah Jie Wang Zhaoxi Zhang Zhen Chen Jinlin Wu Mobarakol Islam Hongbin Liu Hongliang Ren 75 18 0 22 Mar 2024
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports Haopeng Li Andong Deng Qiuhong Ke Jun Liu Hossein Rahmani Yulan Guo Mohammed Bennamoun Chen Chen 88 17 0 03 Jan 2024
Pythia v0.1: the Winning Entry to the VQA Challenge 2018 Yu Jiang Vivek Natarajan Xinlei Chen Marcus Rohrbach Dhruv Batra Devi Parikh VLM 56 203 0 26 Jul 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings Micael Carvalho Rémi Cadène David Picard Laure Soulier Nicolas Thome Matthieu Cord 43 180 0 30 Apr 2018
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation Ruichi Yu Ang Li Vlad I. Morariu L. Davis 52 312 0 28 Jul 2017
Deformable Part-based Fully Convolutional Network for Object Detection Taylor Mordan Nicolas Thome Matthieu Cord Gilles Hénaff ObjD 46 15 0 19 Jul 2017
Detecting Visual Relationships with Deep Relational Networks Bo Dai Yuqi Zhang Dahua Lin GNN 92 501 0 11 Apr 2017
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection Xiaodan Liang Lisa Lee Eric Xing 74 252 0 08 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection Hanwang Zhang Zawlin Kyaw Shih-Fu Chang Tat-Seng Chua ViT 230 560 0 27 Feb 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal Tejas Khot D. Summers-Stay Dhruv Batra Devi Parikh CoGe 328 3,235 0 02 Dec 2016
Visual Relationship Detection with Language Priors Cewu Lu Ranjay Krishna Michael S. Bernstein Li Fei-Fei VLM 73 1,139 0 31 Jul 2016
Neural Module Networks Jacob Andreas Marcus Rohrbach Trevor Darrell Dan Klein CoGe 131 1,072 0 09 Nov 2015