ResearchTrend.AI
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection

31 January 2019
H. Ben-younes
Rémi Cadène
Nicolas Thome
Matthieu Cord

Papers citing "BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection"

12 / 12 papers shown
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
75
18
0
22 Mar 2024
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
Haopeng Li
Andong Deng
Qiuhong Ke
Jun Liu
Hossein Rahmani
Yulan Guo
Mohammed Bennamoun
Chen Chen
88
17
0
03 Jan 2024
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
56
203
0
26 Jul 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
Micael Carvalho
Rémi Cadène
David Picard
Laure Soulier
Nicolas Thome
Matthieu Cord
43
180
0
30 Apr 2018
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation
Ruichi Yu
Ang Li
Vlad I. Morariu
L. Davis
52
312
0
28 Jul 2017
Deformable Part-based Fully Convolutional Network for Object Detection
Taylor Mordan
Nicolas Thome
Matthieu Cord
Gilles Hénaff
ObjD
46
15
0
19 Jul 2017
Detecting Visual Relationships with Deep Relational Networks
Bo Dai
Yuqi Zhang
Dahua Lin
GNN
92
501
0
11 Apr 2017
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang
Lisa Lee
Eric Xing
74
252
0
08 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang
Zawlin Kyaw
Shih-Fu Chang
Tat-Seng Chua
ViT
230
560
0
27 Feb 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
328
3,235
0
02 Dec 2016
Visual Relationship Detection with Language Priors
Cewu Lu
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
VLM
73
1,139
0
31 Jul 2016
Neural Module Networks
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
131
1,072
0
09 Nov 2015