Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.09700
Cited By
Scene Graph Generation from Objects, Phrases and Region Captions
31 July 2017
Yikang Li
Wanli Ouyang
Bolei Zhou
Kun Wang
Xiaogang Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scene Graph Generation from Objects, Phrases and Region Captions"
37 / 37 papers shown
Title
Compile Scene Graphs with Reinforcement Learning
Zuyao Chen
Jinlin Wu
Zhen Lei
Marc Pollefeys
Chang Wen Chen
OffRL
LRM
101
2
0
18 Apr 2025
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
Sayak Nag
Udita Ghosh
Sarosij Bose
Calvin-Khang Ta
Jiachen Li
Amit K. Roy-Chowdhury
189
0
0
18 Mar 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
Xiaoye Qu
Dangyang Chen
Yuanyuan Fu
91
0
0
10 Jan 2025
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Hang Zhang
Zhuoling Li
Jun Liu
LRM
130
1
0
15 Dec 2024
Situational Scene Graph for Structured Human-centric Situation Understanding
Chinthani Sugandhika
Chen Li
Deepu Rajan
Basura Fernando
409
1
0
30 Oct 2024
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang
Pei Sun
Shoufa Chen
Min Xiao
Wenqi Shao
Wenwei Zhang
Yu Liu
Kai-xiang Chen
Ping Luo
MLLM
VLM
133
233
0
07 Jul 2023
DDS: Decoupled Dynamic Scene-Graph Generation Network
A S M Iftekhar
Raphael Ruschel
Satish Kumar
Suya You
B. S. Manjunath
67
2
0
18 Jan 2023
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
Ye Liu
Junsong Yuan
Chang Wen Chen
187
82
0
14 Aug 2020
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
Hanwang Zhang
Zawlin Kyaw
Jinyang Yu
Shih-Fu Chang
50
141
0
07 Aug 2017
Care about you: towards large-scale human-centric visual relationship detection
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
32
21
0
28 May 2017
Detecting Visual Relationships with Deep Relational Networks
Bo Dai
Yuqi Zhang
Dahua Lin
GNN
92
501
0
11 Apr 2017
Towards Context-aware Interaction Recognition
Bohan Zhuang
Lingqiao Liu
Chunhua Shen
Ian Reid
HAI
55
143
0
18 Mar 2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN
Bo Dai
Sanja Fidler
R. Urtasun
Dahua Lin
GAN
55
453
0
17 Mar 2017
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang
Lisa Lee
Eric Xing
69
252
0
08 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang
Zawlin Kyaw
Shih-Fu Chang
Tat-Seng Chua
ViT
224
560
0
27 Feb 2017
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
70
389
0
19 Feb 2017
Scene Graph Generation by Iterative Message Passing
Danfei Xu
Yuke Zhu
Chris Choy
Li Fei-Fei
GNN
3DV
78
1,219
0
10 Jan 2017
Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues
Bryan A. Plummer
Arun Mallya
Christopher M. Cervantes
Julia Hockenmaier
Svetlana Lazebnik
65
189
0
21 Nov 2016
Learning to generalize to new compositions in image understanding
Yuval Atzmon
Jonathan Berant
Vahid Kezami
Amir Globerson
Gal Chechik
52
67
0
27 Aug 2016
Visual Relationship Detection with Language Priors
Cewu Lu
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
VLM
73
1,138
0
31 Jul 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
196
5,726
0
23 Feb 2016
SSD: Single Shot MultiBox Detector
Wen Liu
Dragomir Anguelov
D. Erhan
Christian Szegedy
Scott E. Reed
Cheng-Yang Fu
Alexander C. Berg
ObjD
BDL
202
29,742
0
08 Dec 2015
Simple Baseline for Visual Question Answering
Bolei Zhou
Yuandong Tian
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
FAtt
62
323
0
07 Dec 2015
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
124
1,167
0
24 Nov 2015
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
665
36,801
0
08 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
478
62,122
0
04 Jun 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
182
5,452
0
03 May 2015
Fast R-CNN
Ross B. Girshick
ObjD
290
25,033
0
30 Apr 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
324
10,050
0
10 Feb 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
103
5,578
0
07 Dec 2014
Learning a Recurrent Visual Representation for Image Caption Generation
Xinlei Chen
C. L. Zitnick
SSL
GAN
95
195
0
20 Nov 2014
From Captions to Visual Concepts and Back
Hao Fang
Saurabh Gupta
F. Iandola
R. Srivastava
Li Deng
...
Xiaodong He
Margaret Mitchell
John C. Platt
C. L. Zitnick
Geoffrey Zweig
VLM
95
1,309
0
18 Nov 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
150
6,048
0
17 Nov 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.5K
100,213
0
04 Sep 2014
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
ObjD
367
11,199
0
18 Jun 2014
Rich feature hierarchies for accurate object detection and semantic segmentation
Ross B. Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
ObjD
274
26,168
0
11 Nov 2013
A Convex Formulation for Learning Task Relationships in Multi-Task Learning
Yu Zhang
Dit-Yan Yeung
106
461
0
15 Mar 2012
1