1602.07332
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
23 February 2016
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
Joshua Kravitz
Stephanie Chen
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
Papers citing
"Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations"
50 / 1,032 papers shown
Title
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
Weicheng Kuo
A. Angelova
Jitendra Malik
Nayeon Lee
3DPC
ISeg
30
117
0
05 Apr 2019
Context and Attribute Grounded Dense Captioning
Guojun Yin
Lu Sheng
Bin Liu
Nenghai Yu
Xiaogang Wang
Jing Shao
16
75
0
02 Apr 2019
An End-to-End Network for Generating Social Relationship Graphs
A. Goel
K. Ma
Cheston Tan
GNN
21
39
0
23 Mar 2019
MMKG: Multi-Modal Knowledge Graphs
Ye Liu
Hui Li
Alberto García-Durán
Mathias Niepert
Daniel Oñoro-Rubio
David S. Rosenblum
18
193
0
13 Mar 2019
Visual Semantic Information Pursuit: A Survey
Daqi Liu
M. Bober
J. Kittler
15
31
0
13 Mar 2019
Knowledge-Embedded Routing Network for Scene Graph Generation
Tianshui Chen
Weihao Yu
Riquan Chen
Liang Lin
GNN
49
371
0
08 Mar 2019
Answer Them All! Toward Universal Visual Question Answering Models
Robik Shrestha
Kushal Kafle
Christopher Kanan
17
82
0
01 Mar 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
19
271
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
22
72
0
25 Feb 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Zhe Gan
Yu Cheng
Ahmed El Kholy
Linjie Li
Jingjing Liu
Jianfeng Gao
11
104
0
01 Feb 2019
Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues
Shanu Kumar
Shubham Atreja
Anjali Singh
Mohit Jain
16
12
0
29 Jan 2019
Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey
W. Zhang
Quan Z. Sheng
A. Alhazmi
Chenliang Li
AAML
24
57
0
21 Jan 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
51
322
0
20 Jan 2019
Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
Hexiang Hu
Ishan Misra
L. van der Maaten
24
22
0
19 Jan 2019
Using Scene Graph Context to Improve Image Generation
Subarna Tripathi
Anahita Bhiwandiwalla
A. Bastidas
Hanlin Tang
GNN
48
32
0
11 Jan 2019
Scene Graph Reasoning with Prior Visual Relationship for Visual Question Answering
Zhuoqian Yang
Zengchang Qin
Jing Yu
Yue Hu
GNN
25
16
0
23 Dec 2018
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
21
468
0
20 Dec 2018
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
27
190
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
24
363
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
21
692
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
22
159
0
06 Dec 2018
Image Generation from Layout
Bo Zhao
Lili Meng
Weidong Yin
Leonid Sigal
19
208
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
47
866
0
27 Nov 2018
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation
Matteo Tomei
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
17
76
0
26 Nov 2018
Object-oriented Targets for Visual Navigation using Rich Semantic Representations
Jean-Benoit Delbrouck
Stéphane Dupont
31
3
0
22 Nov 2018
Scene Graph Generation via Conditional Random Fields
Weilin Cong
Luu Anh Tuan
Wang-Chien Lee
GNN
19
22
0
20 Nov 2018
Explicit Bias Discovery in Visual Question Answering Models
Varun Manjunatha
Nirat Saini
L. Davis
CML
FAtt
19
92
0
19 Nov 2018
SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint
Pavel Ostyakov
Roman Suvorov
Elizaveta Logacheva
Oleg Khomenko
Sergey I. Nikolenko
GAN
11
23
0
19 Nov 2018
Exploiting Class Learnability in Noisy Data
Matthew Klawonn
Eric Heim
James A. Hendler
NoLa
20
7
0
15 Nov 2018
LinkNet: Relational Embedding for Scene Graph
Sanghyun Woo
Dahun Kim
Donghyeon Cho
In So Kweon
GNN
15
147
0
15 Nov 2018
Hybrid Knowledge Routed Modules for Large-scale Object Detection
Chenhan Jiang
Hang Xu
Xiaodan Liang
Liang Lin
VLM
ObjD
39
86
0
30 Oct 2018
Visual Semantic Navigation using Scene Priors
Wei Yang
Xinyu Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
33
320
0
15 Oct 2018
The Focus-Aspect-Polarity Model for Predicting Subjective Noun Attributes in Images
Tushar Karayil
Philipp Blandfort
Jörn Hees
Andreas Dengel
27
0
0
15 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
760
0
06 Oct 2018
DASNet: Reducing Pixel-level Annotations for Instance and Semantic Segmentation
Chuang Niu
Shenghan Ren
Jimin Liang
SSeg
14
0
0
17 Sep 2018
Context-Dependent Diffusion Network for Visual Relationship Detection
Zhen Cui
Chunyan Xu
Wenming Zheng
Jian Yang
GNN
22
50
0
11 Sep 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
Shivang Agarwal
Jean Ogier du Terrail
F. Jurie
ObjD
24
123
0
10 Sep 2018
Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
M. Pietikäinen
ObjD
VLM
OOD
72
2,420
0
06 Sep 2018
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
27
402
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
OCNet: Object Context Network for Scene Parsing
Yuhui Yuan
Lang Huang
Jianyuan Guo
Chao Zhang
Xilin Chen
Jingdong Wang
25
599
0
04 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
98
65
0
27 Aug 2018
Context-Aware Visual Policy Network for Sequence-Level Image Captioning
Daqing Liu
Zhengjun Zha
Hanwang Zhang
Yongdong Zhang
Feng Wu
CLIP
33
103
0
16 Aug 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
57
836
0
01 Aug 2018
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
Xu Yang
Hanwang Zhang
Jianfei Cai
47
74
0
01 Aug 2018
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning
P. Gay
Stuart James
Alessio Del Bue
OCL
50
31
0
16 Jul 2018
Object Relation Detection Based on One-shot Learning
Li Zhou
Jian-jun Zhao
Jianshu Li
Li-xin Yuan
Jiashi Feng
ObjD
19
23
0
16 Jul 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
Guojun Yin
Lu Sheng
Bin Liu
Nenghai Yu
Xiaogang Wang
Jing Shao
Chen Change Loy
ObjD
32
156
0
13 Jul 2018