ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.07332
  4. Cited By
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense
  Image Annotations

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

23 February 2016
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
Joshua Kravitz
Stephanie Chen
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
ArXivPDFHTML

Papers citing "Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations"

50 / 1,032 papers shown
Title
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
Weicheng Kuo
A. Angelova
Jitendra Malik
Nayeon Lee
3DPC
ISeg
30
117
0
05 Apr 2019
Context and Attribute Grounded Dense Captioning
Context and Attribute Grounded Dense Captioning
Guojun Yin
Lu Sheng
Bin Liu
Nenghai Yu
Xiaogang Wang
Jing Shao
16
75
0
02 Apr 2019
An End-to-End Network for Generating Social Relationship Graphs
An End-to-End Network for Generating Social Relationship Graphs
A. Goel
K. Ma
Cheston Tan
GNN
21
39
0
23 Mar 2019
MMKG: Multi-Modal Knowledge Graphs
MMKG: Multi-Modal Knowledge Graphs
Ye Liu
Hui Li
Alberto García-Durán
Mathias Niepert
Daniel Oñoro-Rubio
David S. Rosenblum
18
193
0
13 Mar 2019
Visual Semantic Information Pursuit: A Survey
Visual Semantic Information Pursuit: A Survey
Daqi Liu
M. Bober
J. Kittler
15
31
0
13 Mar 2019
Knowledge-Embedded Routing Network for Scene Graph Generation
Knowledge-Embedded Routing Network for Scene Graph Generation
Tianshui Chen
Weihao Yu
Riquan Chen
Liang Lin
GNN
49
371
0
08 Mar 2019
Answer Them All! Toward Universal Visual Question Answering Models
Answer Them All! Toward Universal Visual Question Answering Models
Robik Shrestha
Kushal Kafle
Christopher Kanan
17
82
0
01 Mar 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
19
271
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
22
72
0
25 Feb 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Zhe Gan
Yu Cheng
Ahmed El Kholy
Linjie Li
Jingjing Liu
Jianfeng Gao
11
104
0
01 Feb 2019
Adversarial Adaptation of Scene Graph Models for Understanding Civic
  Issues
Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues
Shanu Kumar
Shubham Atreja
Anjali Singh
Mohit Jain
16
12
0
29 Jan 2019
Adversarial Attacks on Deep Learning Models in Natural Language
  Processing: A Survey
Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey
W. Zhang
Quan Z. Sheng
A. Alhazmi
Chenliang Li
AAML
24
57
0
21 Jan 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
51
322
0
20 Jan 2019
Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
Hexiang Hu
Ishan Misra
L. V. D. van der Maaten
24
22
0
19 Jan 2019
Using Scene Graph Context to Improve Image Generation
Using Scene Graph Context to Improve Image Generation
Subarna Tripathi
Anahita Bhiwandiwalla
A. Bastidas
Hanlin Tang
GNN
48
32
0
11 Jan 2019
Scene Graph Reasoning with Prior Visual Relationship for Visual Question
  Answering
Scene Graph Reasoning with Prior Visual Relationship for Visual Question Answering
Zhuoqian Yang
Zengchang Qin
Jing Yu
Yue Hu
GNN
25
16
0
23 Dec 2018
nocaps: novel object captioning at scale
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
21
468
0
20 Dec 2018
Grounded Video Description
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
27
190
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual
  Question Answering
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
24
363
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
21
692
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
22
159
0
06 Dec 2018
Image Generation from Layout
Image Generation from Layout
Bo Zhao
Lili Meng
Weidong Yin
Leonid Sigal
19
208
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
47
866
0
27 Nov 2018
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware
  Image-to-Image Translation
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation
Matteo Tomei
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
17
76
0
26 Nov 2018
Object-oriented Targets for Visual Navigation using Rich Semantic
  Representations
Object-oriented Targets for Visual Navigation using Rich Semantic Representations
Jean-Benoit Delbrouck
Stéphane Dupont
31
3
0
22 Nov 2018
Scene Graph Generation via Conditional Random Fields
Weilin Cong
Luu Anh Tuan
Wang-Chien Lee
GNN
19
22
0
20 Nov 2018
Explicit Bias Discovery in Visual Question Answering Models
Explicit Bias Discovery in Visual Question Answering Models
Varun Manjunatha
Nirat Saini
L. Davis
CML
FAtt
19
92
0
19 Nov 2018
SEIGAN: Towards Compositional Image Generation by Simultaneously
  Learning to Segment, Enhance, and Inpaint
SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint
Pavel Ostyakov
Roman Suvorov
Elizaveta Logacheva
Oleg Khomenko
Sergey I. Nikolenko
GAN
11
23
0
19 Nov 2018
Exploiting Class Learnability in Noisy Data
Exploiting Class Learnability in Noisy Data
Matthew Klawonn
Eric Heim
James A. Hendler
NoLa
20
7
0
15 Nov 2018
LinkNet: Relational Embedding for Scene Graph
LinkNet: Relational Embedding for Scene Graph
Sanghyun Woo
Dahun Kim
Donghyeon Cho
In So Kweon
GNN
15
147
0
15 Nov 2018
Hybrid Knowledge Routed Modules for Large-scale Object Detection
Hybrid Knowledge Routed Modules for Large-scale Object Detection
Chenhan Jiang
Hang Xu
Xiangdan Liang
Liang Lin
VLM
ObjD
39
86
0
30 Oct 2018
Visual Semantic Navigation using Scene Priors
Visual Semantic Navigation using Scene Priors
Wei Yang
Xinyu Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
33
320
0
15 Oct 2018
The Focus-Aspect-Polarity Model for Predicting Subjective Noun
  Attributes in Images
The Focus-Aspect-Polarity Model for Predicting Subjective Noun Attributes in Images
Tushar Karayil
Philipp Blandfort
Jörn Hees
Andreas Dengel
27
0
0
15 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
760
0
06 Oct 2018
DASNet: Reducing Pixel-level Annotations for Instance and Semantic
  Segmentation
DASNet: Reducing Pixel-level Annotations for Instance and Semantic Segmentation
Chuang Niu
Shenghan Ren
Jimin Liang
SSeg
14
0
0
17 Sep 2018
Context-Dependent Diffusion Network for Visual Relationship Detection
Context-Dependent Diffusion Network for Visual Relationship Detection
Zhen Cui
Chunyan Xu
Wenming Zheng
Jian Yang
GNN
22
50
0
11 Sep 2018
Recent Advances in Object Detection in the Age of Deep Convolutional
  Neural Networks
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
Shivang Agarwal
Jean Ogier du Terrail
F. Jurie
ObjD
24
123
0
10 Sep 2018
Deep Learning for Generic Object Detection: A Survey
Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
M. Pietikäinen
ObjD
VLM
OOD
72
2,420
0
06 Sep 2018
Object Hallucination in Image Captioning
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
27
402
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
OCNet: Object Context Network for Scene Parsing
OCNet: Object Context Network for Scene Parsing
Yuhui Yuan
Lang Huang
Jianyuan Guo
Chao Zhang
Xilin Chen
Jingdong Wang
25
599
0
04 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and
  Comprehensive Image Captions
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
98
65
0
27 Aug 2018
Context-Aware Visual Policy Network for Sequence-Level Image Captioning
Context-Aware Visual Policy Network for Sequence-Level Image Captioning
Daqing Liu
Zhengjun Zha
Hanwang Zhang
Yongdong Zhang
Feng Wu
CLIP
33
103
0
16 Aug 2018
Graph R-CNN for Scene Graph Generation
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
57
836
0
01 Aug 2018
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship
  Features
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
Xu Yang
Hanwang Zhang
Jianfei Cai
47
74
0
01 Aug 2018
Visual Graphs from Motion (VGfM): Scene understanding with object
  geometry reasoning
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning
P. Gay
Stuart James
Alessio Del Bue
OCL
50
31
0
16 Jul 2018
Object Relation Detection Based on One-shot Learning
Object Relation Detection Based on One-shot Learning
Li Zhou
Jian-jun Zhao
Jianshu Li
Li-xin Yuan
Jiashi Feng
ObjD
19
23
0
16 Jul 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship
  Recognition
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
Guojun Yin
Lu Sheng
Bin Liu
Nenghai Yu
Xiaogang Wang
Jing Shao
Chen Change Loy
ObjD
32
156
0
13 Jul 2018
Previous
123...18192021
Next