Spatial Memory for Context Reasoning in Object Detection


13 April 2017
Xinlei Chen
Abhinav Gupta
    ObjD

Papers citing "Spatial Memory for Context Reasoning in Object Detection"

50 / 54 papers shown
Title
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang
Lisa Lee
Eric Xing
74
252
0
08 Mar 2017
Neural Map: Structured Memory for Deep Reinforcement Learning
Emilio Parisotto
Ruslan Salakhutdinov
75
260
0
27 Feb 2017
Cognitive Mapping and Planning for Visual Navigation
Saurabh Gupta
Varun Tolani
James Davidson
Sergey Levine
Rahul Sukthankar
Jitendra Malik
89
715
0
13 Feb 2017
An Implementation of Faster RCNN with Study for Region Sampling
Xinlei Chen
Abhinav Gupta
ObjD
54
160
0
07 Feb 2017
Beyond Skip Connections: Top-Down Modulation for Object Detection
Abhinav Shrivastava
Rahul Sukthankar
Jitendra Malik
Abhinav Gupta
ObjD
92
321
0
20 Dec 2016
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
483
22,134
0
09 Dec 2016
Speed/accuracy trade-offs for modern convolutional object detectors
Jonathan Huang
V. Rathod
Chen Sun
Menglong Zhu
Anoop Korattikara Balan
...
Ian S. Fischer
Z. Wojna
Yang Song
S. Guadarrama
Kevin Patrick Murphy
3DH, 3DV
102
2,572
0
30 Nov 2016
Hierarchical Object Detection with Deep Reinforcement Learning
Míriam Bellver
Xavier Giró-i-Nieto
F. Marqués
Jordi Torres
42
105
0
11 Nov 2016
End-to-end training of object class detectors for mean average precision
Paul Henderson
V. Ferrari
ObjD
65
268
0
12 Jul 2016
Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
Spyridon Gidaris
N. Komodakis
ObjD
73
80
0
14 Jun 2016
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Liang-Chieh Chen
George Papandreou
Iasonas Kokkinos
Kevin Patrick Murphy
Alan Yuille
SSeg
265
18,259
0
02 Jun 2016
Adversarial Feature Learning
Jeff Donahue
Philipp Krähenbühl
Trevor Darrell
GAN
115
1
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
77
62
0
30 May 2016
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Jifeng Dai
Yi Li
Kaiming He
Jian Sun
ObjD
177
5,642
0
20 May 2016
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
46
222
0
24 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
77
755
0
04 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
223
5,761
0
23 Feb 2016
Learning to Compose Neural Networks for Question Answering
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
NAI, KELM, BDL, CoGe
99
568
0
07 Jan 2016
Adaptive Object Detection Using Adjacency and Zoom Prediction
Y. Lu
T. Javidi
Svetlana Lazebnik
ObjD
58
76
0
24 Dec 2015
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Sean Bell
C. L. Zitnick
Kavita Bala
Ross B. Girshick
ObjD
86
1,210
0
14 Dec 2015
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
SSD: Single Shot MultiBox Detector
Wei Liu
Dragomir Anguelov
D. Erhan
Christian Szegedy
Scott E. Reed
Cheng-Yang Fu
Alexander C. Berg
ObjD, BDL
244
29,859
0
08 Dec 2015
Exploring Person Context and Local Scene Context for Object Detection
Saurabh Gupta
Bharath Hariharan
Jitendra Malik
ObjD
52
25
0
25 Nov 2015
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
129
1,170
0
24 Nov 2015
Top-Down Learning for Structured Labeling with Convolutional Pseudoprior
Saining Xie
Xun Huang
Zhuowen Tu
SSL
60
38
0
23 Nov 2015
Where To Look: Focus Regions for Visual Question Answering
Kevin J. Shih
Saurabh Singh
Derek Hoiem
73
460
0
23 Nov 2015
A convnet for non-maximum suppression
J. Hosang
Rodrigo Benenson
Bernt Schiele
44
76
0
19 Nov 2015
Active Object Localization with Deep Reinforcement Learning
Juan C. Caicedo
Svetlana Lazebnik
ObjD
65
445
0
18 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
76
763
0
17 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
80
497
0
12 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
102
887
0
11 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
109
1,883
0
07 Nov 2015
AttentionNet: Aggregating Weak Directions for Accurate Object Detection
Donggeun Yoo
Sunggyun Park
Joon-Young Lee
Anthony S. Paek
In So Kweon
92
161
0
25 Jun 2015
End-to-end people detection in crowded scenes
Russell Stewart
Mykhaylo Andriluka
73
544
0
16 Jun 2015
ParseNet: Looking Wider to See Better
Wei Liu
Andrew Rabinovich
Alexander C. Berg
SSeg
124
1,214
0
15 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
152
2,038
0
09 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat, ObjD
520
62,360
0
04 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wei Xu
78
501
0
21 May 2015
Contextual Action Recognition with R*CNN
Georgia Gkioxari
Ross B. Girshick
Jitendra Malik
HAI
87
403
0
05 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
108
600
0
05 May 2015
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
142
1,419
0
03 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
214
5,497
0
03 May 2015
Fast R-CNN
Ross B. Girshick
ObjD
309
25,081
0
30 Apr 2015
Conditional Random Fields as Recurrent Neural Networks
Shuai Zheng
Sadeep Jayasumana
Bernardino Romera-Paredes
Vibhav Vineet
Zhizhong Su
Dalong Du
Chang Huang
Philip Torr
SSeg
243
2,536
0
11 Feb 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,328
0
11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
348
10,079
0
10 Feb 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
Junhua Mao
Wei Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
VLM
176
1,240
0
20 Dec 2014
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
598
12,734
0
11 Dec 2014
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
249
6,035
0
17 Nov 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
165
6,056
0
17 Nov 2014