ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.01497
  4. Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
v1v2v3 (latest)

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMatObjD
ArXiv (abs)PDFHTML

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 10,543 papers shown
Title
Image Captioning with Object Detection and Localization
Image Captioning with Object Detection and Localization
Zhongliang Yang
Yujin Zhang
S. Rehman
Yongfeng Huang
ObjDVLM
55
47
0
08 Jun 2017
Driver Action Prediction Using Deep (Bidirectional) Recurrent Neural
  Network
Driver Action Prediction Using Deep (Bidirectional) Recurrent Neural Network
O. Olabiyi
E. Martinson
Vijay Chintalapudi
Rui Guo
59
76
0
07 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
112
168
0
05 Jun 2017
Face R-CNN
Face R-CNN
Hao Wang
Zhifeng Li
Xing Ji
Yitong Wang
ObjDCVBM
86
91
0
04 Jun 2017
Where and Who? Automatic Semantic-Aware Person Composition
Where and Who? Automatic Semantic-Aware Person Composition
Fuwen Tan
Crispin Bernier
Benjamin Cohen
Vicente Ordonez
Connelly Barnes
3DH
103
51
0
04 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
109
108
0
03 Jun 2017
Image Restoration from Patch-based Compressed Sensing Measurement
Image Restoration from Patch-based Compressed Sensing Measurement
Guangtao Nie
Ying Fu
Yinqiang Zheng
Hua Huang
117
10
0
02 Jun 2017
Generic Tubelet Proposals for Action Localization
Generic Tubelet Proposals for Action Localization
Jiawei He
Mostafa S. Ibrahim
Zhiwei Deng
Greg Mori
MedImViT
49
30
0
30 May 2017
Care about you: towards large-scale human-centric visual relationship
  detection
Care about you: towards large-scale human-centric visual relationship detection
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
57
21
0
28 May 2017
Enhancement of SSD by concatenating feature maps for object detection
Enhancement of SSD by concatenating feature maps for object detection
Jisoo Jeong
Hyojin Park
Nojun Kwak
ObjD
104
318
0
26 May 2017
Algorithmic clothing: hybrid recommendation, from street-style-to-shop
Algorithmic clothing: hybrid recommendation, from street-style-to-shop
Y. Qian
P. Giaccone
Michele Sasdelli
E. Vasquez
B. Sengupta
62
6
0
26 May 2017
Deep Learning for Lung Cancer Detection: Tackling the Kaggle Data
  Science Bowl 2017 Challenge
Deep Learning for Lung Cancer Detection: Tackling the Kaggle Data Science Bowl 2017 Challenge
Kingsley Kuan
Mathieu Ravaut
Gaurav Manek
Huiling Chen
Jie Lin
Babar Nazir
Cen Chen
T. C. Howe
Zengfeng Zeng
V. Chandrasekhar
MedIm
135
88
0
26 May 2017
Extraction and Classification of Diving Clips from Continuous Video
  Footage
Extraction and Classification of Diving Clips from Continuous Video Footage
Aiden Nibali
Zhen He
Stuart Morgan
Daniel Greenwood
83
18
0
25 May 2017
Attention-based Natural Language Person Retrieval
Attention-based Natural Language Person Retrieval
Tao Zhou
Muhao Chen
Jie Yu
Demetri Terzopoulos
54
14
0
24 May 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual
  Actions
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
237
1,035
0
23 May 2017
Fusion of Head and Full-Body Detectors for Multi-Object Tracking
Fusion of Head and Full-Body Detectors for Multi-Object Tracking
Roberto Henschel
Laura Leal-Taixé
Zorah Lähner
Bodo Rosenhahn
VOT
97
13
0
23 May 2017
Towards seamless multi-view scene analysis from satellite to
  street-level
Towards seamless multi-view scene analysis from satellite to street-level
Sébastien Lefèvre
D. Tuia
Jan Dirk Wegner
T. Produit
Ahmed Samy Nassaar
86
67
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
452
8,079
0
22 May 2017
Computer vision-based food calorie estimation: dataset, method, and
  experiment
Computer vision-based food calorie estimation: dataset, method, and experiment
Yanchao Liang
Jianhua Li
64
38
0
22 May 2017
The Do's and Don'ts for CNN-based Face Verification
The Do's and Don'ts for CNN-based Face Verification
Ankan Bansal
Carlos D. Castillo
Rajeev Ranjan
Rama Chellappa
CVBM
84
97
0
21 May 2017
Multiple-Human Parsing in the Wild
Multiple-Human Parsing in the Wild
Jianshu Li
Jian-jun Zhao
Yunchao Wei
Congyan Lang
Yidong Li
Terence Sim
Bo An
Jiashi Feng
69
18
0
19 May 2017
Sparse Coding on Stereo Video for Object Detection
Sparse Coding on Stereo Video for Object Detection
Sheng Y. Lundquist
Melanie Mitchell
Garrett Kenyon
85
8
0
19 May 2017
Matching neural paths: transfer from recognition to correspondence
  search
Matching neural paths: transfer from recognition to correspondence search
Nikolay Savinov
Lubor Ladicky
Marc Pollefeys
80
10
0
19 May 2017
Re3 : Real-Time Recurrent Regression Networks for Visual Tracking of
  Generic Objects
Re3 : Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects
Daniel Gordon
Ali Farhadi
Dieter Fox
VOT
84
48
0
17 May 2017
LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object
  Detection in Embedded Systems
LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems
Subarna Tripathi
G. Dane
Byeongkeun Kang
V. Bhaskaran
Truong Thao Nguyen
ObjD
99
37
0
16 May 2017
WebVision Challenge: Visual Learning and Understanding With Web Data
WebVision Challenge: Visual Learning and Understanding With Web Data
Wen Li
Limin Wang
Wei Li
E. Agustsson
Jesse Berent
Abhinav Gupta
Rahul Sukthankar
Luc Van Gool
VLM
84
19
0
16 May 2017
IAN: The Individual Aggregation Network for Person Search
IAN: The Individual Aggregation Network for Person Search
Jimin Xiao
Yanchun Xie
T. Tillo
Kaizhu Huang
Yunchao Wei
Jiashi Feng
3DPC
74
146
0
16 May 2017
Reconfiguring the Imaging Pipeline for Computer Vision
Reconfiguring the Imaging Pipeline for Computer Vision
Mark Buckler
Suren Jayasuriya
Adrian Sampson
113
105
0
11 May 2017
CORe50: a New Dataset and Benchmark for Continuous Object Recognition
CORe50: a New Dataset and Benchmark for Continuous Object Recognition
Vincenzo Lomonaco
Davide Maltoni
209
496
0
09 May 2017
Cell Tracking via Proposal Generation and Selection
Cell Tracking via Proposal Generation and Selection
S. Akram
Arno Solin
L. Eklund
J. Heikkilä
101
28
0
09 May 2017
Learning non-maximum suppression
Learning non-maximum suppression
J. Hosang
Rodrigo Benenson
Bernt Schiele
ObjD
92
524
0
08 May 2017
A Dual-Source Approach for 3D Human Pose Estimation from a Single Image
A Dual-Source Approach for 3D Human Pose Estimation from a Single Image
Umar Iqbal
Andreas Doering
H. Yasin
Björn Krüger
A. Weber
Juergen Gall
3DH
57
37
0
08 May 2017
What Can Help Pedestrian Detection?
What Can Help Pedestrian Detection?
Jiayuan Mao
Tete Xiao
Yuning Jiang
Zhimin Cao
101
288
0
08 May 2017
Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
G. Ning
Zhi Zhang
Zhiquan He
GAN
76
172
0
05 May 2017
DeepCorrect: Correcting DNN models against Image Distortions
DeepCorrect: Correcting DNN models against Image Distortions
Tejas S. Borkar
Lina Karam
163
93
0
05 May 2017
Face Detection, Bounding Box Aggregation and Pose Estimation for Robust
  Facial Landmark Localisation in the Wild
Face Detection, Bounding Box Aggregation and Pose Estimation for Robust Facial Landmark Localisation in the Wild
Zhenhua Feng
J. Kittler
Muhammad Awais
P. Huber
Xiaojun Wu
CVBM
67
40
0
05 May 2017
S-OHEM: Stratified Online Hard Example Mining for Object Detection
S-OHEM: Stratified Online Hard Example Mining for Object Detection
Min-ne Li
Zhaoning Zhang
Hao Yu
Xinyuan Chen
Dongsheng Li
ObjD
46
16
0
05 May 2017
Unsupervised learning of object landmarks by factorized spatial
  embeddings
Unsupervised learning of object landmarks by factorized spatial embeddings
James Thewlis
Hakan Bilen
Andrea Vedaldi
OCLSSL
82
162
0
05 May 2017
TALL: Temporal Activity Localization via Language Query
TALL: Temporal Activity Localization via Language Query
J. Gao
Chen Sun
Zhenheng Yang
Ram Nevatia
220
830
0
05 May 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
96
327
0
04 May 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting through 360°
  Sports Video
Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Video
Hou-Ning Hu
Yen-Chen Lin
Ming-Yuan Liu
Hsien-Tzu Cheng
Yung-Ju Chang
Min Sun
96
178
0
04 May 2017
VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera
VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera
Dushyant Mehta
Srinath Sridhar
Oleksandr Sotnychenko
Helge Rhodin
Mohammad Shafiei
Hans-Peter Seidel
Weipeng Xu
Dan Casas
Christian Theobalt
3DH
142
822
0
03 May 2017
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Fanyi Xiao
Leonid Sigal
Yong Jae Lee
103
139
0
03 May 2017
Cascaded Boundary Regression for Temporal Action Detection
Cascaded Boundary Regression for Temporal Action Detection
J. Gao
Zhenheng Yang
Ram Nevatia
93
220
0
02 May 2017
Lesion detection and Grading of Diabetic Retinopathy via Two-stages Deep
  Convolutional Neural Networks
Lesion detection and Grading of Diabetic Retinopathy via Two-stages Deep Convolutional Neural Networks
Yehui Yang
Tao Li
Wensi Li
Haishan Wu
Wei Fan
Wensheng Zhang
MedIm
69
176
0
02 May 2017
Classical Planning in Deep Latent Space: Bridging the
  Subsymbolic-Symbolic Boundary
Classical Planning in Deep Latent Space: Bridging the Subsymbolic-Symbolic Boundary
Masataro Asai
A. Fukunaga
199
173
0
29 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
118
1,417
0
27 Apr 2017
Spatio-temporal Person Retrieval via Natural Language Queries
Spatio-temporal Person Retrieval via Natural Language Queries
Masataka Yamaguchi
Kuniaki Saito
Yoshitaka Ushiku
Tatsuya Harada
103
58
0
26 Apr 2017
Skeleton-based Action Recognition with Convolutional Neural Networks
Skeleton-based Action Recognition with Convolutional Neural Networks
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
MedImViT
116
368
0
25 Apr 2017
Detecting and Recognizing Human-Object Interactions
Detecting and Recognizing Human-Object Interactions
Georgia Gkioxari
Ross B. Girshick
Piotr Dollár
Kaiming He
155
578
0
24 Apr 2017
Previous
123...201202203...209210211
Next