ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.01497
  4. Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMat
    ObjD
ArXivPDFHTML

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 7,713 papers shown
Title
Sequential Latent Spaces for Modeling the Intention During Diverse Image
  Captioning
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
Alex Schwing
BDL
VLM
26
66
0
22 Aug 2019
Object detection on aerial imagery using CenterNet
Object detection on aerial imagery using CenterNet
D. Pailla
V. Kollerathu
Sai Saketh Chennamsetty
ObjD
21
11
0
22 Aug 2019
InstaBoost: Boosting Instance Segmentation via Probability Map Guided
  Copy-Pasting
InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting
Haoshu Fang
Jianhua Sun
Runzhong Wang
Minghao Gou
Yong-Lu Li
Cewu Lu
ISeg
19
206
0
21 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision
Saccader: Improving Accuracy of Hard Attention Models for Vision
Gamaleldin F. Elsayed
Simon Kornblith
Quoc V. Le
VLM
29
72
0
20 Aug 2019
On Object Symmetries and 6D Pose Estimation from Images
On Object Symmetries and 6D Pose Estimation from Images
Giorgia Pitteri
Michael Ramamonjisoa
Slobodan Ilic
Vincent Lepetit
11
51
0
20 Aug 2019
Phrase Localization Without Paired Training Examples
Phrase Localization Without Paired Training Examples
Josiah Wang
Lucia Specia
35
42
0
20 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from
  Transformers
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLM
MLLM
156
2,457
0
20 Aug 2019
A Novel method for IDC Prediction in Breast Cancer Histopathology images
  using Deep Residual Neural Networks
A Novel method for IDC Prediction in Breast Cancer Histopathology images using Deep Residual Neural Networks
Chandra Churh Chatterjee
G. Krishna
36
9
0
20 Aug 2019
Deep High-Resolution Representation Learning for Visual Recognition
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
195
3,550
0
20 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query
  in Video using Guided Attention
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
30
147
0
20 Aug 2019
Zero-Shot Grounding of Objects from Natural Language Queries
Zero-Shot Grounding of Objects from Natural Language Queries
Arka Sadhu
Kan Chen
Ram Nevatia
ObjD
30
157
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
23
0
19 Aug 2019
Attention on Attention for Image Captioning
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
24
824
0
19 Aug 2019
A Kings Ransom for Encryption: Ransomware Classification using Augmented
  One-Shot Learning and Bayesian Approximation
A Kings Ransom for Encryption: Ransomware Classification using Augmented One-Shot Learning and Bayesian Approximation
Amir Atapour-Abarghouei
Stephen Bonner
A. Mcgough
40
7
0
19 Aug 2019
C-RPNs: Promoting Object Detection in real world via a Cascade Structure
  of Region Proposal Networks
C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks
Dongming Yang
YueXian Zou
Jian Zhang
Ge Li
ObjD
20
9
0
19 Aug 2019
Weakly-supervised Action Localization with Background Modeling
Weakly-supervised Action Localization with Background Modeling
P. Nguyen
Deva Ramanan
Charless C. Fowlkes
SSL
WSOL
30
157
0
19 Aug 2019
A Delay Metric for Video Object Detection: What Average Precision Fails
  to Tell
A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Huizi Mao
Xiaodong Yang
W. Dally
24
39
0
18 Aug 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
A Fast and Accurate One-Stage Approach to Visual Grounding
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
ObjD
17
361
0
18 Aug 2019
Anomaly Detection in Video Sequence with Appearance-Motion
  Correspondence
Anomaly Detection in Video Sequence with Appearance-Motion Correspondence
Trong-Nguyen Nguyen
J. Meunier
26
344
0
17 Aug 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal
  Pre-training
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
Gen Li
Nan Duan
Yuejian Fang
Ming Gong
Daxin Jiang
Ming Zhou
SSL
VLM
MLLM
91
895
0
16 Aug 2019
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel
  Aggregation Network
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
Wenhai Wang
Enze Xie
Xiaoge Song
Yuhang Zang
Wenjia Wang
Tong Lu
Gang Yu
Chunhua Shen
29
415
0
16 Aug 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional
  Neural Networks
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Zhaoyang Zhang
Jingyu Li
Wenqi Shao
Zhanglin Peng
Ruimao Zhang
Xiaogang Wang
Ping Luo
30
37
0
16 Aug 2019
PubLayNet: largest dataset ever for document layout analysis
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
15
448
0
16 Aug 2019
Deep Sparse Band Selection for Hyperspectral Face Recognition
Deep Sparse Band Selection for Hyperspectral Face Recognition
Fariborz Taherkhani
J. Dawson
Nasser M. Nasrabadi
CVBM
20
11
0
15 Aug 2019
IoU-balanced Loss Functions for Single-stage Object Detection
IoU-balanced Loss Functions for Single-stage Object Detection
Shengkai Wu
Jinrong Yang
Xinggang Wang
Xiaoping Li
ObjD
26
101
0
15 Aug 2019
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended
  Multi-Task Learning
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning
Pengfei Wang
Chengquan Zhang
Fei Qi
Zuming Huang
Mengyi En
Junyu Han
Jingtuo Liu
Errui Ding
Guangming Shi
37
59
0
15 Aug 2019
Sex Trafficking Detection with Ordinal Regression Neural Networks
Sex Trafficking Detection with Ordinal Regression Neural Networks
Longshaokan Wang
E. Laber
Yeng Saanchi
Sherrie Caltagirone
17
14
0
15 Aug 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target
  Adversarial Network Once
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
Jiangfan Han
Xiaoyi Dong
Ruimao Zhang
Dongdong Chen
Weiming Zhang
Nenghai Yu
Ping Luo
Xiaogang Wang
AAML
24
28
0
14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question
  Answering
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lio
Aaron Courville
16
17
0
14 Aug 2019
Why Does a Visual Question Have Different Answers?
Why Does a Visual Question Have Different Answers?
Nilavra Bhattacharya
Qing Li
Danna Gurari
31
65
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
27
38
0
12 Aug 2019
Explicit Shape Encoding for Real-Time Instance Segmentation
Explicit Shape Encoding for Real-Time Instance Segmentation
Wenqiang Xu
Haiyang Wang
Fubo Qi
Cewu Lu
41
104
0
12 Aug 2019
Mix & Match: training convnets with mixed image sizes for improved
  accuracy, speed and scale resiliency
Mix & Match: training convnets with mixed image sizes for improved accuracy, speed and scale resiliency
Elad Hoffer
Berry Weinstein
Itay Hubara
Tal Ben-Nun
Torsten Hoefler
Daniel Soudry
29
20
0
12 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
22
146
0
12 Aug 2019
Robust Online Multi-target Visual Tracking using a HISP Filter with
  Discriminative Deep Appearance Learning
Robust Online Multi-target Visual Tracking using a HISP Filter with Discriminative Deep Appearance Learning
N. L. Baisa
38
23
0
11 Aug 2019
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial
  Cross-Task Distillation
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation
Jogendra Nath Kundu
Nishank Lakkakula
R. Venkatesh Babu
19
59
0
11 Aug 2019
Delving into Robust Object Detection from Unmanned Aerial Vehicles: A
  Deep Nuisance Disentanglement Approach
Delving into Robust Object Detection from Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach
Zhenyu Wu
Karthik Suresh
Priya Narayanan
Hongyu Xu
H. Kwon
Zhangyang Wang
AAML
31
76
0
11 Aug 2019
IoU Loss for 2D/3D Object Detection
IoU Loss for 2D/3D Object Detection
Dingfu Zhou
Jin Fang
Xibin Song
Chenye Guan
Junbo Yin
Yuchao Dai
Ruigang Yang
18
379
0
11 Aug 2019
MobileFAN: Transferring Deep Hidden Representation for Face Alignment
MobileFAN: Transferring Deep Hidden Representation for Face Alignment
Yang Zhao
Yifan Liu
Chunhua Shen
Yongsheng Gao
Shengwu Xiong
CVBM
34
39
0
11 Aug 2019
Object-Aware Instance Labeling for Weakly Supervised Object Detection
Object-Aware Instance Labeling for Weakly Supervised Object Detection
Satoshi Kosugi
T. Yamasaki
Kiyoharu Aizawa
WSOD
27
54
0
10 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
25
82
0
10 Aug 2019
Bayesian Loss for Crowd Count Estimation with Point Supervision
Bayesian Loss for Crowd Count Estimation with Point Supervision
Zhiheng Ma
Xing Wei
Xiaopeng Hong
Yihong Gong
3DPC
48
483
0
10 Aug 2019
Recent Advances in Deep Learning for Object Detection
Recent Advances in Deep Learning for Object Detection
Xiongwei Wu
Doyen Sahoo
Guosheng Lin
VLM
ObjD
42
801
0
10 Aug 2019
Star-convex Polyhedra for 3D Object Detection and Segmentation in
  Microscopy
Star-convex Polyhedra for 3D Object Detection and Segmentation in Microscopy
Martin Weigert
Uwe Schmidt
Robert Haase
Ko Sugawara
E. Myers
36
363
0
09 Aug 2019
VisualBERT: A Simple and Performant Baseline for Vision and Language
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
84
1,924
0
09 Aug 2019
Question-Agnostic Attention for Visual Question Answering
Question-Agnostic Attention for Visual Question Answering
M. Farazi
Salman H Khan
Nick Barnes
13
10
0
09 Aug 2019
Sim-to-Real Learning for Casualty Detection from Ground Projected Point
  Cloud Data
Sim-to-Real Learning for Casualty Detection from Ground Projected Point Cloud Data
Roni Permana Saputra
Nemanja Rakićević
Petar Kormushev
3DH
3DPC
19
8
0
08 Aug 2019
Location Field Descriptors: Single Image 3D Model Retrieval in the Wild
Location Field Descriptors: Single Image 3D Model Retrieval in the Wild
Alexander Grabner
P. Roth
Vincent Lepetit
3DPC
27
36
0
07 Aug 2019
An Adaptive Supervision Framework for Active Learning in Object
  Detection
An Adaptive Supervision Framework for Active Learning in Object Detection
Sai Vikas Desai
Akshay L Chandra
Wei Guo
S. Ninomiya
V. Balasubramanian
29
41
0
07 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
149
3,645
0
06 Aug 2019
Previous
123...122123124...153154155
Next