Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 7,713 papers shown
Title
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
Alex Schwing
BDL
VLM
26
66
0
22 Aug 2019
Object detection on aerial imagery using CenterNet
D. Pailla
V. Kollerathu
Sai Saketh Chennamsetty
ObjD
21
11
0
22 Aug 2019
InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting
Haoshu Fang
Jianhua Sun
Runzhong Wang
Minghao Gou
Yong-Lu Li
Cewu Lu
ISeg
19
206
0
21 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision
Gamaleldin F. Elsayed
Simon Kornblith
Quoc V. Le
VLM
29
72
0
20 Aug 2019
On Object Symmetries and 6D Pose Estimation from Images
Giorgia Pitteri
Michael Ramamonjisoa
Slobodan Ilic
Vincent Lepetit
11
51
0
20 Aug 2019
Phrase Localization Without Paired Training Examples
Josiah Wang
Lucia Specia
35
42
0
20 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLM
MLLM
156
2,457
0
20 Aug 2019
A Novel method for IDC Prediction in Breast Cancer Histopathology images using Deep Residual Neural Networks
Chandra Churh Chatterjee
G. Krishna
36
9
0
20 Aug 2019
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
195
3,550
0
20 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
30
147
0
20 Aug 2019
Zero-Shot Grounding of Objects from Natural Language Queries
Arka Sadhu
Kan Chen
Ram Nevatia
ObjD
30
157
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
23
0
19 Aug 2019
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
24
824
0
19 Aug 2019
A Kings Ransom for Encryption: Ransomware Classification using Augmented One-Shot Learning and Bayesian Approximation
Amir Atapour-Abarghouei
Stephen Bonner
A. Mcgough
40
7
0
19 Aug 2019
C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks
Dongming Yang
YueXian Zou
Jian Zhang
Ge Li
ObjD
20
9
0
19 Aug 2019
Weakly-supervised Action Localization with Background Modeling
P. Nguyen
Deva Ramanan
Charless C. Fowlkes
SSL
WSOL
30
157
0
19 Aug 2019
A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Huizi Mao
Xiaodong Yang
W. Dally
24
39
0
18 Aug 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
ObjD
17
361
0
18 Aug 2019
Anomaly Detection in Video Sequence with Appearance-Motion Correspondence
Trong-Nguyen Nguyen
J. Meunier
26
344
0
17 Aug 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
Gen Li
Nan Duan
Yuejian Fang
Ming Gong
Daxin Jiang
Ming Zhou
SSL
VLM
MLLM
91
895
0
16 Aug 2019
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
Wenhai Wang
Enze Xie
Xiaoge Song
Yuhang Zang
Wenjia Wang
Tong Lu
Gang Yu
Chunhua Shen
29
415
0
16 Aug 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Zhaoyang Zhang
Jingyu Li
Wenqi Shao
Zhanglin Peng
Ruimao Zhang
Xiaogang Wang
Ping Luo
30
37
0
16 Aug 2019
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
15
448
0
16 Aug 2019
Deep Sparse Band Selection for Hyperspectral Face Recognition
Fariborz Taherkhani
J. Dawson
Nasser M. Nasrabadi
CVBM
20
11
0
15 Aug 2019
IoU-balanced Loss Functions for Single-stage Object Detection
Shengkai Wu
Jinrong Yang
Xinggang Wang
Xiaoping Li
ObjD
26
101
0
15 Aug 2019
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning
Pengfei Wang
Chengquan Zhang
Fei Qi
Zuming Huang
Mengyi En
Junyu Han
Jingtuo Liu
Errui Ding
Guangming Shi
37
59
0
15 Aug 2019
Sex Trafficking Detection with Ordinal Regression Neural Networks
Longshaokan Wang
E. Laber
Yeng Saanchi
Sherrie Caltagirone
17
14
0
15 Aug 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
Jiangfan Han
Xiaoyi Dong
Ruimao Zhang
Dongdong Chen
Weiming Zhang
Nenghai Yu
Ping Luo
Xiaogang Wang
AAML
24
28
0
14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lio
Aaron Courville
16
17
0
14 Aug 2019
Why Does a Visual Question Have Different Answers?
Nilavra Bhattacharya
Qing Li
Danna Gurari
31
65
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
27
38
0
12 Aug 2019
Explicit Shape Encoding for Real-Time Instance Segmentation
Wenqiang Xu
Haiyang Wang
Fubo Qi
Cewu Lu
41
104
0
12 Aug 2019
Mix & Match: training convnets with mixed image sizes for improved accuracy, speed and scale resiliency
Elad Hoffer
Berry Weinstein
Itay Hubara
Tal Ben-Nun
Torsten Hoefler
Daniel Soudry
29
20
0
12 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
22
146
0
12 Aug 2019
Robust Online Multi-target Visual Tracking using a HISP Filter with Discriminative Deep Appearance Learning
N. L. Baisa
38
23
0
11 Aug 2019
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation
Jogendra Nath Kundu
Nishank Lakkakula
R. Venkatesh Babu
19
59
0
11 Aug 2019
Delving into Robust Object Detection from Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach
Zhenyu Wu
Karthik Suresh
Priya Narayanan
Hongyu Xu
H. Kwon
Zhangyang Wang
AAML
31
76
0
11 Aug 2019
IoU Loss for 2D/3D Object Detection
Dingfu Zhou
Jin Fang
Xibin Song
Chenye Guan
Junbo Yin
Yuchao Dai
Ruigang Yang
18
379
0
11 Aug 2019
MobileFAN: Transferring Deep Hidden Representation for Face Alignment
Yang Zhao
Yifan Liu
Chunhua Shen
Yongsheng Gao
Shengwu Xiong
CVBM
34
39
0
11 Aug 2019
Object-Aware Instance Labeling for Weakly Supervised Object Detection
Satoshi Kosugi
T. Yamasaki
Kiyoharu Aizawa
WSOD
27
54
0
10 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
25
82
0
10 Aug 2019
Bayesian Loss for Crowd Count Estimation with Point Supervision
Zhiheng Ma
Xing Wei
Xiaopeng Hong
Yihong Gong
3DPC
48
483
0
10 Aug 2019
Recent Advances in Deep Learning for Object Detection
Xiongwei Wu
Doyen Sahoo
Guosheng Lin
VLM
ObjD
42
801
0
10 Aug 2019
Star-convex Polyhedra for 3D Object Detection and Segmentation in Microscopy
Martin Weigert
Uwe Schmidt
Robert Haase
Ko Sugawara
E. Myers
36
363
0
09 Aug 2019
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
84
1,924
0
09 Aug 2019
Question-Agnostic Attention for Visual Question Answering
M. Farazi
Salman H Khan
Nick Barnes
13
10
0
09 Aug 2019
Sim-to-Real Learning for Casualty Detection from Ground Projected Point Cloud Data
Roni Permana Saputra
Nemanja Rakićević
Petar Kormushev
3DH
3DPC
19
8
0
08 Aug 2019
Location Field Descriptors: Single Image 3D Model Retrieval in the Wild
Alexander Grabner
P. Roth
Vincent Lepetit
3DPC
27
36
0
07 Aug 2019
An Adaptive Supervision Framework for Active Learning in Object Detection
Sai Vikas Desai
Akshay L Chandra
Wei Guo
S. Ninomiya
V. Balasubramanian
29
41
0
07 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
149
3,645
0
06 Aug 2019
Previous
1
2
3
...
122
123
124
...
153
154
155
Next