1506.01497
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,529 papers shown
Title
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection
Matthias Bartolo
D. Seychell
Josef Bajada
49
1
0
13 Aug 2024
Unified-IoU: For High-Quality Object Detection
Xiangjie Luo
Zhihao Cai
Bo Shao
Yingxun Wang
NoLa
81
3
0
13 Aug 2024
DePatch: Towards Robust Adversarial Patch for Evading Person Detectors in the Real World
Jikang Cheng
Ying Zhang
Zhongyuan Wang
Zou Qin
Chen Li
AAML
60
0
0
13 Aug 2024
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection
Junjie Guo
Chenqiang Gao
Fangcen Liu
Deyu Meng
ViT
116
3
0
12 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
126
6
0
12 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
79
0
0
11 Aug 2024
FADE: A Dataset for Detecting Falling Objects around Buildings in Video
Zhigang Tu
Zitao Gao
Zhengbo Zhang
Chunluan Zhou
Junsong Yuan
Bo Du
119
0
0
11 Aug 2024
A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot
Haoxuan Ding
Qi Wang
Junyu Gao
Qiang Li
VLM
81
0
0
11 Aug 2024
Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets
Ghazal Kaviani
Reza Marzban
Ghassan AlRegib
52
0
0
11 Aug 2024
PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection
Yingjie Gao
Yanan Zhang
Ziyue Huang
Nanqing Liu
Di Huang
ObjD
106
2
0
11 Aug 2024
Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Hao Fu
Jinzhe Xue
Bin He
141
1
0
10 Aug 2024
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
Long Bai
Guankun Wang
Mobarakol Islam
Lalithkumar Seenivasan
An-Chi Wang
Hongliang Ren
117
17
0
09 Aug 2024
UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios
Ragib Amin Nihal
Benjamin Yen
Katsutoshi Itoyama
Kazuhiro Nakadai
62
2
0
09 Aug 2024
On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey
Jingcai Guo
Zhijie Rao
Zhi Chen
Song Guo
Jingren Zhou
Dacheng Tao
124
3
0
09 Aug 2024
MSG-Chart: Multimodal Scene Graph for ChartQA
Yue Dai
Soyeon Caren Han
Wei Liu
41
1
0
09 Aug 2024
SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes
Boshra Khalili
Andrew W. Smyth
ObjD
95
3
0
08 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
99
0
0
08 Aug 2024
Detection of Animal Movement from Weather Radar using Self-Supervised Learning
Mubin Ul Haque
J. Dabrowski
Rebecca M. Rogers
Hazel Parry
SSL
101
0
0
08 Aug 2024
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
William Y. Zhu
Keren Ye
Junjie Ke
Jiahui Yu
Leonidas Guibas
P. Milanfar
Feng Yang
107
2
0
07 Aug 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
77
1
0
07 Aug 2024
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model
Guoqing Zhu
Honghu Pan
Qiang Wang
Chao Tian
Chao Yang
Zhenyu He
84
1
0
07 Aug 2024
Path-based Design Model for Constructing and Exploring Alternative Visualisations
James R. Jackson
Panagiotis D. Ritsos
P. Butcher
Jonathan C. Roberts
26
1
0
07 Aug 2024
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context Relation Modeling
Seok Hwan Lee
Taein Son
Soo Won Seo
Jisong Kim
Jun Won Choi
112
0
0
07 Aug 2024
Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach
B. Hosp
Björn Severitt
R. Agarwala
Evgenia Rusak
Yannick Sauer
Siegfried Wahl
120
2
0
07 Aug 2024
GUI Element Detection Using SOTA YOLO Deep Learning Models
Seyed Shayan Daneshvar
Shaowei Wang
71
1
0
07 Aug 2024
Nighttime Pedestrian Detection Based on Fore-Background Contrast Learning
He Yao
Yongjun Zhang
Huachun Jian
Li Zhang
Ruzhong Cheng
67
11
0
06 Aug 2024
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection
Sen Nie
Zhuo Wang
Xinxin Wang
Kun He
DiffM
158
0
0
06 Aug 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
119
6
0
05 Aug 2024
AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines
Renjith Prasad
Chathurangi Shyalika
Ramtin Zand
Fadi El Kalach
R. Venkataramanan
R. Harik
Amit P. Sheth
77
1
0
05 Aug 2024
Supervised Image Translation from Visible to Infrared Domain for Object Detection
Prahlad Anand
Qiranul Saadiyean
Aniruddh Sikdar
N. Nalini
Suresh Sundaram
72
1
0
03 Aug 2024
Domain penalisation for improved Out-of-Distribution Generalisation
Shuvam Jena
Sushmetha Sumathi Rajendran
Karthik Seemakurthy
A. Sasithradevi
M. Vijayalakshmi
Prakash Poornachari
93
0
0
03 Aug 2024
LAM3D: Leveraging Attention for Monocular 3D Object Detection
Diana-Alexandra Sas
Leandro Di Bella
Yangxintong Lyu
F. Oniga
Adrian Munteanu
94
1
0
03 Aug 2024
Autonomous Integration of Bench-Top Wet Lab Equipment
Zachary Logan
Kam Undieh
Mohammad Goli
47
0
0
02 Aug 2024
PC²: Pseudo-Classification Based Pseudo-Captioning for Noisy Correspondence Learning in Cross-Modal Retrieval
Yue Duan
Zhangxuan Gu
ZhenZhe Ying
Wei Li
Yu Zhang
Zibin Zheng
57
2
0
02 Aug 2024
Underwater Object Detection Enhancement via Channel Stabilization
Muhammad Ali
Rita Sevastjanova
51
5
0
02 Aug 2024
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
Donwon Park
Leixian Shen
Se Young Chun
114
2
0
02 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
126
6
0
02 Aug 2024
Compositional Physical Reasoning of Objects and Events from Videos
Zhenfang Chen
Shilong Dong
Kexin Yi
Yunzhu Li
Mingyu Ding
Antonio Torralba
Joshua B. Tenenbaum
Chuang Gan
OCL
139
3
0
02 Aug 2024
MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection
Youjia Fu
Zihao Xu
Junsong Fu
Huixia Xue
Shuqiu Tan
Lei Li
Mamba
84
0
0
01 Aug 2024
A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li
Jun Gao
Chen Chen
Yue Zhang
Jielei Zhang
DiffM
90
5
0
01 Aug 2024
Revocable Backdoor for Deep Model Trading
Yiran Xu
Nan Zhong
Zhenxing Qian
Xinpeng Zhang
AAML
88
0
0
01 Aug 2024
Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation
Wenyuan Chen
Haocong Song
C. Dai
Ziwei Wang
Guanqiao Shan
...
Katy Fatemeh Moosavi
Shruti Pathak
Clifford Librach
Zhuoran Zhang
Changliu Liu
61
2
0
31 Jul 2024
Dynamic Object Queries for Transformer-based Incremental Object Detection
Jichuan Zhang
Wei Li
Shuang Cheng
Yali Li
Shengjin Wang
89
0
0
31 Jul 2024
Spatial Transformer Network YOLO Model for Agricultural Object Detection
Yash Zambre
Ankit Varshney
Akshatha Mohan
Joshua Peeples
62
0
0
31 Jul 2024
MaskUno: Switch-Split Block For Enhancing Instance Segmentation
Jawad Haidar
Marc Mouawad
Imad Elhajj
Daniel C. Asmar
ISeg
60
0
0
31 Jul 2024
SmileyNet -- Towards the Prediction of the Lottery by Reading Tea Leaves with AI
Andreas Birk
32
0
0
31 Jul 2024
Lifelong Person Search
Jae-Won Yang
Seungbin Hong
Jae-Young Sim
CLL
102
0
0
31 Jul 2024
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration
Ngoc Son Nguyen
Van Nguyen
Tung Le
ViT
106
1
0
30 Jul 2024
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
82
0
0
30 Jul 2024
AxiomVision: Accuracy-Guaranteed Adaptive Visual Model Selection for Perspective-Aware Video Analytics
Xiangxiang Dai
Zeyu Zhang
Peng Yang
Yuedong Xu
Xutong Liu
J. C. Lui
107
9
0
29 Jul 2024