1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,529 papers shown
Title
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
Chuang Yang
Bingxuan Zhao
Qing Zhou
Qi Wang
192
3
0
18 Dec 2024
Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection
Ahmet Oğuz Saltık
Alicia Allmendinger
Anthony Stein
128
5
0
18 Dec 2024
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
Taein Son
Soo Won Seo
Jisong Kim
S. Lee
Jun Won Choi
VGen
154
0
0
18 Dec 2024
Differential Alignment for Domain Adaptive Object Detection
Xinyu He
Xinhui Li
Xiaojie Guo
152
1
0
17 Dec 2024
RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning
Kanghoon Yoon
Kibum Kim
Jaehyung Jeon
Yeonjun In
Donghyun Kim
Chanyoung Park
117
1
0
17 Dec 2024
Open-World Panoptic Segmentation
Matteo Sodano
Federico Magistri
Jens Behley
Cyrill Stachniss
VLM
161
0
0
17 Dec 2024
Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images
Zhifei Shi
Zongyao Yin
Sheng Chang
Xiao Yi
Xianchuan Yu
138
0
0
17 Dec 2024
Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset
Madiyar Alimov
Temirlan Meiramkhanov
ViT
116
1
0
16 Dec 2024
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
131
1
0
16 Dec 2024
Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges
Martin Aubard
Ana Madureira
Luis F. Teixeira
José Pinto
AAML
145
3
0
16 Dec 2024
CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector
Tianheng Qiu
Ka Lung Law
Guanghua Pan
Jufei Wang
Xin Gao
Xuan Huang
Hu Wei
140
0
0
16 Dec 2024
Neural Collapse Inspired Knowledge Distillation
Shuoxi Zhang
Zijian Song
Kun He
191
1
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
109
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
169
1
0
16 Dec 2024
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
130
2
0
16 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
152
0
0
14 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Ziyi Wang
...
Liang Li
Siwei Liu
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
183
0
0
13 Dec 2024
Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering
Sai Bhargav Rongali
M. Cui
Ankit Jha
Neha Bhargava
Saurabh Prasad
Biplab Banerjee
129
0
0
12 Dec 2024
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
Silin Cheng
Yuanpei Liu
Kai Han
EDL
151
0
0
12 Dec 2024
Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach
Kailas PS
Selvakumaran R
Palani Murugan
Ramesh Kumar V
Malaya Kumar Biswal M
139
1
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
216
2
0
12 Dec 2024
DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising
Xiaohu Lu
H. Radha
139
0
0
11 Dec 2024
TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction
Quynh-Mai Thi Nguyen
Lan-Nhi Thi Nguyen
Cam-Van Thi Nguyen
86
0
0
11 Dec 2024
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation
Jiaming Lv
Haoyuan Yang
P. Li
164
3
0
11 Dec 2024
PGRID: Power Grid Reconstruction in Informal Developments Using High-Resolution Aerial Imagery
Simone Fobi Nsutezo
Amrita Gupta
Duncan Kebut
Seema Iyer
Luana Marotti
Rahul Dodhia
J. L. Ferres
Anthony Ortiz
111
0
0
10 Dec 2024
Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients
Simon Kristoffersson Lind
Rudolph Triebel
V. Krüger
129
0
0
10 Dec 2024
Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis
Vladislav Li
Ilias Siniosoglou
Thomai Karamitsou
A. Lytos
Ioannis D. Moscholios
Sotirios K Goudos
Jyoti S. Banerjee
Panagiotis Sarigiannidis
Vasileios Argyriou
3DPC
130
2
0
10 Dec 2024
Swap Path Network for Robust Person Search Pre-training
Lucas Jaffe
A. Zakhor
3DPC
114
0
0
06 Dec 2024
From classical techniques to convolution-based models: A review of object detection algorithms
Fnu Neha
Deepshikha Bhati
Deepak Kumar Shukla
Md. Amiruzzaman
ObjD
VLM
106
3
0
06 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
130
0
0
06 Dec 2024
DrIFT: Autonomous Drone Dataset with Integrated Real and Synthetic Data, Flexible Views, and Transformed Domains
Fardad Dadboud
Hamid Azad
Varun Mehta
M. Bolic
Iraj Mantegh
130
0
0
06 Dec 2024
HyperDefect-YOLO: Enhance YOLO with HyperGraph Computation for Industrial Defect Detection
Zuo Zuo
Jiahao Dong
Yue Gao
Zongze Wu
120
1
0
05 Dec 2024
DEIM: DETR with Improved Matching for Fast Convergence
Shihua Huang
Zhichao Lu
Xiaodong Cun
Yongjun Yu
Xiao Zhou
Xi Shen
VLM
508
8
0
05 Dec 2024
Data Fusion of Semantic and Depth Information in the Context of Object Detection
Md Abu Yusuf
Md Rezaul Karim Khan
Partha Pratim Saha
Mohammed Mahbubur Rahaman
3DPC
91
1
0
04 Dec 2024
Gesture Classification in Artworks Using Contextual Image Features
Azhar Hussian
Mathias Zinnen
Thi My Hang Tran
Andreas Maier
Vincent Christlein
127
0
0
04 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
129
0
0
04 Dec 2024
Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI
Sizhe Xing
Aolong Sun
Chengxi Wang
Yizhi Wang
Boyu Dong
...
Xi Xiao
R. Penty
Qixiang Cheng
Nan Chi
Junwen Zhang
186
0
0
04 Dec 2024
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
Savya Khosla
S. Vallecorsa
Alex Schwing
Derek Hoiem
135
2
0
02 Dec 2024
Behavior Backdoor for Deep Learning Models
Jinqiao Wang
Pengfei Zhang
R. Tao
Jian Yang
Hao Liu
Xianglong Liu
Y. X. Wei
Yao Zhao
AAML
134
0
0
02 Dec 2024
Cerberus: Attribute-based person re-identification using semantic IDs
Chanho Eom
Geon Lee
Kyunghwan Cho
Hyeonseok Jung
Moonsub Jin
Bumsub Ham
111
0
0
02 Dec 2024
DiffPatch: Generating Customizable Adversarial Patches using Diffusion Models
Zhixiang Wang
Guangnan Ye
Xinyu Wang
Siheng Chen
Ziyi Wang
Xingjun Ma
Yu-Gang Jiang
AAML
DiffM
226
0
0
02 Dec 2024
A Cross-Scene Benchmark for Open-World Drone Active Tracking
Haowei Sun
Jinwu Hu
Zhirui Zhang
Haoyuan Tian
Xinze Xie
Yufeng Wang
Zhuliang Yu
Xiaohua Xie
Mingkui Tan
141
0
0
01 Dec 2024
BGM: Background Mixup for X-ray Prohibited Items Detection
Wen Liu
R. Tao
Hongguang Zhu
Yunda Sun
Yao Zhao
Y. X. Wei
173
0
0
30 Nov 2024
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
125
1
0
29 Nov 2024
Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras
Jicheng Yuan
Anh Le-Tuan
Ali Ganbarov
M. Hauswirth
Danh Le-Phuoc
127
0
0
28 Nov 2024
CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction
Lipeng Gu
Xuefeng Yan
Weiming Wang
Honghua Chen
Dingkun Zhu
Liangliang Nan
Mingqiang Wei
150
0
0
28 Nov 2024
Perception of Visual Content: Differences Between Humans and Foundation Models
Nardiena A. Pratama
Shaoyang Fan
Gianluca Demartini
VLM
167
0
0
28 Nov 2024
Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks
Chen Zhou
Peng Cheng
Sihang Li
Yize Zhang
Yibo Yan
Xiaojun Jia
Yanyan Xu
Kaidi Wang
Xiaochun Cao
153
0
0
27 Nov 2024
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?
R. Tao
Haoyu Wang
Yuzhe Guo
Hechang Chen
Li Zhang
Xianglong Liu
Y. X. Wei
Yao-Min Zhao
110
1
0
27 Nov 2024
RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model
Huiyang Hu
Peijin Wang
Hanbo Bi
Boyuan Tong
Zehua Wang
...
Ziqi Zhang
Yaowei Wang
QiXiang Ye
Kun Fu
Xian Sun
325
0
0
27 Nov 2024