Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,498 papers shown
Title
Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection
Jianhong Han
Yupei Wang
Liang Chen
ViT
109
0
0
29 Apr 2025
T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection
Manikanta Varaganti
Amulya Vankayalapati
Nour Awad
Gregory R. Dion
Laura J. Brattain
DiffM
MedIm
119
0
0
29 Apr 2025
Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection
Siwei Wang
Zhiwei Chen
Liujuan Cao
Rongrong Ji
ObjD
117
0
0
29 Apr 2025
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation
Jianyu Wu
Yizhou Wang
Xiangyu Yue
Xinzhu Ma
Jinpei Guo
Dongzhan Zhou
Wanli Ouyang
Shixiang Tang
161
0
0
29 Apr 2025
Crowd Detection Using Very-Fine-Resolution Satellite Imagery
Tong Xiao
Qunming Wang
Ping Lu
Tenghai Huang
Xiaohua Tong
P. M. Atkinson
99
0
0
28 Apr 2025
More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV
Kai Ye
Haidi Tang
Bowen Liu
Pingyang Dai
Liujuan Cao
Rongrong Ji
AI4TS
99
0
0
28 Apr 2025
DG-DETR: Toward Domain Generalized Detection Transformer
Seongmin Hwang
Daeyoung Han
Moongu Jeon
ViT
111
0
0
28 Apr 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
124
0
0
28 Apr 2025
SynergyAmodal: Deocclude Anything with Text Control
Xinyang Li
Chengjie Yi
Jiawei Lai
Mingbao Lin
Yansong Qu
Shengchuan Zhang
Liujuan Cao
DiffM
144
0
0
28 Apr 2025
Swapped Logit Distillation via Bi-level Teacher Alignment
Stephen Ekaputra Limantoro
Jhe-Hao Lin
Chih-Yu Wang
Yi-Lung Tsai
Hong-Han Shuai
Ching-Chun Huang
Wen-Huang Cheng
174
0
0
27 Apr 2025
Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality
Majid Behravan
Maryam Haghani
Denis Gračanin
121
1
0
27 Apr 2025
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
Loc Phuc Truong Nguyen
Hung Nguyen
Hung Cao
123
1
0
27 Apr 2025
Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction
Xiaoran Xu
Jiangang Yang
Wenyue Chong
Wenhui Shi
Siyang Song
Jing Xing
Jian Liu
ObjD
VLM
167
0
0
27 Apr 2025
CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes
Tuan Nguyen
Naseem Khan
Issa Khalil
AAML
172
0
0
27 Apr 2025
Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation
Rayson Laroca
Marcelo dos Santos
David Menotti
ObjD
109
1
0
27 Apr 2025
VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation
Niaz Ahmad
Youngmoon Lee
Guanghui Wang
3DH
101
1
0
26 Apr 2025
R-Sparse R-CNN: SAR Ship Detection Based on Background-Aware Sparse Learnable Proposals
Kamirul Kamirul
Odysseas A. Pappas
A. Achim
92
0
0
26 Apr 2025
MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View
Liugang Lu
Dabin He
Congxiang Liu
Zhixiang Deng
83
0
0
25 Apr 2025
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I. Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
167
2
0
25 Apr 2025
Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition
Yin Tang
Jiankai Li
Hongyu Yang
Xuan Dong
Lifeng Fan
Weixin Li
79
0
0
25 Apr 2025
A Large Vision-Language Model based Environment Perception System for Visually Impaired People
Zezhou Chen
Zhaoxiang Liu
Ning Wang
Kohou Wang
Shiguo Lian
233
0
0
25 Apr 2025
S3MOT: Monocular 3D Object Tracking with Selective State Space Model
Zhuohao Yan
Shaoquan Feng
Xingxing Li
Yuxuan Zhou
Chunxi Xia
Shengyu Li
VOT
147
0
0
25 Apr 2025
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Brian K. S. Isaac-Medina
T. Breckon
OODD
494
0
0
25 Apr 2025
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti
Roberto Basla
Riccardo Pieroni
Matteo Corno
S. Savaresi
Luca Magri
Giacomo Boracchi
3DPC
107
0
0
25 Apr 2025
AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models
Mohammad Zarei
Melanie A Jutras
Eliana Evans
Mike Tan
Omid Aaramoon
AAML
DiffM
75
0
0
24 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
103
0
0
24 Apr 2025
A Decade of You Only Look Once (YOLO) for Object Detection
Leo Thomas Ramos
Angel D. Sappa
160
0
0
24 Apr 2025
Improving Open-World Object Localization by Discovering Background
Ashish Singh
Michael Jeffrey Jones
Kuan-Chuan Peng
A. Cherian
Moitreya Chatterjee
Erik Learned-Miller
ObjD
OCL
VLM
123
0
0
24 Apr 2025
RGB-D Tracking via Hierarchical Modality Aggregation and Distribution Network
Boyue Xu
Zifei Shan
Ruichao Hou
Jia Bei
Tongwei Ren
Gangshan Wu
113
1
0
24 Apr 2025
CIVIL: Causal and Intuitive Visual Imitation Learning
Yinlong Dai
Robert Ramirez Sanchez
Ryan Jeronimus
Shahabedin Sagheb
Cara M. Nunez
Heramb Nemlekar
Dylan P. Losey
159
1
0
24 Apr 2025
MTSGL: Multi-Task Structure Guided Learning for Robust and Interpretable SAR Aircraft Recognition
Qishan He
Lingjun Zhao
Ru Luo
Siqian Zhang
Lin Lei
Kefeng Ji
Gangyao Kuang
113
0
0
23 Apr 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Yanzhe Zhang
Chuan Qin
Jing Chen
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
142
1
0
23 Apr 2025
Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection
Jens Petersen
Davide Abati
A. Habibian
Auke Wiggers
ViT
3DPC
91
0
0
23 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
104
0
0
22 Apr 2025
You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection
Jun Dong
Wenli Wu
Jintao Cheng
Xiaoyu Tang
88
0
0
22 Apr 2025
SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems
Manjunath D
Aniruddh Sikdar
Prajwal Gurunath
Sumanth Udupa
Suresh Sundaram
70
1
0
22 Apr 2025
An Efficient Aerial Image Detection with Variable Receptive Fields
Liu Wenbin
51
0
0
21 Apr 2025
SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam
Tue Vo
Lakshay Sharma
Tuan Dinh
Khuong Dinh
T. Nguyen
Trung Phan
Minh Do
Duong Vu
97
0
0
21 Apr 2025
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
122
0
0
21 Apr 2025
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art
Jing Zhang
Dan Guo
Zhangbin Li
Meng Wang
92
0
0
20 Apr 2025
Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network
Lu Pan
Yu-Hsuan Huang
Hongxia Xie
Cheng Zhang
H Zhao
Hong-Han Shuai
Wen-Huang Cheng
63
0
0
19 Apr 2025
ISTD-YOLO: A Multi-Scale Lightweight High-Performance Infrared Small Target Detection Algorithm
Shang Zhang
Yujie Cui
Ruoyan Xiong
Huanbin Zhang
45
0
0
19 Apr 2025
Compile Scene Graphs with Reinforcement Learning
Zuyao Chen
Jinlin Wu
Zhen Lei
Marc Pollefeys
Chang Wen Chen
OffRL
LRM
177
3
0
18 Apr 2025
Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes
Sridevi Polavaram
Xin Zhou
Meenu Ravi
Mohammad Zarei
Anmol Srivastava
47
0
0
18 Apr 2025
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
108
3
0
18 Apr 2025
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
85
0
0
18 Apr 2025
Collaborative Perception Datasets for Autonomous Driving: A Review
N. Wang
Deyong Shang
Yan Gong
X. S. Hu
Caiyan Jia
Hongyu Pan
Yanwen Huang
Xiaoyu Wang
J. Lu
168
0
0
17 Apr 2025
ChartQA-X: Generating Explanations for Visual Chart Reasoning
Shamanthak Hegde
Pooyan Fazli
H. Seifi
111
0
0
17 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
67
0
0
17 Apr 2025
Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification
Reek Majumder
M. Chowdhury
S. Khan
Zadid Khan
Fahim Ahmad
Frank Ngeni
G. Comert
Judith Mwakalonge
Dimitra Michalaka
AAML
49
0
0
17 Apr 2025
Previous
1
2
3
4
5
...
208
209
210
Next