Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,498 papers shown
Title
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
Andrei Dumitriu
Florin Tatui
Florin Miron
Aakash Ralhan
Radu Tudor Ionescu
Radu Timofte
112
0
0
01 Apr 2025
Real-Time Navigation for Autonomous Aerial Vehicles Using Video
Khizar Anjum
Parul Pandey
Vidyasagar Sadhu
Roberto Tron
D. Pompili
115
0
0
01 Apr 2025
3D Dental Model Segmentation with Geometrical Boundary Preserving
Shufan Xi
Zexian Liu
Junlin Chang
Hongyu Wu
Xiaogang Wang
Aimin Hao
3DV
3DPC
85
0
0
31 Mar 2025
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
Hongxiang Jiang
Jihao Yin
Qixiong Wang
Jiaqi Feng
Guo Chen
105
1
0
30 Mar 2025
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection
Marc-Antoine Lavoie
Anas Mahmoud
Steven Waslander
132
0
0
29 Mar 2025
VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving
Haibo Hu
Jiacheng Zuo
Yang Lou
Yufei Cui
Jianping Wang
Nan Guan
Jin Wang
Yung-Hui Li
Chun Jason Xue
VLM
122
1
0
29 Mar 2025
A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery
Pengyu Chen
Sicheng Wang
Cuizhen Wang
Senrong Wang
Beiao Huang
Lu Huang
Zhe Zang
119
0
0
29 Mar 2025
RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations
Bin Zhang
Jinggang Chen
Xiaoyang Qu
Guokuan Li
Kai Lu
Jiguang Wan
Jing Xiao
Jianzong Wang
ObjD
99
1
0
28 Mar 2025
AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification
Earl Ranario
Lars Lundqvist
Heesup Yun
Brian N Bailey
J. M. Earles
VLM
69
0
0
27 Mar 2025
Embedding Compression Distortion in Video Coding for Machines
Yizhou Sun
Yao-Min Zhao
Meiqin Liu
Chao Yao
Weisi Lin
77
0
0
27 Mar 2025
Multimodal surface defect detection from wooden logs for sawing optimization
Bořek Reich
Matej Kunda
Fedor Zolotarev
Tuomas Eerola
Pavel Zemčík
Tomi Kauppi
80
0
0
27 Mar 2025
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou
Wei Ji
Rui Ma
Li Cheng
ViT
131
0
0
27 Mar 2025
RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task
Chunshan Li
Rong Wang
Xiaofei Yang
Dianhui Chu
160
0
0
26 Mar 2025
TerraTorch: The Geospatial Foundation Models Toolkit
Carlos Gomes
Benedikt Blumenstiel
Joao Lucas de Sousa Almeida
Pedro Henrique de Oliveira
P. Fraccaro
Francesc Marti Escofet
Daniela Szwarcman
Naomi Simumba
Romeo Kienzler
Bianca Zadrozny
172
1
0
26 Mar 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
125
0
0
26 Mar 2025
Bandwidth Allocation for Cloud-Augmented Autonomous Driving
Peter Schafhalter
Alexander Krentsel
Joseph E. Gonzalez
Sylvia Ratnasamy
S. Shenker
Ion Stoica
113
0
0
26 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang
Xinzhu Ma
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGS
AI4TS
102
0
0
26 Mar 2025
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
Jan Kohút
Martin Dočekal
Michal Hradiš
Marek Vaško
95
0
0
25 Mar 2025
Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines
Junle Liu
Yun Zhang
Zixi Guo
79
0
0
25 Mar 2025
SFDLA: Source-Free Document Layout Analysis
Sebastian Tewes
Yufan Chen
Omar Moured
Jiaming Zhang
Rainer Stiefelhagen
105
0
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
131
0
0
24 Mar 2025
Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics
Md. Barkat Ullah Tusher
Shartaz Khan Akash
Amirul Islam Showmik
64
0
0
24 Mar 2025
Distilling Stereo Networks for Performant and Efficient Leaner Networks
Rafia Rahim
Samuel Woerz
A. Zell
188
0
0
24 Mar 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
154
0
0
24 Mar 2025
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Sara Al-Emadi
Yin Yang
Ferda Ofli
73
0
0
24 Mar 2025
Vision-Guided Loco-Manipulation with a Snake Robot
Adarsh Salagame
Sasank Potluri
Keshav Bharadwaj Vaidyanathan
Kruthika Gangaraju
Eric N. Sihite
Milad Ramezani
Alireza Ramezani
91
0
0
24 Mar 2025
Frequency Dynamic Convolution for Dense Image Prediction
Linwei Chen
Lin Gu
Liang Li
C. Yan
Ying Fu
71
0
0
24 Mar 2025
Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining
Guanglu Dong
Tianheng Zheng
Yuanzhouhan Cao
L. Qing
Chao Ren
DiffM
152
0
0
24 Mar 2025
Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms
Nachuan Ma
Zhengfei Song
Qiang Hu
Chuang-Wei Liu
Yu Han
Yanting Zhang
Rui Fan
Lihua Xie
115
0
0
23 Mar 2025
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning
Yufei Zhan
Yousong Zhu
Shurong Zheng
Hongyin Zhao
Fan Yang
Ming Tang
Jinqiao Wang
VLM
125
19
0
23 Mar 2025
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability
P. Hill
Zhiming Liu
Nantheera Anantrasirichai
Mamba
126
0
0
22 Mar 2025
A Causal Adjustment Module for Debiasing Scene Graph Generation
Li Liu
Shuzhou Sun
Shuaifeng Zhi
Fan Shi
Zhen Liu
J. Heikkilä
Yongxiang Liu
CML
93
2
0
22 Mar 2025
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
Yu Wang
Junxian Mu
Hongzhi Huang
Qilong Wang
Pengfei Zhu
Q. Hu
253
1
0
22 Mar 2025
EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision
Xiaofeng Mao
YueFeng Chen
Rong Zhang
Hui Xue
Zhao Li
Hang Su
AAML
VLM
83
0
0
21 Mar 2025
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Ting Sun
Cheng Cui
Yuning Du
Yi Liu
111
1
0
21 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
90
3
0
20 Mar 2025
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation
Martin Kostelník
Karel Beneš
Michal Hradiš
72
0
0
20 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
119
0
0
20 Mar 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
117
0
0
19 Mar 2025
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion
Saad Lahlali
Sandra Kara
Hejer Ammar
Florian Chabot
Nicolas Granger
Hervé Le Borgne
Q. C. Pham
3DPC
107
0
0
19 Mar 2025
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark
Ying Liu
Yijing Hua
Haojiang Chai
Yanbo Wang
TengQi Ye
ObjD
115
0
0
19 Mar 2025
DCA: Dividing and Conquering Amnesia in Incremental Object Detection
Aoting Zhang
Dongbao Yang
Chang-Shu Liu
Xiaopeng Hong
Miao Shang
Yu Zhou
CLL
112
0
0
19 Mar 2025
A Language Vision Model Approach for Automated Tumor Contouring in Radiation Oncology
Yi Luo
H. Hooshangnejad
Xue Feng
Gaofeng Huang
Xiao Chen
Rui Zhang
Quan Chen
Wil Ngwa
Kai Ding
77
0
0
19 Mar 2025
Test-Time Backdoor Detection for Object Detection Models
Hangtao Zhang
Yichen Wang
Shihui Yan
Chenyu Zhu
Ziqi Zhou
Linshan Hou
Shengshan Hu
Minghui Li
Yanjun Zhang
L. Zhang
AAML
89
1
0
19 Mar 2025
Universal Scene Graph Generation
Shengqiong Wu
Hao Fei
Tat-Seng Chua
157
0
0
19 Mar 2025
Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic
Monika Shah
Somdeb Sarkhel
Deepak Venugopal
MLLM
BDL
VLM
136
0
0
18 Mar 2025
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
Sayak Nag
Udita Ghosh
Sarosij Bose
Calvin-Khang Ta
Jiachen Li
Amit K. Roy-Chowdhury
225
0
0
18 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
145
1
0
18 Mar 2025
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
Xinqing Li
Ruiqi Song
Qingyu Xie
Ye Wu
Nanxin Zeng
Yunfeng Ai
VGen
SyDa
111
2
0
18 Mar 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
Qingbin Liu
LM&Ro
170
3
0
18 Mar 2025
Previous
1
2
3
...
5
6
7
...
208
209
210
Next