1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,513 papers shown
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA
Michal Danilowicz
T. Kryjak
VOT
97
0
0
17 Mar 2025
Finite Samples for Shallow Neural Networks
Yu Xia
Zhiqiang Xu
72
0
0
17 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
94
0
0
17 Mar 2025
ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing
Aditi Tiwari
Klara Nahrstedt
130
2
0
17 Mar 2025
Hybrid Learners Do Not Forget: A Brain-Inspired Neuro-Symbolic Approach to Continual Learning
Amin Banayeeanzade
Mohammad Rostami
CLL
105
0
0
16 Mar 2025
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
118
0
0
15 Mar 2025
Gun Detection Using Combined Human Pose and Weapon Appearance
Amulya Reddy Maligireddy
Manohar Reddy Uppula
Nidhi Rastogi
Yaswanth Reddy Parla
110
0
0
15 Mar 2025
An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation
Aziz Amari
Mariem Makni
Wissal Fnaich
Akram Lahmar
Fedi Koubaa
Oumayma Charrad
Mohamed Ali Zormati
Rabaa Youssef Douss
77
0
0
15 Mar 2025
SPRINT: Script-agnostic Structure Recognition in Tables
Dhruv Kudale
Badri Vishal Kasuba
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
LMTD
166
1
0
15 Mar 2025
Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing
Boyu Chen
A. L. Solebo
Daqian Shi
Jinge Wu
Paul Taylor
121
0
0
15 Mar 2025
DLA-Count: Dynamic Label Assignment Network for Dense Cell Distribution Counting
Yuqing Yan
Yirui Wu
80
0
0
15 Mar 2025
Salient Temporal Encoding for Dynamic Scene Graph Generation
Zhihao Zhu
96
0
0
15 Mar 2025
Robotic Sim-to-Real Transfer for Long-Horizon Pick-and-Place Tasks in the Robotic Sim2Real Competition
Ming Yang
Hongyu Cao
Lixuan Zhao
Chenrui Zhang
Yaran Chen
97
0
0
14 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
504
0
0
14 Mar 2025
Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume
Reef Alturki
A. Hilton
Jean-Yves Guillemaut
80
0
0
14 Mar 2025
6D Object Pose Tracking in Internet Videos for Robotic Manipulation
Georgy Ponimatkin
Martin Cífka
Tomáš Souček
Médéric Fourmy
Yann Labbé
Vladimir Petrik
Josef Sivic
90
1
0
13 Mar 2025
Deep Learning-Based Automated Workflow for Accurate Segmentation and Measurement of Abdominal Organs in CT Scans
Praveen Shastry
Ashok Sharma
Kavya Mohan
Naveen Kumarasami
Anandakumar D
Mounigasri M
Keerthana R
Kishore Prasath Venkatesh
Bargava Subramanian
Kalyan Sivasailam
131
1
0
13 Mar 2025
Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection
Zihao Zhang
Aming Wu
Yahong Han
ObjD
110
0
0
13 Mar 2025
OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
Jiali Yao
Xinran Deng
Xin Gu
Mengrui Dai
Bing Fan
Zhipeng Zhang
Yan Huang
Heng Fan
L. Zhang
161
0
0
13 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD
VLM
87
0
0
13 Mar 2025
RFUAV: A Benchmark Dataset for Unmanned Aerial Vehicle Detection and Identification
Rui Shi
Xiaodong Yu
Shengming Wang
Yijia Zhang
Lu Xu
Peng Pan
Chunlai Ma
128
0
0
12 Mar 2025
Implicit Contrastive Representation Learning with Guided Stop-gradient
Byeongchan Lee
Sehyun Lee
SSL
281
2
0
12 Mar 2025
Fully-Synthetic Training for Visual Quality Inspection in Automotive Production
Christoph Huber
Dino Knoll
Michael Guthe
108
1
0
12 Mar 2025
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model
Yuzhi Lai
Shenghai Yuan
Youssef Nassar
Mingyu Fan
T. Weber
Matthias Rätsch
LM&Ro
136
3
0
12 Mar 2025
SuperCap: Multi-resolution Superpixel-based Image Captioning
Henry Senior
Luca Rossi
Gregory Slabaugh
Shanxin Yuan
VLM
110
0
0
11 Mar 2025
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method
Fei Wang
Chong Chen
Hongyu Chen
Yugang Chang
Weiming Zeng
ObjD
122
0
0
11 Mar 2025
XR-VLM: Cross-Relationship Modeling with Multi-part Prompts and Visual Features for Fine-Grained Recognition
Chuanming Wang
Henming Mao
Huanhuan Zhang
Huiyuan Fu
Huadong Ma
VLM
103
0
0
10 Mar 2025
Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection
Weicheng He
Changshun Wu
Chih-Hong Cheng
Xiaowei Huang
Saddek Bensalem
OODD
114
0
0
10 Mar 2025
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue
Ryusuke Miyamoto
73
0
0
10 Mar 2025
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements
Haiyang Xie
Xi Shen
Shihua Huang
Qirui Wang
Zheng Wang
123
0
0
10 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLM
ObjD
142
6
0
10 Mar 2025
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
Wentao Wu
Chenglong Li
Xinyu Wang
Bin Luo
Qi Liu
71
1
0
10 Mar 2025
YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion Fusion
Hanqing Guo
Xiuxiu Lin
Shiyu Zhao
440
0
0
10 Mar 2025
Analysis of 3D Urticaceae Pollen Classification Using Deep Learning Models
Tijs Konijn
Imaan Bijl
Lu Cao
Fons Verbeek
114
0
0
10 Mar 2025
Split-n-Chain: Privacy-Preserving Multi-Node Split Learning with Blockchain-Based Auditability
Mukesh Sahani
Binanda Sengupta
FedML
97
0
0
10 Mar 2025
AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection
Jialin Lu
Junjie Shan
Ziqi Zhao
Ka-Ho Chow
AAML
160
0
0
09 Mar 2025
SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model
Jing Zhang
Zhiyu Li
Qingyi Gu
MQ
VLM
79
0
0
09 Mar 2025
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
Yifan Chang
Junjie Huang
Xiaofeng Wang
Yun Ye
Zhujin Liang
Yi Shan
Dalong Du
Xingang Wang
3DPC
146
0
0
08 Mar 2025
Removing Multiple Hybrid Adverse Weather in Video via a Unified Model
Yecong Wan
Mingwen Shao
Yuanshuo Cheng
Jun Shu
Shuigen Wang
100
0
0
08 Mar 2025
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
Qizhen Lan
Qing Tian
VLM
107
0
0
08 Mar 2025
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Seil Kang
Jinyeong Kim
Junhyeok Kim
Seong Jae Hwang
VLM
138
5
0
08 Mar 2025
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling
Y. Wu
Feiran Zhang
Tianyuan Shi
Ruicheng Yin
Zhenghua Wang
Zhenliang Gan
Xinyu Wang
Changze Lv
Xiaoqing Zheng
Xuanjing Huang
145
0
0
08 Mar 2025
ColFigPhotoAttnNet: Reliable Finger Photo Presentation Attack Detection Leveraging Window-Attention on Color Spaces
Anudeep Vurity
Emanuela Marasco
Raghavendra Ramachandra
Jongwoo Park
AAML
79
1
0
07 Mar 2025
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation
Amin Karimi
Charalambos Poullis
VLM
128
0
0
06 Mar 2025
DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO
Aditya Prashant Naidu
Hem Gosalia
Ishaan Gakhar
Shaurya Singh Rathore
Krish Didwania
Ujjwal Verma
87
0
0
06 Mar 2025
Inclusive STEAM Education: A Framework for Teaching Coding and Robotics to Students with Visually Impairment Using Advanced Computer Vision
Mahmoud Hamash
Md Raqib Khan
Peter Tiernan
67
0
0
06 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
97
1
0
06 Mar 2025
A lightweight model FDM-YOLO for small target improvement based on YOLOv8
Xuerui Zhang
ObjD
107
0
0
06 Mar 2025
Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection
Riccardo De Monte
Davide Dalle Pezze
Gian Antonio Susto
CLL
120
0
0
06 Mar 2025
Fractional Correspondence Framework in Detection Transformer
Masoumeh Zareapoor
Pourya Shamsolmoali
Huiyu Zhou
Yue Lu
Salvador García
90
0
0
06 Mar 2025