Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMat, ObjD

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 10,525 papers shown
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
Giorgos Savathrakis
Antonis Argyros
ViT
48
0
0
11 Sep 2024
AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models
Boming Miao
Chunxiao Li
Yao Zhu
Weixiang Sun
Zizhe Wang
Xiaoyi Wang
Chuanlong Xie
DiffM, AAML
182
1
0
11 Sep 2024
When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking
Emirhan Bayar
Cemal Aker
VOT
118
0
0
10 Sep 2024
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Xiang Zhang
Yufei Cui
Chenchen Fu
Weiwei Wu
Zihao Wang
Yuyang Sun
Xue Liu
76
0
0
10 Sep 2024
Knowledge Distillation via Query Selection for Detection Transformer
Yi Liu
Luting Wang
Zongheng Tang
Yue Liao
Yifan Sun
Lijun Zhang
Si Liu
110
0
0
10 Sep 2024
ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery
Ang He
Xiaobo Li
Ximei Wu
Chengyue Su
Jing Chen
Sheng Xu
Xiaobin Guo
63
6
0
10 Sep 2024
Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery
Fan Zhang
Lingling Li
Licheng Jiao
Xu Liu
Fang Liu
Shuyuan Yang
B. Hou
ObjD
76
0
0
09 Sep 2024
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
Muraleekrishna Gopinathan
Jumana Abu-Khalaf
David Suter
Martin Masek
88
0
0
09 Sep 2024
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation
Muraleekrishna Gopinathan
Martin Masek
Jumana Abu-Khalaf
David Suter
LM&Ro
83
2
0
09 Sep 2024
Replay Consolidation with Label Propagation for Continual Object Detection
Riccardo De Monte
Davide Dalle Pezze
Marina Ceccon
Francesco Pasti
Francesco Paissan
Elisabetta Farella
Gian Antonio Susto
Nicola Bellotto
147
2
0
09 Sep 2024
Can OOD Object Detectors Learn from Foundation Models?
Jiahui Liu
Xin Wen
Shizhen Zhao
Yukang Chen
Xiaojuan Qi
OODD
94
2
0
08 Sep 2024
A foundation model enpowered by a multi-modal prompt engine for universal seismic geobody interpretation across surveys
Hang Gao
Xinming Wu
Luming Liang
Hanlin Sheng
Xu Si
Gao Hui
Yaxing Li
AI4CE
74
2
0
08 Sep 2024
PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Shiyao Wang
Jiaming Zhou
Shiwan Zhao
Yong Qin
90
1
0
07 Sep 2024
LoCa: Logit Calibration for Knowledge Distillation
Runming Yang
Taiqiang Wu
Yujiu Yang
94
1
0
07 Sep 2024
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes
Bappaditya Dey
Matthias Monden
Victor Blanco
Sandip Halder
S. de Gendt
68
0
0
06 Sep 2024
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement
Hao Luo
Baoliang Chen
Lingyu Zhu
Peilin Chen
Shiqi Wang
3DV
281
2
0
06 Sep 2024
Ground-roll Separation From Land Seismic Records Based on Convolutional Neural Network
Zhuang Jia
Wenkai Lu
Meng Zhang
Yongkang Miao
64
0
0
05 Sep 2024
TG-LMM: Enhancing Medical Image Segmentation Accuracy through Text-Guided Large Multi-Modal Model
Yihao Zhao
Enhao Zhong
Cuiyun Yuan
Yang Li
Man Zhao
Chunxia Li
Jun Hu
Chenbin Liu
VLM, MedIm
106
0
0
05 Sep 2024
Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
Huaxi Huang
Xin Yuan
Qiyu Liao
Dadong Wang
Tongliang Liu
DiffM
86
0
0
05 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
139
2
0
04 Sep 2024
NoiseAttack: An Evasive Sample-Specific Multi-Targeted Backdoor Attack Through White Gaussian Noise
Abdullah Arafat Miah
Kaan Icer
Resit Sendag
Yu Bi
AAML, DiffM
83
1
0
03 Sep 2024
A Modern Take on Visual Relationship Reasoning for Grasp Planning
Paolo Rabino
Tatiana Tommasi
84
1
0
03 Sep 2024
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
Segev Shlomov
Ben Wiesel
Aviad Sela
Ido Levy
Liane Galanti
Roy Abitbol
LLMAG
135
3
0
03 Sep 2024
Latent Distillation for Continual Object Detection at the Edge
Francesco Pasti
Marina Ceccon
Davide Dalle Pezze
Francesco Paissan
Elisabetta Farella
Gian Antonio Susto
Nicola Bellotto
84
4
0
03 Sep 2024
Real-Time Indoor Object Detection based on hybrid CNN-Transformer Approach
Salah Eddine Laidoudi
Madjid Maidi
Samir Otmane
99
0
0
03 Sep 2024
Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization
Avraham Chapman
Haiming Xu
Lingqiao Liu
102
0
0
03 Sep 2024
ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition
Shiting Xiao
Yuhang Li
Youngeun Kim
Donghyun Lee
Priyadarshini Panda
93
1
0
03 Sep 2024
DS MYOLO: A Reliable Object Detector Based on SSMs for Driving Scenarios
Yang Li
Jianli Xiao
88
0
0
02 Sep 2024
Progressive Retinal Image Registration via Global and Local Deformable Transformations
Yepeng Liu
Baosheng Yu
Tian Chen
Yuliang Gu
Bo Du
Yongchao Xu
Jun Cheng
MedIm
92
2
0
02 Sep 2024
ViRED: Prediction of Visual Relations in Engineering Drawings
Chao Gu
Ke Lin
Yiyang Luo
Jiahui Hou
Xiang-Yang Li
83
0
0
02 Sep 2024
Towards Student Actions in Classroom Scenes: New Dataset and Baseline
Zhuolin Tan
Chenqiang Gao
Anyong Qin
Ruixin Chen
Tiecheng Song
Feng Yang
Deyu Meng
87
0
0
02 Sep 2024
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images
Q. Guan
Mengjie Pan
Feng Chen
Zhiqiang Yang
Zhongwen Yu
Qianwei Zhou
Haigen Hu
71
0
0
01 Sep 2024
ProteinRPN: Towards Accurate Protein Function Prediction with Graph-Based Region Proposals
Shania Mitra
Lei Huang
Manolis Kellis
ViT
44
0
0
01 Sep 2024
Mapping earth mounds from space
Baki Uzun
Shivam Pande
Gwendal Cachin-Bernard
M. Pham
Sébastien Lefèvre
Rumais Blatrix
Doyle McKey
28
1
0
31 Aug 2024
The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts
I. de Rodrigo
A. Sanchez-Cuadrado
J. Boal
A. J. Lopez-Lopez
VLM
104
1
0
31 Aug 2024
A method for detecting dead fish on large water surfaces based on improved YOLOv10
Qingbin Tian
Yukang Huo
Mingyuan Yao
Haihua Wang
38
2
0
31 Aug 2024
Look, Learn and Leverage (L$^3$): Mitigating Visual-Domain Shift and Discovering Intrinsic Relations via Symbolic Alignment
Hanchen Xie
Jiageng Zhu
Mahyar Khayatkhoei
Jiazhi Li
Wael AbdAlmageed
OOD
84
0
0
30 Aug 2024
Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations
Ahmed Hammam
B. K. Sreedhar
Nura Kawa
Tim Patzelt
Oliver De Candido
100
0
0
30 Aug 2024
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection
Yanquan Huang
Liu Wei Zhen
Yun Hao
Mengyuan Zhang
Qingyao Wu
Zikun Deng
Xueming Liu
Hong Deng
88
0
0
30 Aug 2024
UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation
P. Rudol
Patrick Doherty
M. Wzorek
Chattrakul Sombattheera
28
0
0
29 Aug 2024
Enhancing Sound Source Localization via False Negative Elimination
Zengjie Song
Jiangshe Zhang
Yuxi Wang
Junsong Fan
Zhaoxiang Zhang
97
0
0
29 Aug 2024
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
K. Nakata
Daisuke Miyashita
Youyang Ng
Yasuto Hoshi
J. Deguchi
71
0
0
29 Aug 2024
Anno-incomplete Multi-dataset Detection
Yiran Xu
Haoxiang Zhong
Kai Wu
Jialin Li
Yong Liu
Chengjie Wang
Shu-Tao Xia
Hongen Liao
ObjD
77
0
0
29 Aug 2024
PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator
Shiguang Wang
Tao Xie
Haijun Liu
Xingcheng Zhang
Jian Cheng
90
2
0
29 Aug 2024
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang
Min-Hung Chen
Shang-Hong Lai
91
0
0
28 Aug 2024
Pixels to Prose: Understanding the art of Image Captioning
Hrishikesh Singh
Aarti Sharma
Millie Pant
3DV, VLM
92
1
0
28 Aug 2024
Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System
Georg Schäfer
Jakob Rehrl
Stefan Huber
Simon Hirlaender
132
3
0
28 Aug 2024
Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients
Rashmi BN
R. Guru
A. AnusuyaM
116
1
0
28 Aug 2024
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
Jiageng Zhu
Hanchen Xie
Jiazhi Li
Mahyar Khayatkhoei
Wael AbdAlmageed
94
1
0
27 Aug 2024
Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
Elona Shatri
George Fazekas
76
2
0
27 Aug 2024