Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,525 papers shown
Title
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
Giorgos Savathrakis
Antonis Argyros
ViT
48
0
0
11 Sep 2024
AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models
Boming Miao
Chunxiao Li
Yao Zhu
Weixiang Sun
Zizhe Wang
Xiaoyi Wang
Chuanlong Xie
DiffM
AAML
182
1
0
11 Sep 2024
When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking
Emirhan Bayar
Cemal Aker
VOT
118
0
0
10 Sep 2024
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Xiang Zhang
Yufei Cui
Chenchen Fu
Weiwei Wu
Zihao Wang
Yuyang Sun
Xue Liu
76
0
0
10 Sep 2024
Knowledge Distillation via Query Selection for Detection Transformer
Yi Liu
Luting Wang
Zongheng Tang
Yue Liao
Yifan Sun
Lijun Zhang
Si Liu
110
0
0
10 Sep 2024
ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery
Ang He
Xiaobo Li
Ximei Wu
Chengyue Su
Jing Chen
Sheng Xu
Xiaobin Guo
63
6
0
10 Sep 2024
Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery
Fan Zhang
Lingling Li
Licheng Jiao
Xu Liu
Fang Liu
Shuyuan Yang
B. Hou
ObjD
76
0
0
09 Sep 2024
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
Muraleekrishna Gopinathan
Jumana Abu-Khalaf
David Suter
Martin Masek
88
0
0
09 Sep 2024
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation
Muraleekrishna Gopinathan
Martin Masek
Jumana Abu-Khalaf
David Suter
LM&Ro
83
2
0
09 Sep 2024
Replay Consolidation with Label Propagation for Continual Object Detection
Riccardo De Monte
Davide Dalle Pezze
Marina Ceccon
Francesco Pasti
Francesco Paissan
Elisabetta Farella
Gian Antonio Susto
Nicola Bellotto
147
2
0
09 Sep 2024
Can OOD Object Detectors Learn from Foundation Models?
Jiahui Liu
Xin Wen
Shizhen Zhao
Yukang Chen
Xiaojuan Qi
OODD
94
2
0
08 Sep 2024
A foundation model enpowered by a multi-modal prompt engine for universal seismic geobody interpretation across surveys
Hang Gao
Xinming Wu
Luming Liang
Hanlin Sheng
Xu Si
Gao Hui
Yaxing Li
AI4CE
74
2
0
08 Sep 2024
PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Shiyao Wang
Jiaming Zhou
Shiwan Zhao
Yong Qin
90
1
0
07 Sep 2024
LoCa: Logit Calibration for Knowledge Distillation
Runming Yang
Taiqiang Wu
Yujiu Yang
94
1
0
07 Sep 2024
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes
Bappaditya Dey
Matthias Monden
Victor Blanco
Sandip Halder
S. de Gendt
68
0
0
06 Sep 2024
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement
Hao Luo
Baoliang Chen
Lingyu Zhu
Peilin Chen
Shiqi Wang
3DV
281
2
0
06 Sep 2024
Ground-roll Separation From Land Seismic Records Based on Convolutional Neural Network
Zhuang Jia
Wenkai Lu
Meng Zhang
Yongkang Miao
64
0
0
05 Sep 2024
TG-LMM: Enhancing Medical Image Segmentation Accuracy through Text-Guided Large Multi-Modal Model
Yihao Zhao
Enhao Zhong
Cuiyun Yuan
Yang Li
Man Zhao
Chunxia Li
Jun Hu
Chenbin Liu
VLM
MedIm
106
0
0
05 Sep 2024
Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
Huaxi Huang
Xin Yuan
Qiyu Liao
Dadong Wang
Tongliang Liu
DiffM
86
0
0
05 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
139
2
0
04 Sep 2024
NoiseAttack: An Evasive Sample-Specific Multi-Targeted Backdoor Attack Through White Gaussian Noise
Abdullah Arafat Miah
Kaan Icer
Resit Sendag
Yu Bi
AAML
DiffM
83
1
0
03 Sep 2024
A Modern Take on Visual Relationship Reasoning for Grasp Planning
Paolo Rabino
Tatiana Tommasi
84
1
0
03 Sep 2024
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
Segev Shlomov
Ben Wiesel
Aviad Sela
Ido Levy
Liane Galanti
Roy Abitbol
LLMAG
135
3
0
03 Sep 2024
Latent Distillation for Continual Object Detection at the Edge
Francesco Pasti
Marina Ceccon
Davide Dalle Pezze
Francesco Paissan
Elisabetta Farella
Gian Antonio Susto
Nicola Bellotto
84
4
0
03 Sep 2024
Real-Time Indoor Object Detection based on hybrid CNN-Transformer Approach
Salah Eddine Laidoudi
Madjid Maidi
Samir Otmane
99
0
0
03 Sep 2024
Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization
Avraham Chapman
Haiming Xu
Lingqiao Liu
102
0
0
03 Sep 2024
ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition
Shiting Xiao
Yuhang Li
Youngeun Kim
Donghyun Lee
Priyadarshini Panda
93
1
0
03 Sep 2024
DS MYOLO: A Reliable Object Detector Based on SSMs for Driving Scenarios
Yang Li
Jianli Xiao
88
0
0
02 Sep 2024
Progressive Retinal Image Registration via Global and Local Deformable Transformations
Yepeng Liu
Baosheng Yu
Tian Chen
Yuliang Gu
Bo Du
Yongchao Xu
Jun Cheng
MedIm
92
2
0
02 Sep 2024
ViRED: Prediction of Visual Relations in Engineering Drawings
Chao Gu
Ke Lin
Yiyang Luo
Jiahui Hou
Xiang-Yang Li
83
0
0
02 Sep 2024
Towards Student Actions in Classroom Scenes: New Dataset and Baseline
Zhuolin Tan
Chenqiang Gao
Anyong Qin
Ruixin Chen
Tiecheng Song
Feng Yang
Deyu Meng
87
0
0
02 Sep 2024
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images
Q. Guan
Mengjie Pan
Feng Chen
Zhiqiang Yang
Zhongwen Yu
Qianwei Zhou
Haigen Hu
71
0
0
01 Sep 2024
ProteinRPN: Towards Accurate Protein Function Prediction with Graph-Based Region Proposals
Shania Mitra
Lei Huang
Manolis Kellis
ViT
44
0
0
01 Sep 2024
Mapping earth mounds from space
Baki Uzun
Shivam Pande
Gwendal Cachin-Bernard
M. Pham
Sébastien Lefèvre
Rumais Blatrix
Doyle McKey
28
1
0
31 Aug 2024
The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts
I. de Rodrigo
A. Sanchez-Cuadrado
J. Boal
A. J. Lopez-Lopez
VLM
104
1
0
31 Aug 2024
A method for detecting dead fish on large water surfaces based on improved YOLOv10
Qingbin Tian
Yukang Huo
Mingyuan Yao
Haihua Wang
38
2
0
31 Aug 2024
Look, Learn and Leverage (L
3
^3
3
): Mitigating Visual-Domain Shift and Discovering Intrinsic Relations via Symbolic Alignment
Hanchen Xie
Jiageng Zhu
Mahyar Khayatkhoei
Jiazhi Li
Wael AbdAlmageed
OOD
84
0
0
30 Aug 2024
Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations
Ahmed Hammam
B. K. Sreedhar
Nura Kawa
Tim Patzelt
Oliver De Candido
100
0
0
30 Aug 2024
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection
Yanquan Huang
Liu Wei Zhen
Yun Hao
Mengyuan Zhang
Qingyao Wu
Zikun Deng
Xueming Liu
Hong Deng
88
0
0
30 Aug 2024
UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation
P. Rudol
Patrick Doherty
M. Wzorek
Chattrakul Sombattheera
28
0
0
29 Aug 2024
Enhancing Sound Source Localization via False Negative Elimination
Zengjie Song
Jiangshe Zhang
Yuxi Wang
Junsong Fan
Zhaoxiang Zhang
97
0
0
29 Aug 2024
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
K. Nakata
Daisuke Miyashita
Youyang Ng
Yasuto Hoshi
J. Deguchi
71
0
0
29 Aug 2024
Anno-incomplete Multi-dataset Detection
Yiran Xu
Haoxiang Zhong
Kai Wu
Jialin Li
Yong Liu
Chengjie Wang
Shu-Tao Xia
Hongen Liao
ObjD
77
0
0
29 Aug 2024
PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator
Shiguang Wang
Tao Xie
Haijun Liu
Xingcheng Zhang
Jian Cheng
90
2
0
29 Aug 2024
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang
Min-Hung Chen
Shang-Hong Lai
91
0
0
28 Aug 2024
Pixels to Prose: Understanding the art of Image Captioning
Hrishikesh Singh
Aarti Sharma
Millie Pant
3DV
VLM
92
1
0
28 Aug 2024
Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System
Georg Schäfer
Jakob Rehrl
Stefan Huber
Simon Hirlaender
132
3
0
28 Aug 2024
Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients
Rashmi BN
R. Guru
A. AnusuyaM
116
1
0
28 Aug 2024
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
Jiageng Zhu
Hanchen Xie
Jiazhi Li
Mahyar Khayatkhoei
Wael AbdAlmageed
94
1
0
27 Aug 2024
Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
Elona Shatri
George Fazekas
76
2
0
27 Aug 2024
Previous
1
2
3
...
16
17
18
...
209
210
211
Next