Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,518 papers shown
Title
On the VC dimension of deep group convolutional neural networks
Anna Sepliarskaia
S. Langer
Johannes Schmidt-Hieber
MLT
69
0
0
21 Oct 2024
How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit?
Maximilian Ulmer
Leonard Klüpfel
M. Durner
Rudolph Triebel
60
0
0
21 Oct 2024
Online Pseudo-Label Unified Object Detection for Multiple Datasets Training
XiaoJun Tang
Jingru Wang
Zeyu Shangguan
Darun Tang
Yuyu Liu
ObjD
67
0
0
21 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
96
0
0
20 Oct 2024
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning
Haiwen Diao
Ying Zhang
Shang Gao
Jiawen Zhu
Long Chen
Huchuan Lu
70
5
0
20 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD
VLM
160
0
0
20 Oct 2024
Modeling Visual Memorability Assessment with Autoencoders Reveals Characteristics of Memorable Images
Elham Bagheri
Yalda Mohsenzadeh
168
0
0
19 Oct 2024
Human Action Anticipation: A Survey
Bolin Lai
Sam Toyer
Tushar Nagarajan
Rohit Girdhar
S. Zha
James M. Rehg
Kris Kitani
Kristen Grauman
Ruta Desai
Miao Liu
AI4TS
92
1
0
17 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
Yansong Peng
Hebei Li
Peixi Wu
Yueyi Zhang
Xingwu Sun
Feng Wu
107
19
0
17 Oct 2024
ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
LM&Ro
101
0
0
17 Oct 2024
Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring
Kristina Telegraph
Christos Kyrkou
ObjD
60
0
0
17 Oct 2024
RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images
Kejun Ren
Xin Wu
Lianming Xu
Li Wang
Mamba
152
4
0
17 Oct 2024
Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Dibyanayan Bandyopadhyay
Mohammed Hasanuzzaman
Asif Ekbal
AAML
60
1
0
17 Oct 2024
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
Zhuoran Liu
Danpei Zhao
Bo Yuan
104
1
0
17 Oct 2024
See Behind Walls in Real-time Using Aerial Drones and Augmented Reality
Sikai Yang
Kang Yang
Yuning Chen
Fan Zhao
Wan Du
41
0
0
17 Oct 2024
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Zhiyuan Zhao
Hengrui Kang
Bin Wang
Zeang Sheng
80
17
0
16 Oct 2024
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
127
9
0
16 Oct 2024
Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Yong Zhang
Rui Zhu
Shifeng Zhang
Xu Zhou
Shifeng Chen
Xiaofan Chen
SSL
74
0
0
16 Oct 2024
Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond
Pengwei Liang
Junjun Jiang
Qing Ma
Xianming Liu
Jiayi Ma
79
2
0
16 Oct 2024
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs
Yunqiu Xu
Linchao Zhu
Yi Yang
143
5
0
16 Oct 2024
YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection
Olalekan Akindele
Joshua Atolagbe
69
0
0
15 Oct 2024
Leveraging Structure Knowledge and Deep Models for the Detection of Abnormal Handwritten Text
Zi-Rui Wang
46
0
0
15 Oct 2024
SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection
Shuhan Dong
Yunsong Li
Weiying Xie
Jiaqing Zhang
Jiayuan Tian
Danian Yang
Jie Lei
86
1
0
15 Oct 2024
Fractal Calibration for long-tailed object detection
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
492
0
0
15 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
133
3
0
14 Oct 2024
SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments
Khaled Gabr
Mohamed Abdelkader
Imen Jarraya
Abdullah AlMusalami
Anis Koubaa
62
4
0
14 Oct 2024
LG-CAV: Train Any Concept Activation Vector with Language Guidance
Qihan Huang
Jie Song
Mengqi Xue
Han Zhang
Bingde Hu
Huiqiong Wang
Hao Jiang
Xingen Wang
Xiuming Zhang
VLM
83
3
0
14 Oct 2024
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Yuqi Li
Yao Lu
Zhihong Zhu
Chuanguang Yang
Yihao Chen
Jianping Gou
76
6
0
14 Oct 2024
Leveraging Customer Feedback for Multi-modal Insight Extraction
Sandeep Sricharan Mukku
Abinesh Kanagarajan
Pushpendu Ghosh
Chetan Aggarwal
44
0
0
13 Oct 2024
RailYolact -- A Yolact Focused on edge for Real-Time Rail Segmentation
Qihao Qian
124
0
0
12 Oct 2024
REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Binglei Zhao
Han Wang
Jian Tang
Chengzhong Ma
Hanbo Zhang
Jiayuan Zhang
Xuguang Lan
Xingyu Chen
3DPC
64
0
0
12 Oct 2024
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
G. Kumari
Kirtan Jain
Asif Ekbal
147
4
0
11 Oct 2024
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection
Haoyang Li
Rui Zhang
Hantao Yao
X. Zhang
Yifan Hao
Xinkai Song
Xiaqing Li
Yongwei Zhao
Ling Li
Yunji Chen
ObjD
VLM
126
5
0
11 Oct 2024
Impact of Surface Reflections in Maritime Obstacle Detection
Samed Yalçın
Hazım Kemal Ekenel
112
0
0
11 Oct 2024
Boosting Open-Vocabulary Object Detection by Handling Background Samples
Ruizhe Zeng
Lu Zhang
Xu Yang
Zhiyong Liu
VLM
ObjD
61
0
0
11 Oct 2024
Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery
Pratinav Seth
Michelle Lin
Brefo Dwamena Yaw
Jade Boutot
Mary Kang
David Rolnick
263
0
0
11 Oct 2024
Optimizing YOLO Architectures for Optimal Road Damage Detection and Classification: A Comparative Study from YOLOv7 to YOLOv10
Vung Pham
Lan Dong Thi Ngoc
Duy-Linh Bui
58
1
0
10 Oct 2024
RayEmb: Arbitrary Landmark Detection in X-Ray Images Using Ray Embedding Subspace
Pragyan Shrestha
Chun Xie
Yuichi Yoshii
I. Kitahara
94
1
0
10 Oct 2024
BA-Net: Bridge Attention in Deep Neural Networks
Ronghui Zhang
Runzong Zou
Yue Zhao
Zirui Zhang
Junzhou Chen
Yue Cao
Chuan Hu
Houbing Song
67
1
0
10 Oct 2024
Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom
Zhifeng Wang
Minghui Wang
Chunyan Zeng
Longlong Li
68
1
0
10 Oct 2024
LaB-CL: Localized and Balanced Contrastive Learning for improving parking slot detection
U Jin Jeong
Sumin Roh
Il Yong Chun
110
0
0
10 Oct 2024
When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections
Keryan Chelouche
Marie Lachaize
Marine Bernard
Louise Olgiati
Remi Cuingnet
NoLa
60
0
0
10 Oct 2024
CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting
Hulingxiao He
Yaqi Zhang
Jinglin Xu
Yuxin Peng
Mamba
68
0
0
10 Oct 2024
O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out
Mısra Yavuz
Fatma Guney
74
0
0
10 Oct 2024
SALINA: Towards Sustainable Live Sonar Analytics in Wild Ecosystems
Chi Xu
Rongsheng Qian
Hao Fang
Xiaoqiang Ma
William I. Atlas
Jiangchuan Liu
Mark A. Spoljaric
85
1
0
10 Oct 2024
Segmenting objects with Bayesian fusion of active contour models and convnet priors
P. Polewski
Jacquelyn A. Shelton
W. Yao
M. Heurich
114
1
0
09 Oct 2024
Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation
Seungho Lee
Hwijeong Lee
Hyunjung Shim
71
0
0
09 Oct 2024
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
Ahmed Abdullah
Nikolas Ebert
Oliver Wasenmüller
ObjD
62
1
0
09 Oct 2024
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Siyuan Li
Juanxi Tian
Zedong Wang
Luyuan Zhang
Zicheng Liu
Weiyang Jin
Yang Liu
Baigui Sun
Stan Z. Li
95
0
0
08 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD
VLM
85
7
0
08 Oct 2024
Previous
1
2
3
...
13
14
15
...
209
210
211
Next