1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 7,360 papers shown
Adventures with Grace Hopper AI Super Chip and the National Research Platform
J. Alex Hurt
Grant J. Scott
Derek Weitzel
Huijun Zhu
26
0
0
21 Oct 2024
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Yufei Zhan
Hongyin Zhao
Yousong Zhu
Fan Yang
Ming Tang
Jinqiao Wang
MLLM
43
1
0
21 Oct 2024
Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data
Nikos Sakellariou
Antonios Lalas
K. Votis
Dimitrios Tzovaras
21
1
0
21 Oct 2024
Hybrid Architecture for Real-Time Video Anomaly Detection: Integrating Spatial and Temporal Analysis
Fabien Poirier
26
0
0
21 Oct 2024
On the VC dimension of deep group convolutional neural networks
Anna Sepliarskaia
S. Langer
Johannes Schmidt-Hieber
MLT
35
0
0
21 Oct 2024
How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit?
Maximilian Ulmer
Leonard Klüpfel
M. Durner
Rudolph Triebel
34
0
0
21 Oct 2024
Online Pseudo-Label Unified Object Detection for Multiple Datasets Training
XiaoJun Tang
Jingru Wang
Zeyu Shangguan
Darun Tang
Yuyu Liu
ObjD
40
0
0
21 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
21
0
0
20 Oct 2024
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning
Haiwen Diao
Ying Zhang
Shang Gao
Jiawen Zhu
Long Chen
Huchuan Lu
34
4
0
20 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD
VLM
59
0
0
20 Oct 2024
Modeling Visual Memorability Assessment with Autoencoders Reveals Characteristics of Memorable Images
Elham Bagheri
Yalda Mohsenzadeh
23
0
0
19 Oct 2024
Human Action Anticipation: A Survey
Bolin Lai
Sam Toyer
Tushar Nagarajan
Rohit Girdhar
S. Zha
James M. Rehg
Kris Kitani
Kristen Grauman
Ruta Desai
Miao Liu
AI4TS
46
1
0
17 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
Yansong Peng
Hebei Li
Peixi Wu
Yueyi Zhang
Xingwu Sun
Feng Wu
44
14
0
17 Oct 2024
ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
LM&Ro
20
0
0
17 Oct 2024
Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring
Kristina Telegraph
Christos Kyrkou
ObjD
26
0
0
17 Oct 2024
RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images
Kejun Ren
Xin Wu
Lianming Xu
Li Wang
Mamba
45
1
0
17 Oct 2024
Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Dibyanayan Bandyopadhyay
Mohammed Hasanuzzaman
Asif Ekbal
AAML
39
0
0
17 Oct 2024
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
Zhuoran Liu
Danpei Zhao
Bo Yuan
30
1
0
17 Oct 2024
See Behind Walls in Real-time Using Aerial Drones and Augmented Reality
Sikai Yang
Kang Yang
Yuning Chen
Fan Zhao
Wan Du
29
0
0
17 Oct 2024
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Zhiyuan Zhao
Hengrui Kang
Bin Wang
Zeang Sheng
35
11
0
16 Oct 2024
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
58
9
0
16 Oct 2024
Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Yong Zhang
Rui Zhu
Shifeng Zhang
Xu Zhou
Shifeng Chen
Xiaofan Chen
SSL
45
0
0
16 Oct 2024
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs
Yunqiu Xu
Linchao Zhu
Yi Yang
34
3
0
16 Oct 2024
Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond
Pengwei Liang
Junjun Jiang
Qing Ma
Xianming Liu
Jiayi Ma
34
2
0
16 Oct 2024
YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection
Olalekan Akindele
Joshua Atolagbe
26
0
0
15 Oct 2024
Leveraging Structure Knowledge and Deep Models for the Detection of Abnormal Handwritten Text
Zi-Rui Wang
13
0
0
15 Oct 2024
SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection
Shuhan Dong
Yunsong Li
Weiying Xie
Jiaqing Zhang
Jiayuan Tian
Danian Yang
Jie Lei
31
1
0
15 Oct 2024
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
50
0
0
15 Oct 2024
Fractal Calibration for long-tailed object detection
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
216
0
0
15 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
62
0
0
14 Oct 2024
SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments
Khaled Gabr
Mohamed Abdelkader
Imen Jarraya
Abdullah AlMusalami
Anis Koubaa
28
1
0
14 Oct 2024
LG-CAV: Train Any Concept Activation Vector with Language Guidance
Qihan Huang
Jie Song
Mengqi Xue
Han Zhang
Bingde Hu
Huiqiong Wang
Hao Jiang
Xingen Wang
Xiuming Zhang
VLM
34
0
0
14 Oct 2024
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Yuqi Li
Yao Lu
Zhihong Zhu
Chuanguang Yang
Yihao Chen
Jianping Gou
29
3
0
14 Oct 2024
Leveraging Customer Feedback for Multi-modal Insight Extraction
Sandeep Sricharan Mukku
Abinesh Kanagarajan
Pushpendu Ghosh
Chetan Aggarwal
27
0
0
13 Oct 2024
RailYolact -- A Yolact Focused on edge for Real-Time Rail Segmentation
Qihao Qian
31
0
0
12 Oct 2024
REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Binglei Zhao
Han Wang
Jian Tang
Chengzhong Ma
Hanbo Zhang
Jiayuan Zhang
Xuguang Lan
Xingyu Chen
3DPC
28
0
0
12 Oct 2024
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
G. Kumari
Kirtan Jain
Asif Ekbal
27
1
0
11 Oct 2024
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection
Haoyang Li
Rui Zhang
Hantao Yao
X. Zhang
Yifan Hao
Xinkai Song
Xiaqing Li
Yongwei Zhao
Ling Li
Yunji Chen
ObjD
VLM
42
4
0
11 Oct 2024
Impact of Surface Reflections in Maritime Obstacle Detection
Samed Yalçın
Hazım Kemal Ekenel
35
0
0
11 Oct 2024
Boosting Open-Vocabulary Object Detection by Handling Background Samples
Ruizhe Zeng
Lu Zhang
Xu Yang
Zhiyong Liu
VLM
ObjD
33
0
0
11 Oct 2024
Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery
Pratinav Seth
Michelle Lin
Brefo Dwamena Yaw
Jade Boutot
Mary Kang
David Rolnick
35
0
0
11 Oct 2024
Optimizing YOLO Architectures for Optimal Road Damage Detection and Classification: A Comparative Study from YOLOv7 to YOLOv10
Vung Pham
Lan Dong Thi Ngoc
Duy-Linh Bui
30
1
0
10 Oct 2024
RayEmb: Arbitrary Landmark Detection in X-Ray Images Using Ray Embedding Subspace
Pragyan Shrestha
Chun Xie
Yuichi Yoshii
I. Kitahara
41
1
0
10 Oct 2024
BA-Net: Bridge Attention in Deep Neural Networks
Ronghui Zhang
Runzong Zou
Yue Zhao
Zirui Zhang
Junzhou Chen
Yue Cao
Chuan Hu
Houbing Song
38
0
0
10 Oct 2024
Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom
Zhifeng Wang
Minghui Wang
Chunyan Zeng
Longlong Li
31
1
0
10 Oct 2024
LaB-CL: Localized and Balanced Contrastive Learning for improving parking slot detection
U Jin Jeong
Sumin Roh
Il Yong Chun
21
0
0
10 Oct 2024
When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections
Keryan Chelouche
Marie Lachaize
Marine Bernard
Louise Olgiati
Remi Cuingnet
NoLa
29
0
0
10 Oct 2024
CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting
Hulingxiao He
Yaqi Zhang
Jinglin Xu
Yuxin Peng
Mamba
33
0
0
10 Oct 2024
O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out
Mısra Yavuz
Fatma Guney
28
0
0
10 Oct 2024
SALINA: Towards Sustainable Live Sonar Analytics in Wild Ecosystems
Chi Xu
Rongsheng Qian
Hao Fang
Xiaoqiang Ma
William I. Atlas
Jiangchuan Liu
Mark A. Spoljaric
52
1
0
10 Oct 2024