1506.01497
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 10,536 papers shown
Title
From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition
Shiwei Wu
Chao Zhang
Joya Chen
Tong Xu
Likang Wu
Yao Hu
Enhong Chen
68
1
0
12 Jun 2024
I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors
Zijin Lin
Yue Zhao
Kai Chen
Jinwen He
AAML
84
1
0
12 Jun 2024
Adaptive Teaching with Shared Classifier for Knowledge Distillation
Jaeyeon Jang
Young-Ik Kim
Jisu Lim
Hyeonseong Lee
74
0
0
12 Jun 2024
Automated Pavement Cracks Detection and Classification Using Deep Learning
Selvia Nafaa
Hafsa Essam
Karim Ashour
Doaa Emad
Rana Mohamed
Mohammed Elhenawy
Huthaifa I. Ashqar
Abdallah A. Hassan
Taqwa I. Alhadidi
26
4
0
11 Jun 2024
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions
Renjie Pi
Jianshu Zhang
Jipeng Zhang
Boyao Wang
Zhekai Chen
Tong Zhang
3DV
98
24
0
11 Jun 2024
MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD
Ioanna Ntinou
Enrique Sanchez
Georgios Tzimiropoulos
84
0
0
11 Jun 2024
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Xing Zhang
Jiaxi Gu
Haoyu Zhao
Shicong Wang
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
Yu-Gang Jiang
96
0
0
11 Jun 2024
Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection
Junfei Yi
Jianxu Mao
Tengfei Liu
Mingjie Li
Hanyu Gu
Hui Zhang
Xiaojun Chang
Yaonan Wang
83
2
0
11 Jun 2024
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only
Junhee Cho
Jihoon Kim
Daseul Bae
Jinho Choo
Youngjune Gwon
Yeong-Dae Kwon
LLMAG
64
2
0
11 Jun 2024
UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving
Daniel Bogdoll
Noël Ollick
Tim Joseph
J. Marius Zöllner
115
2
0
10 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
108
15
0
10 Jun 2024
ReCon1M: A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery
Xian Sun
Qiwei Yan
Chubo Deng
Chenglong Liu
Yi Jiang
...
Wanxuan Lu
Fanglong Yao
Xiaoyu Liu
Lingxiang Hao
Hongfeng Yu
113
0
0
10 Jun 2024
Stealthy Targeted Backdoor Attacks against Image Captioning
Wenshu Fan
Hongwei Li
Wenbo Jiang
Meng Hao
Shui Yu
Xiao Zhang
DiffM
77
6
0
09 Jun 2024
ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving
Chen Ma
Ningfei Wang
Zhengyu Zhao
Qian Wang
Qi Alfred Chen
Chao Shen
AAML
78
2
0
09 Jun 2024
SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving
Chen Ma
Ningfei Wang
Zhengyu Zhao
Qi Alfred Chen
Chao Shen
102
1
0
09 Jun 2024
OD-DETR: Online Distillation for Stabilizing Training of Detection Transformer
Shengjian Wu
Li Sun
Qingli Li
124
0
0
09 Jun 2024
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Hou-I Liu
Yu-Wen Tseng
Kai-Cheng Chang
Pin-Jyun Wang
Hong-Han Shuai
Wen-Huang Cheng
ViT
ObjD
157
32
0
09 Jun 2024
Interpretable Multimodal Out-of-context Detection with Soft Logic Regularization
Huanhuan Ma
Jinghao Zhang
Qiang Liu
Shu Wu
Liang Wang
86
2
0
07 Jun 2024
Camera-Pose Robust Crater Detection from Chang'e 5
Matthew Rodda
Sofia Mcleod
Ky Cuong Pham
Tat-Jun Chin
25
1
0
07 Jun 2024
DeTra: A Unified Model for Object Detection and Trajectory Forecasting
Sergio Casas
Ben Agro
Jiageng Mao
Thomas Gilles
Alexander Cui
Thomas Li
R. Urtasun
95
5
0
06 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
109
27
0
06 Jun 2024
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles
Cyprien Quéméneur
Soumaya Cherkaoui
106
3
0
05 Jun 2024
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
Qiang Chen
Xiangbo Su
Xinyu Zhang
Jian Wang
Jiahui Chen
...
Shan Zhang
Kun Yao
Errui Ding
Gang Zhang
Jingdong Wang
ViT
120
22
0
05 Jun 2024
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
Jie Wang
Joemon M. Jose
77
1
0
05 Jun 2024
MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection
Mingyuan Li
Tong Jia
Hui Lu
Bowen Ma
Hao Wang
Dongyue Chen
93
3
0
05 Jun 2024
Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework
Eliraz Orfaig
Inna Stainvas
Igal Bilik
68
0
0
05 Jun 2024
EgoSurgery-Tool: A Dataset of Surgical Tool and Hand Detection from Egocentric Open Surgery Videos
Ryo Fujii
Hideo Saito
Hiroki Kajita
74
5
0
05 Jun 2024
DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images
Yimian Dai
Minrui Zou
Yuxuan Li
Xiang Li
Kang Ni
Jian Yang
86
5
0
05 Jun 2024
Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges
Daniel A. P. Oliveira
Eugénio Ribeiro
David Martins de Matos
VGen
69
3
0
04 Jun 2024
Negative Prototypes Guided Contrastive Learning for WSOD
Yu Zhang
Chuang Zhu
Guoqing Yang
Siqi Chen
112
0
0
04 Jun 2024
Leveraging Predicate and Triplet Learning for Scene Graph Generation
Jiankai Li
Yunhong Wang
Xiefan Guo
Ruijie Yang
Weixin Li
146
6
0
04 Jun 2024
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
Yuanhao Ban
Ruochen Wang
Tianyi Zhou
Boqing Gong
Cho-Jui Hsieh
Minhao Cheng
DiffM
126
6
0
04 Jun 2024
Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning
Heather J. Doig
Oscar Pizarro
Jacquomo Monk
Stefan Williams
90
0
0
04 Jun 2024
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Weichao Zhao
Hao Feng
Qi Liu
Jingqun Tang
Shubo Wei
...
Lei Liao
Yongjie Ye
Hao Liu
Houqiang Li
Can Huang
LMTD
109
24
0
03 Jun 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
105
8
0
03 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm
ViT
95
9
0
02 Jun 2024
Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance
Jun Li
Tongkun Su
Baoliang Zhao
Faqin Lv
Qiong Wang
Nassir Navab
Yin Hu
Zhongliang Jiang
MedIm
92
6
0
02 Jun 2024
Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Xingrui Wang
Wufei Ma
Angtian Wang
Shuo Chen
Adam Kortylewski
Alan Yuille
121
6
0
02 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjD
VLM
116
14
0
01 Jun 2024
DroneVis: Versatile Computer Vision Library for Drones
Ahmed Heakl
F. Youssef
Victor Parque
Walid Gomaa
AI4TS
89
1
0
01 Jun 2024
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
141
26
0
01 Jun 2024
Image Captioning via Dynamic Path Customization
Yiwei Ma
Jiayi Ji
Xiaoshuai Sun
Yiyi Zhou
Xiaopeng Hong
Yongjian Wu
Rongrong Ji
94
1
0
01 Jun 2024
Efficient Open Set Single Image Test Time Adaptation of Vision Language Models
Manogna Sreenivas
Soma Biswas
VLM
114
0
0
01 Jun 2024
Advancing Ear Biometrics: Enhancing Accuracy and Robustness through Deep Learning
Youssef Mohamed
Zeyad Youssef
Ahmed Heakl
A. Zaky
53
1
0
31 May 2024
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
Cheng Tan
Jingxuan Wei
Linzhuang Sun
Zhangyang Gao
Siyuan Li
Bihui Yu
Ruifeng Guo
Stan Z. Li
ReLM
LRM
3DV
127
7
0
31 May 2024
Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection
Jin-Hee Lee
Jae-Keun Lee
Je-Seok Kim
Soon Kwon
3DPC
78
0
0
31 May 2024
Automatic Counting and Classification of Mosquito Eggs in Field Traps
Javier Naranjo-Alcazar
Jordi Grau-Haro
P. Zuccarello
D. Almenar
Jesús López Ballester
39
0
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
89
5
0
30 May 2024
P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation
Qi Zhang
Guohua Geng
Long-He Yan
Pengbo Zhou
Zhaodi Li
Kang Li
Qinglin Liu
DiffM
120
1
0
30 May 2024
SurgiTrack: Fine-Grained Multi-Class Multi-Tool Tracking in Surgical Videos
C. Nwoye
N. Padoy
94
5
0
30 May 2024