Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 6,788 papers shown
Title
Graph Network for Sign Language Tasks
Shiwei Gan
Yafeng Yin
Zhiwei Jiang
Hongkai Wen
Lei Xie
Sanglu Lu
SLR
49
0
0
16 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
41
0
0
15 Apr 2025
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
Shripad Pate
Aiman Farooq
Suvrankar Datta
Musadiq Aadil Sheikh
Atin Kumar
Deepak Mishra
31
0
0
15 Apr 2025
PatrolVision: Automated License Plate Recognition in the wild
Anmol Singhal Navya Singhal
31
0
0
15 Apr 2025
A comprehensive review of remote sensing in wetland classification and mapping
Shuai Yuan
Xiangan Liang
Tianwu Lin
Shuang Chen
Rui Liu
Jie Wang
Huatian Zhang
Peng Gong
34
0
0
15 Apr 2025
Weather-Aware Object Detection Transformer for Domain Adaptation
Soheil Gharatappeh
Salimeh Yasaei Sekeh
Vikas Dhiman
ViT
31
0
0
15 Apr 2025
Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization
Darryl Hannan
John Cooper
Dylan White
Timothy Doster
Henry Kvinge
Y. Watkins
29
0
0
14 Apr 2025
Improving Multimodal Hateful Meme Detection Exploiting LMM-Generated Knowledge
Maria Tzelepi
Vasileios Mezaris
34
0
0
14 Apr 2025
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
Xingjian Leng
Jaskirat Singh
Yunzhong Hou
Zhenchang Xing
Saining Xie
Liang Zheng
41
1
0
14 Apr 2025
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts
Jiansheng Li
Xingxuan Zhang
Hao Zou
Yige Guo
Renzhe Xu
Yilong Liu
Chuzhao Zhu
Yue He
Peng Cui
VLM
44
0
0
14 Apr 2025
Density-based Object Detection in Crowded Scenes
Chenyang Zhao
Jia Wan
Antoni B. Chan
34
0
0
14 Apr 2025
Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware
Muhammad Fasih Tariq
Muhammad Azeem Javed
ObjD
56
0
0
14 Apr 2025
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Yongchao Feng
Yajie Liu
Shuai Yang
Wenrui Cai
Jingyang Zhang
...
Jiahui Lv
Ziqiang Liu
Tengyuan Shi
Qingjie Liu
Yansen Wang
MLLM
VLM
63
1
0
13 Apr 2025
Using Vision Language Models for Safety Hazard Identification in Construction
Muhammad Adil
Gaang Lee
Vicente A. Gonzalez
Qipei Mei
36
1
0
12 Apr 2025
Explorer: Robust Collection of Interactable GUI Elements
Iason Chaimalas
Arnas Vyšniauskas
Gabriel Brostow
31
0
0
12 Apr 2025
Title block detection and information extraction for enhanced building drawings search
Alessio Lombardi
Li Duan
Ahmed Elnagar
Ahmed Zaalouk
Khalid Ismail
Edlira Vakaj
21
0
0
11 Apr 2025
HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields
Asterios Reppas
Grigorios-Aris Cheimariotis
Panos K. Papadopoulos
Panagiotis Frasiolas
Dimitrios Zarpalas
34
0
0
11 Apr 2025
On Transfer-based Universal Attacks in Pure Black-box Setting
M. Jalwana
Naveed Akhtar
Ajmal Mian
Nazanin Rahnavard
Mubarak Shah
AAML
31
0
0
11 Apr 2025
VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering
Qi Zhi Lim
C. Lee
K. Lim
Kalaiarasi Sonai Muthu Anbananthen
31
0
0
11 Apr 2025
MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction
Ian Noronha
Advait Prasad Jawaji
Juan Camilo Soto
Jiajun An
Yan Gu
Upinder Kaur
37
0
0
11 Apr 2025
WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer
Huilin Yin
Pengyu Wang
Senmao Li
Jun Yan
Daniel Watzenig
31
0
0
10 Apr 2025
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
Junli Liu
Qizhi Chen
Zechuan Wang
Yiwen Tang
Yiting Zhang
Chi Yan
Dong Wang
X. Li
Bin Zhao
CoGe
49
0
0
10 Apr 2025
Perception-R1: Pioneering Perception Policy with Reinforcement Learning
En Yu
Kangheng Lin
Liang Zhao
Jisheng Yin
Yana Wei
...
Zheng Ge
Xiangyu Zhang
Daxin Jiang
Jingyu Wang
Wenbing Tao
VLM
OffRL
LRM
40
3
0
10 Apr 2025
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
Erin Carson
Xinye Chen
54
0
0
10 Apr 2025
RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions
Youngwan Jin
Michal Kovac
Yagiz Nalcakan
Hyeongjin Ju
Hanbin Song
Sanghyeop Yeo
Shiho Kim
44
0
0
10 Apr 2025
Few-Shot Adaptation of Grounding DINO for Agricultural Domain
Rajhans Singh
Rafael Bidese Puhl
Kshitiz Dhakal
Sudhir Sornapudi
31
0
0
09 Apr 2025
UAV Position Estimation using a LiDAR-based 3D Object Detection Method
Uthman Olawoye
Jason N. Gross
3DPC
52
3
0
09 Apr 2025
Perception in Reflection
Yana Wei
Liang Zhao
Kangheng Lin
En Yu
Yuang Peng
...
Jianjian Sun
Haoran Wei
Zheng Ge
Xiangyu Zhang
Vishal M. Patel
31
0
0
09 Apr 2025
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection
Ruoyu Chen
Hua Zhang
Jingzhi Li
Li Liu
Zhen Huang
Xiaochun Cao
37
0
0
09 Apr 2025
Class Imbalance Correction for Improved Universal Lesion Detection and Tagging in CT
Peter D. Erickson
T. Mathai
Ronald M. Summers
56
4
0
08 Apr 2025
A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions
Ronghui Zhang
Yuhang Ma
Tengfei Li
Ziyu Lin
Yueying Wu
Junzhou Chen
Lin Zhang
Jia Hu
Tony Z. Qiu
Konghui Guo
41
0
0
08 Apr 2025
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
Rupayan Mallick
Sibo Dong
Nataniel Ruiz
Sarah Adel Bargal
DiffM
49
0
0
08 Apr 2025
AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes
Zhenteng Li
Sheng Lian
Dengfeng Pan
Yufei Wang
Wei Liu
56
0
0
08 Apr 2025
Don't Lag, RAG: Training-Free Adversarial Detection Using RAG
Roie Kazoom
Raz Lapid
Moshe Sipper
Ofer Hadar
VLM
ObjD
AAML
69
0
0
07 Apr 2025
GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network
Yunxiang Liu
Hongkuo Niu
Jianlin Zhu
32
0
0
07 Apr 2025
Universal Lymph Node Detection in Multiparametric MRI with Selective Augmentation
T. Mathai
Sungwon Lee
Thomas C. Shen
Zhiyong Lu
Ronald M. Summers
40
0
0
07 Apr 2025
Inland Waterway Object Detection in Multi-environment: Dataset and Approach
Shanshan Wang
Haixiang Xu
Hui Feng
Xiaoqian Wang
Pei Song
Sijie Liu
Jianhua He
29
0
0
07 Apr 2025
Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding
Zahir Alsulaimawi
33
0
0
07 Apr 2025
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
Jiancheng Pan
Yanxing Liu
Xiao He
Long Peng
Jiahao Li
Yuze Sun
Xiaomeng Huang
40
0
0
06 Apr 2025
Progressive Multi-Source Domain Adaptation for Personalized Facial Expression Recognition
Muhammad Osama Zeeshan
M. Pedersoli
A. L. Koerich
Eric Grange
29
0
0
05 Apr 2025
Edge Approximation Text Detector
Chuang Yang
Xu Han
T. Han
Han Han
Bingxuan Zhao
Qi Wang
43
0
0
05 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
44
0
0
05 Apr 2025
Loss Functions in Deep Learning: A Comprehensive Review
Omar Elharrouss
Yasir Mahmood
Yassine Bechqito
Mohamed Adel Serhani
E. Badidi
Jamal Riffi
Hamid Tairi
38
0
0
05 Apr 2025
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
Divya Velayudhan
A. Ahmed
Mohamad Alansari
Neha Gour
Abderaouf Behouch
...
Muzammal Naseer
Juergen Gall
Mohammed Bennamoun
Ernesto Damiani
Naoufel Werghi
50
0
0
03 Apr 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
45
0
0
03 Apr 2025
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results
Andrei Dumitriu
Florin Tatui
Florin Miron
Radu Tudor Ionescu
Radu Timofte
47
21
0
03 Apr 2025
NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds
Mahan Rafidashti
Ji Lan
M. Fatemi
Junsheng Fu
Lars Hammarstrand
Lennart Svensson
49
0
0
01 Apr 2025
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
Andrei Dumitriu
Florin Tatui
Florin Miron
Aakash Ralhan
Radu Tudor Ionescu
Radu Timofte
53
0
0
01 Apr 2025
Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians
Jiamin Wu
Hongyang Li
Xiaoke Jiang
Yuan Yao
Lei Zhang
3DGS
56
0
0
01 Apr 2025
Real-Time Navigation for Autonomous Aerial Vehicles Using Video
Khizar Anjum
Parul Pandey
Vidyasagar Sadhu
Roberto Tron
D. Pompili
44
0
0
01 Apr 2025
Previous
1
2
3
4
5
6
...
134
135
136
Next