1703.06870
Cited By
Mask R-CNN
20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
Papers citing "Mask R-CNN"
50 / 3,638 papers shown
Title
The Devil is in Fine-tuning and Long-tailed Problems: A New Benchmark for Scene Text Detection
Tianjiao Cao
Jiahao Lyu
Weichao Zeng
Weimin Mu
Yu Zhou
7
0
0
21 May 2025
Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
Bin-Bin Gao
Xiaochen Chen
Z. Huang
Congchong Nie
Jun Liu
Jinxiang Lai
Guannan Jiang
Xi-Zhao Wang
Chengjie Wang
16
26
0
20 May 2025
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection
Xiao Wang
Yu Jin
Lan Chen
Bo Jiang
Lin Zhu
Yonghong Tian
Jin Tang
Bin Luo
9
0
0
19 May 2025
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Fei Xie
Jiahao Nie
Yujin Tang
W. Zhang
Hongshen Zhao
Mamba
13
0
0
19 May 2025
FIGhost: Fluorescent Ink-based Stealthy and Flexible Backdoor Attacks on Physical Traffic Sign Recognition
Shuai Yuan
Guowen Xu
Hongwei Li
Rui Zhang
Xinyuan Qian
Wenbo Jiang
Hangcheng Cao
Qingchuan Zhao
AAML
26
0
0
17 May 2025
Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation
Niaz Ahmad
Jawad Khan
Kang G. Shin
Youngmoon Lee
Guanghui Wang
3DH
9
0
0
17 May 2025
Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation
Jianghang Lin
Yilin Lu
Yunhang Shen
Chaoyang Zhu
Shengchuan Zhang
Liujuan Cao
Rongrong Ji
ISeg
41
0
0
16 May 2025
SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision
Utsav Rai
Haozheng Xu
Stamatia Giannarou
MedIm
20
0
0
16 May 2025
FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling
Yue Wen
Liang Song
Yi Liu
Siting Zhu
Yanzi Miao
Lijun Han
Hesheng Wang
44
0
0
14 May 2025
Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos
Jeremie Ochin
Raphael Chekroun
Bogdan Stanciulescu
Sotiris Manitsaris
19
0
0
14 May 2025
Knowledge-Informed Deep Learning for Irrigation Type Mapping from Remote Sensing
Oishee Bintey Hoque
Nibir Chandra Mandal
Abhijin Adiga
Samarth Swarup
S. Nouwakpo
Amanda Wilson
Madhav Marathe
31
0
0
13 May 2025
The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning
M. L. Mekhalfi
P. Chippendale
Fabio Poiesi
Samuele Bonecher
Gilberto Osler
Nicola Zancanella
26
0
0
13 May 2025
Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation
Negin Ghamsarian
Sahar Nasirihaghighi
Klaus Schoeffmann
Raphael Sznitman
36
0
0
12 May 2025
Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection
Hongda Qin
Xiao Lu
Zhiyong Wei
Yihong Cao
Kailun Yang
Ningjiang Chen
ObjD
MLLM
VLM
31
0
0
12 May 2025
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
45
0
0
12 May 2025
Autonomous Robotic Pruning in Orchards and Vineyards: a Review
Alessandro Navone
Mauro Martini
Marcello Chiaberge
38
0
0
12 May 2025
Uni-AIMS: AI-Powered Microscopy Image Analysis
Yanhui Hong
Nan Wang
Zhiyi Xia
Haoyi Tao
Xi Fang
...
Shengyu Li
Ziqi Chen
Zezhong Zhang
Guolin Ke
Linfeng Zhang
26
0
0
11 May 2025
SimMIL: A Universal Weakly Supervised Pre-Training Framework for Multi-Instance Learning in Whole Slide Pathology Images
Yicheng Song
Tiancheng Lin
Die Peng
Su Yang
Yi Xu
MedIm
33
0
0
10 May 2025
A Short Overview of Multi-Modal Wi-Fi Sensing
Zijian Zhao
35
0
0
10 May 2025
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
Jingguo Qu
Xinyang Han
Man-Lik Chui
Yao Pu
Simon Takadiyi Gunda
...
Jing Qin
Ann Dorothy King
Winnie Chiu-Wing Chu
J. Cai
Michael Tin-Cheung Ying
31
0
0
09 May 2025
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
Zhangchi Hu
Peixi Wu
Jie Chen
Huyue Zhu
Yijun Wang
Yansong Peng
Hong Li
Xingchen Sun
49
0
0
09 May 2025
FG-CLIP: Fine-Grained Visual and Textual Alignment
Chunyu Xie
Bin Wang
Fanjing Kong
Jincheng Li
Dawei Liang
Gengshen Zhang
Dawei Leng
Yuhui Yin
CLIP
VLM
66
0
0
08 May 2025
InstanceGen: Image Generation with Instance-level Instructions
Etai Sella
Yanir Kleiman
Hadar Averbuch-Elor
36
0
0
08 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Yulin Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology
A. H. H. Chan
Otto Brookes
Urs Waldmann
Hemal Naik
I. Couzin
...
Lukas Boesch
M. Arandjelovic
H. Kühl
T. Burghardt
Fumihiro Kano
205
0
0
05 May 2025
Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation
Shipeng Liu
Ziliang Xiong
Bastian Wandt
Per-Erik Forssén
45
0
0
04 May 2025
OODTE: A Differential Testing Engine for the ONNX Optimizer
Nikolaos Louloudakis
Ajitha Rajan
46
0
0
03 May 2025
A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory
Chenyang Fan
Xujie Zhu
Taige Luo
Sheng Xu
Zhulin Chen
Hongxin Yang
28
0
0
03 May 2025
Efficient Vision-based Vehicle Speed Estimation
Andrej Macko
Lukás Gajdosech
Viktor Kocur
226
0
0
02 May 2025
Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing
Fahong Zhang
Yilei Shi
Xiao Xiang Zhu
43
0
0
02 May 2025
A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic
Muhammad Imran Zaman
Usama Ijaz Bajwa
Gulshan Saleem
Rana Hammad Raza
VOT
62
7
0
01 May 2025
Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction
Changjun Li
Runqing Jiang
Zhuo Song
Pengpeng Yu
Ye Zhang
Yulan Guo
MQ
56
0
0
01 May 2025
Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection
Siwei Wang
Zhiwei Chen
Liujuan Cao
Rongrong Ji
ObjD
69
0
0
29 Apr 2025
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning
Run Luo
Renke Shan
Longze Chen
Zichen Liu
Lu Wang
Min Yang
Xiaobo Xia
MLLM
VLM
99
0
0
28 Apr 2025
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Koki Inami
Masashi Konosu
Koki Yamane
Nozomu Masuya
Yunhan Li
Yu-Han Shu
Hiroshi Sato
Shinnosuke Homma
S. Sakaino
49
0
0
28 Apr 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
SynergyAmodal: Deocclude Anything with Text Control
Xinyang Li
Chengjie Yi
Jiawei Lai
Mingbao Lin
Yansong Qu
Shengchuan Zhang
Liujuan Cao
DiffM
85
0
0
28 Apr 2025
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
Pin-Chi Pan
Soo-Chang Pei
64
0
0
28 Apr 2025
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
Loc Phuc Truong Nguyen
Hung Nguyen
Hung Cao
71
0
0
27 Apr 2025
Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality
Majid Behravan
Maryam Haghani
Denis Gračanin
91
1
0
27 Apr 2025
VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation
Niaz Ahmad
Youngmoon Lee
Guanghui Wang
3DH
67
1
0
26 Apr 2025
R-Sparse R-CNN: SAR Ship Detection Based on Background-Aware Sparse Learnable Proposals
Kamirul Kamirul
Odysseas A. Pappas
A. Achim
58
0
0
26 Apr 2025
MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View
Liugang Lu
Dabin He
Congxiang Liu
Zhixiang Deng
54
0
0
25 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
M. Keuper
59
0
0
25 Apr 2025
Sampling-Based Grasp and Collision Prediction for Assisted Teleoperation
Simon Manschitz
Berk Gueler
Wei Ma
Dirk Ruiken
31
0
0
25 Apr 2025
Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation
Jiahui Geng
Lu Zou
Tao Lu
Yuan Yao
Zhangjin Huang
Guoping Wang
3DPC
35
0
0
21 Apr 2025
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
Guodong Shen
Yuqi Ouyang
Junru Lu
Yixuan Yang
Victor Sanchez
38
1
0
20 Apr 2025
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
49
3
0
18 Apr 2025
HMPE: HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
38
0
0
18 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
103
2
0
17 Apr 2025