1703.06870
Cited By
Mask R-CNN
20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
Papers citing
"Mask R-CNN"
50 / 3,768 papers shown
Title
Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis
Brian K. S. Isaac-Medina
Yona Falinie A. Gaus
Neelanjan Bhowmik
T. Breckon
28
2
0
22 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
31
3
0
20 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
38
6
0
18 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
81
0
0
17 Jul 2024
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao-Li Li
Yining Liu
Na Dong
Sitian Qin
Xiaolin Hu
41
3
0
15 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
51
1
0
12 Jul 2024
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
Jan Kautz
Mamba
47
59
0
10 Jul 2024
AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Rui Jin
Derun Li
Dehui Xiang
Lei Zhang
Hailing Zhou
Fei Shi
Weifang Zhu
Jing Cai
Tao Peng
Xinjian Chen
44
0
0
09 Jul 2024
Dynamic Neural Radiance Field From Defocused Monocular Video
Xianrui Luo
Huiqiang Sun
Juewen Peng
Zhiguo Cao
VGen
54
2
0
08 Jul 2024
Towards Reflected Object Detection: A Benchmark
Zhongtian Wang
You Wu
Hui Zhou
Shuiwang Li
ObjD
34
2
0
08 Jul 2024
CountGD: Multi-Modal Open-World Counting
Niki Amini-Naieni
Tengda Han
Andrew Zisserman
ObjD
64
10
0
05 Jul 2024
Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation
Mengmeng Cui
Kunbo Zhang
Zhenan Sun
ViT
36
0
0
03 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
33
1
0
02 Jul 2024
Multi-View Black-Box Physical Attacks on Infrared Pedestrian Detectors Using Adversarial Infrared Grid
Kalibinuer Tiliwalidi
Chengyin Hu
Weiwen Shi
AAML
36
1
0
01 Jul 2024
GMT: A Robust Global Association Model for Multi-Target Multi-Camera Tracking
Huijie Fan
Tinghui Zhao
Qiang Wang
Baojie Fan
Yandong Tang
Lianqing Liu
51
2
0
01 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
66
2
0
01 Jul 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
72
26
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
59
3
0
28 Jun 2024
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
34
1
0
24 Jun 2024
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Oluwatosin O. Alabi
K. Toe
Zijian Zhou
C. Budd
Nicholas Raison
Miaojing Shi
Tom Kamiel Magda Vercauteren
ISeg
64
1
0
23 Jun 2024
Towards Timely Video Analytics Services at the Network Edge
Xishuo Li
Shan Zhang
Yuejiao Huang
Xiao Ma
Zhiyuan Wang
Hongbin Luo
46
1
0
21 Jun 2024
TraceNet: Segment one thing efficiently
Mingyuan Wu
Zichuan Liu
Haozhen Zheng
Hongpeng Guo
Bo Chen
Xin Lu
Klara Nahrstedt
39
0
0
21 Jun 2024
Autonomous Robotic Drilling System for Mice Cranial Window Creation
Enduo Zhao
M. M. Marinho
K. Harada
21
5
0
20 Jun 2024
Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Chenxi Wang
Hao-Shu Fang
Minghao Gou
Hongjie Fang
Jin Gao
Cewu Lu
55
113
0
17 Jun 2024
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
Jordy Van Landeghem
Subhajit Maity
Ayan Banerjee
Matthew Blaschko
Marie-Francine Moens
Josep Lladós
Sanket Biswas
52
2
0
12 Jun 2024
ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh
R. Tamburo
Shen Zheng
Juan R. Alvarez-Padilla
Hailiang Zhu
Michael Cardei
Nicholas Dunn
Christoph Mertz
Srinivasa G. Narasimhan
54
1
0
11 Jun 2024
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
3DV
MDE
37
7
0
10 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
31
14
0
10 Jun 2024
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Hou-I Liu
Yu-Wen Tseng
Kai-Cheng Chang
Pin-Jyun Wang
Hong-Han Shuai
Wen-Huang Cheng
ViT
ObjD
48
24
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
80
12
0
09 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
45
22
0
06 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
93
1
0
06 Jun 2024
Reconstructing training data from document understanding models
Jérémie Dentan
Arnaud Paran
A. Shabou
AAML
SyDa
54
1
0
05 Jun 2024
GrootVL: Tree Topology is All You Need in State Space Model
Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
Mamba
47
11
0
04 Jun 2024
DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation
Ron Keuth
Lasse Hansen
Maren Balks
Ronja Jäger
Anne-Nele Schröder
Ludger Tüshaus
Mattias P. Heinrich
35
0
0
30 May 2024
YotoR-You Only Transform One Representation
José Ignacio Díaz Villa
P. Loncomilla
Javier Ruiz-del-Solar
ViT
46
0
0
30 May 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
40
0
0
29 May 2024
A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic
Ioanna C. Gogou
Dimitrios A. Koutsomitropoulos
3DH
33
2
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
62
4
0
28 May 2024
A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions
N. Neis
Juergen Beyerer
13
6
0
27 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
50
1
0
25 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Brendan Park
Madeline Janecek
Naser Ezzati-Jivan
Yifeng Li
Ali Emami
40
0
0
25 May 2024
Retro: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning
Khanh-Binh Nguyen
Chae Jung Park
39
0
0
24 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
45
0
23 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once
Theodore Zhao
Yu Gu
Jianwei Yang
Naoto Usuyama
Ho Hin Lee
...
B. Piening
Carlo Bifulco
Mu-Hsin Wei
Hoifung Poon
Sheng Wang
MedIm
41
23
0
21 May 2024
Learning Spatial Similarity Distribution for Few-shot Object Counting
Yuanwu Xu
Feifan Song
Haofeng Zhang
48
3
0
20 May 2024
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
Jialong Guo
Xinghao Chen
Yehui Tang
Yunhe Wang
ViT
49
9
0
19 May 2024
Beyond Traditional Single Object Tracking: A Survey
Omar Abdelaziz
Mohamed Shehata
Mohamed Mohamed
39
0
0
16 May 2024
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
47
12
0
16 May 2024