Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.06870
Cited By
Mask R-CNN
20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mask R-CNN"
50 / 3,872 papers shown
Title
PACO: Parts and Attributes of Common Objects
Vignesh Ramanathan
Anmol Kalia
Vladan Petrovic
Yiqian Wen
Baixue Zheng
...
Abhishek Kadian
Amir Mousavi
Yi-Zhe Song
Abhimanyu Dubey
D. Mahajan
VLM
30
96
0
04 Jan 2023
RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning
K. Liao
Lang Nie
Chunyu Lin
Zishuo Zheng
Yao Zhao
41
11
0
04 Jan 2023
A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imaging
Zhilin Zheng
Xu Fang
Jiawen Yao
Mengmeng Zhu
Le Lu
...
Hong Lu
Jian-Ping Lu
Ling Zhang
C. Shao
Yun Bian
MedIm
27
1
0
04 Jan 2023
Object Segmentation with Audio Context
Kaihui Zheng
Yuqing Ren
Zixin Shen
Tianxu Qin
VOS
29
0
0
04 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
79
24
0
03 Jan 2023
Explainability and Robustness of Deep Visual Classification Models
Jindong Gu
AAML
52
2
0
03 Jan 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
54
17
0
03 Jan 2023
Correlation Loss: Enforcing Correlation between Classification and Localization
Fehmi Kahraman
Kemal Oksuz
Sinan Kalkan
Emre Akbas
39
4
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
57
21
0
03 Jan 2023
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Sanghyun Woo
Shoubhik Debnath
Ronghang Hu
Xinlei Chen
Zhuang Liu
In So Kweon
Saining Xie
SyDa
103
737
0
02 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
102
31
0
02 Jan 2023
P3DC-Shot: Prior-Driven Discrete Data Calibration for Nearest-Neighbor Few-Shot Classification
Shuang Wang
Rui Ma
Tieru Wu
Yang Cao
36
5
0
02 Jan 2023
Learning Road Scene-level Representations via Semantic Region Prediction
Zihao Xiao
Alan Yuille
Yi-Ting Chen
34
2
0
02 Jan 2023
Image To Tree with Recursive Prompting
James Batten
Matthew Sinclair
Ben Glocker
M. Schaap
MedIm
24
1
0
01 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
Lu Yang
Wenhe Jia
Shane Li
Q. Song
ViT
61
17
0
01 Jan 2023
Robust Domain Adaptive Object Detection with Unified Multi-Granularity Alignment
Libo Zhang
Wenzhang Zhou
Heng Fan
Tiejian Luo
Haibin Ling
ObjD
31
12
0
01 Jan 2023
ExploreADV: Towards exploratory attack for Neural Networks
Tianzuo Luo
Yuyi Zhong
S. Khoo
AAML
28
1
0
01 Jan 2023
Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security Checkpoints
Abubakar Siddique
Henry Medeiros
39
5
0
31 Dec 2022
Ponder: Point Cloud Pre-training via Neural Rendering
Di Huang
Sida Peng
Tong He
Honghui Yang
Xiaowei Zhou
Wanli Ouyang
SSL
3DPC
47
41
0
31 Dec 2022
Improving Visual Representation Learning through Perceptual Understanding
Samyakh Tukra
Frederick Hoffman
Ken Chatfield
33
5
0
30 Dec 2022
Fruit Ripeness Classification: a Survey
Matteo Rizzo
Matteo Marcuzzo
A. Zangari
A. Gasparetto
A. Albarelli
32
62
0
29 Dec 2022
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC
VLM
41
1
0
29 Dec 2022
AVstack: An Open-Source, Reconfigurable Platform for Autonomous Vehicle Development
R. S. Hallyburton
Shucheng Zhang
Miroslav Pajic
19
10
0
28 Dec 2022
Continuous Depth Recurrent Neural Differential Equations
Srinivas Anumasa
Geetakrishnasai Gunapati
P. K. Srijith
AI4TS
26
0
0
28 Dec 2022
Fewer is More: Efficient Object Detection in Large Aerial Images
Xingxing Xie
Gong Cheng
Qingyang Li
Shicheng Miao
Ke Li
Junwei Han
ObjD
33
67
0
26 Dec 2022
MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos
Fengrui Tian
S. Du
Yueqi Duan
VGen
29
42
0
26 Dec 2022
SMMix: Self-Motivated Image Mixing for Vision Transformers
Mengzhao Chen
Mingbao Lin
Zhihang Lin
Yuxin Zhang
Rongrong Ji
Rongrong Ji
55
10
0
26 Dec 2022
Hyperspherical Quantization: Toward Smaller and More Accurate Models
Dan Liu
X. Chen
Chen Ma
Xue Liu
MQ
35
3
0
24 Dec 2022
xFBD: Focused Building Damage Dataset and Analysis
Dennis Melamed
Cameron Johnson
Chen Zhao
Russell Blue
Philip Morrone
A. Hoogs
Brian Clipp
AAML
11
2
0
23 Dec 2022
A Close Look at Spatial Modeling: From Attention to Convolution
Xu Ma
Huan Wang
Can Qin
Kunpeng Li
Xing Zhao
Jie Fu
Yun Fu
ViT
3DPC
25
11
0
23 Dec 2022
Precise Location Matching Improves Dense Contrastive Learning in Digital Pathology
Jingwei Zhang
S. Kapse
Ke Ma
Prateek Prasanna
Maria Vakalopoulou
Joel H. Saltz
Dimitris Samaras
32
9
0
23 Dec 2022
DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
Xiaoyang Kang
Tao Yang
Wenqi Ouyang
Peiran Ren
Lingzhi Li
Xuansong Xie
DiffM
MQ
30
39
0
22 Dec 2022
DaDe: Delay-adaptive Detector for Streaming Perception
Wonwoo Jo
Kyung-Min Lee
J. Baik
Sangsun Lee
Dongho Choi
Hyunkyoo Park
42
2
0
22 Dec 2022
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
24
6
0
21 Dec 2022
High-Throughput, High-Performance Deep Learning-Driven Light Guide Plate Surface Visual Quality Inspection Tailored for Real-World Manufacturing Environments
Carol Xu
M. Famouri
Gautam Bathla
M. Shafiee
Alexander Wong
26
2
0
20 Dec 2022
RangeAugment: Efficient Online Augmentation with Range Learning
Sachin Mehta
Saeid Naderiparizi
Fartash Faghri
Maxwell Horton
Lailin Chen
Ali Farhadi
Oncel Tuzel
Mohammad Rastegari
32
6
0
20 Dec 2022
Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation Framework
Wei Lou
Haofeng Li
Guanbin Li
Xiaoguang Han
Xiang Wan
26
26
0
20 Dec 2022
Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection
Sanghyun Woo
Kwanyong Park
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VLM
VOS
33
6
0
20 Dec 2022
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Ning Yu
Chia-Chih Chen
Zeyuan Chen
Rui Meng
Ganglu Wu
P. Josel
Juan Carlos Niebles
Caiming Xiong
Ran Xu
ViT
DiffM
32
7
0
19 Dec 2022
Building Height Prediction with Instance Segmentation
F. Bagci
A. Kındıroglu
Metehan Yalçin
Ufuk Uyan
Mahiye Uluyagmur Öztürk
24
2
0
19 Dec 2022
Mask-FPAN: Semi-Supervised Face Parsing in the Wild With De-Occlusion and UV GAN
Lei Li
Tianfang Zhang
Fabian Gieseke
Christian Igel
CVBM
34
19
0
18 Dec 2022
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization
Yuyang Zhao
Zhun Zhong
Na Zhao
N. Sebe
G. Lee
37
29
0
18 Dec 2022
Efficient Image Captioning for Edge Devices
Ning Wang
Jiangrong Xie
Hangzai Luo
Qinglin Cheng
Jihao Wu
Mingbo Jia
Linlin Li
VLM
CLIP
30
20
0
18 Dec 2022
Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning
Hui Li
Mingjie Sun
Jimin Xiao
Eng Gee Lim
Yao-Min Zhao
29
20
0
17 Dec 2022
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
Zhikai Li
Junrui Xiao
Lianwei Yang
Qingyi Gu
MQ
31
82
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
42
162
0
15 Dec 2022
Objaverse: A Universe of Annotated 3D Objects
Matt Deitke
Dustin Schwenk
Jordi Salvador
Luca Weihs
Oscar Michel
Eli VanderBilt
Ludwig Schmidt
Kiana Ehsani
Aniruddha Kembhavi
Ali Farhadi
40
894
0
15 Dec 2022
Multi-task Fusion for Efficient Panoptic-Part Segmentation
Sravan Kumar Jagadeesh
René Schuster
D. Stricker
27
6
0
15 Dec 2022
EM-Paste: EM-guided Cut-Paste with DALL-E Augmentation for Image-level Weakly Supervised Instance Segmentation
Yunhao Ge
Lyne Tchapmi
Brian Nlong Zhao
Laurent Itti
Vibhav Vineet
DiffM
41
5
0
15 Dec 2022
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration
Liqi Yan
Qifan Wang
Siqi Ma
Jingang Wang
Changbin (Brad) Yu
VOS
37
38
0
15 Dec 2022
Previous
1
2
3
...
17
18
19
...
76
77
78
Next