1703.06870
Cited By
Mask R-CNN
20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
Papers citing "Mask R-CNN"
50 / 4,190 papers shown
Title
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding
Kirill Mazur
Edgar Sucar
Andrew J. Davison
3DPC
AI4CE
96
45
0
06 Oct 2022
Effective Self-supervised Pre-training on Low-compute Networks without Distillation
Fuwen Tan
F. Saleh
Brais Martínez
56
4
0
06 Oct 2022
Vision-Based Defect Classification and Weight Estimation of Rice Kernels
Ehtesham Iqbal
Asim Niaz
Asif Aziz Memon
K. Choi
27
3
0
06 Oct 2022
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
Khoa T. Vo
Sang Truong
Kashu Yamazaki
Bhiksha Raj
Minh-Triet Tran
Ngan Le
93
27
0
05 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
127
147
0
05 Oct 2022
Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
39
7
0
05 Oct 2022
Promising or Elusive? Unsupervised Object Segmentation from Real-world Single Images
Yafei Yang
Bo Yang
OCL
129
17
0
05 Oct 2022
Centralized Feature Pyramid for Object Detection
Yu Quan
Dong Zhang
Liyan Zhang
Jinhui Tang
ObjD
38
158
0
05 Oct 2022
MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation
Hanwei Zhang
Hideaki Uchiyama
Shintaro Ono
Hiroshi Kawasaki
28
15
0
05 Oct 2022
Centerpoints Are All You Need in Overhead Imagery
James Mason Inder
Mark Lowell
A. J. Maltenfort
3DPC
44
2
0
04 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
59
60
0
04 Oct 2022
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions
Bruno Berenguel-Baeta
J. Bermudez-Cameo
Jose J. Guerrero
MDE
67
15
0
04 Oct 2022
Open-source High-precision Autonomous Suturing Framework With Visual Guidance
Hongbin Lin
Bin Li
Yunhui Liu
K. W. S. Au
52
4
0
04 Oct 2022
Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Guanglin Li
Yifeng Li
Zhichao Ye
Qihang Zhang
Tao Kong
Zhaopeng Cui
Guofeng Zhang
49
24
0
03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Susung Hong
Gyuseong Lee
Wooseok Jang
Seung Wook Kim
DiffM
40
97
0
03 Oct 2022
Unbiased Scene Graph Generation using Predicate Similarities
Misaki Ohashi
Yusuke Matsui
58
1
0
03 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Wenguan Wang
James Liang
Dongfang Liu
ISeg
52
48
0
03 Oct 2022
Heatmap Distribution Matching for Human Pose Estimation
Haoxuan Qu
Li Xu
Yujun Cai
Lin Geng Foo
Jun Liu
37
14
0
03 Oct 2022
DFA: Dynamic Feature Aggregation for Efficient Video Object Detection
Yiming Cui
50
8
0
02 Oct 2022
Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation
Xinhang Liu
Jiaben Chen
Huai Yu
Yu-Wing Tai
Chi-Keung Tang
97
28
0
02 Oct 2022
Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2
A. Eslamian
M. Ahmadzadeh
37
6
0
01 Oct 2022
Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement
Zirui Zhao
W. Lee
David Hsu
OOD
37
9
0
01 Oct 2022
An In-depth Study of Stochastic Backpropagation
J. Fang
Ming Xu
Hao Chen
Bing Shuai
Zhuowen Tu
Joseph Tighe
BDL
43
2
0
30 Sep 2022
BayesFT: Bayesian Optimization for Fault Tolerant Neural Network Architecture
Nanyang Ye
Jingbiao Mei
Zhicheng Fang
Yuwen Zhang
Ziqing Zhang
Huaying Wu
Xiaoyao Liang
OOD
33
5
0
30 Sep 2022
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models
Weicheng Kuo
Huayu Chen
Xiuye Gu
A. Piergiovanni
A. Angelova
MLLM
VLM
ObjD
93
135
0
30 Sep 2022
Automated Characterization of Catalytically Active Inclusion Body Production in Biotechnological Screening Systems
Karina Ruzaeva
K. Küsters
W. Wiechert
B. Berkels
M. Oldiges
K. Nöh
18
0
0
30 Sep 2022
Slimmable Networks for Contrastive Self-supervised Learning
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yi Yang
40
1
0
30 Sep 2022
NBV-SC: Next Best View Planning based on Shape Completion for Fruit Mapping and Reconstruction
Rohit Menon
Tobias Zaenker
Nils Dengler
Maren Bennewitz
44
27
0
30 Sep 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
35
69
0
29 Sep 2022
GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions
Sanket Kalwar
Dhruv V. Patel
Aakash Aanegola
Krishna Reddy Konda
Sourav Garg
K. M. Krishna
61
37
0
29 Sep 2022
4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation
Lars Kreuzberg
Idil Esen Zulfikar
Sabarinath Mahadevan
Francis Engelmann
Bastian Leibe
3DPC
71
34
0
29 Sep 2022
DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators
Mohammad Imrul Jubair
Arafat Ibne Yousuf
Tashfiq Ahmed
Hasanath Jamy
Foisal Reza
Mohsena Ashraf
3DV
25
0
0
29 Sep 2022
View-Invariant Localization using Semantic Objects in Changing Environments
Jacqueline Ankenbauer
Kaveh Fathian
Jonathan P. How
20
2
0
28 Sep 2022
A Survey on Physical Adversarial Attack in Computer Vision
Donghua Wang
Wen Yao
Tingsong Jiang
Guijian Tang
Xiaoqian Chen
AAML
75
38
0
28 Sep 2022
Road Rutting Detection using Deep Learning on Images
Poonam Kumari Saha
Deeksha M. Arya
Ashutosh Kumar
Hiroya Maeda
Y. Sekimoto
31
9
0
28 Sep 2022
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Fengyuan Shi
Ruopeng Gao
Weilin Huang
Limin Wang
37
25
0
28 Sep 2022
EgoSpeed-Net: Forecasting Speed-Control in Driver Behavior from Egocentric Video Data
Yichen Ding
Ziming Zhang
Jun Luo
Xun Zhou
44
3
0
27 Sep 2022
A Pathologist-Informed Workflow for Classification of Prostate Glands in Histopathology
Alessandro Ferrero
Beatrice Knudsen
Deepika Sirohi
Ross T. Whitaker
38
0
0
27 Sep 2022
A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
Chaoqi Chen
Yushuang Wu
Qiyuan Dai
Hong-Yu Zhou
Mutian Xu
Sibei Yang
Xiaoguang Han
Yizhou Yu
ViT
MedIm
AI4CE
34
74
0
27 Sep 2022
Diversified Dynamic Routing for Vision Tasks
Botos Csaba
Adel Bibi
Yanwei Li
Philip Torr
Ser-Nam Lim
MoE
73
0
0
26 Sep 2022
DeepFusion: A Robust and Modular 3D Object Detector for Lidars, Cameras and Radars
F. Drews
Di Feng
F. Faion
Lars Rosenbaum
Michael Ulrich
Claudius Gläser
3DPC
38
21
0
26 Sep 2022
TAD: A Large-Scale Benchmark for Traffic Accidents Detection from Video Surveillance
Yajun Xu
Chuwen Huang
Yibing Nan
Kai Wang
77
8
0
26 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOS
VLM
29
58
0
25 Sep 2022
CryptoGCN: Fast and Scalable Homomorphically Encrypted Graph Convolutional Network Inference
Ran Ran
Nuo Xu
Wei Wang
Quan Gang
Jieming Yin
Wujie Wen
GNN
36
20
0
24 Sep 2022
Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes
J. Kim
M. Urschler
Patricia J. Riddle
Jörg Simon Wicker
45
6
0
24 Sep 2022
Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs
Youya Xia
Josephine Monica
Wei-Lun Chao
Bharath Hariharan
Kilian Q. Weinberger
Mark E. Campbell
42
12
0
23 Sep 2022
Query-based Hard-Image Retrieval for Object Detection at Test Time
Edward W. Ayers
Jonathan Sadeghi
John Redford
Romain Mueller
P. Dokania
30
2
0
23 Sep 2022
Google Coral-based edge computing person reidentification using human parsing combined with analytical method
N. Gabdullin
A. Raskovalov
40
4
0
22 Sep 2022
Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization
Xue Yang
Gefan Zhang
Xiaojiang Yang
Yue Zhou
Wentao Wang
Jin Tang
Tao He
Junchi Yan
34
89
0
22 Sep 2022
SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation
Xiaoye Han
L. Yang
64
10
0
22 Sep 2022