Mask R-CNN

20 March 2017

Piotr Dollár

Papers citing "Mask R-CNN"

50 / 4,190 papers shown

Title
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding Kirill Mazur Edgar Sucar Andrew J. Davison 3DPC AI4CE 96 45 0 06 Oct 2022
Effective Self-supervised Pre-training on Low-compute Networks without Distillation Fuwen Tan F. Saleh Brais Martínez 56 4 0 06 Oct 2022
Vision-Based Defect Classification and Weight Estimation of Rice Kernels Ehtesham Iqbal Asim Niaz Asif Aziz Memon K. Choi 27 3 0 06 Oct 2022
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation Khoa T. Vo Sang Truong Kashu Yamazaki Bhiksha Raj Minh-Triet Tran Ngan Le 93 27 0 05 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics Ivan Kapelyukh Vitalis Vosylius Edward Johns LM&Ro DiffM 127 147 0 05 Oct 2022
Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection K. Hashmi D. Stricker Muhammamd Zeshan Afzal 39 7 0 05 Oct 2022
Promising or Elusive? Unsupervised Object Segmentation from Real-world Single Images Yafei Yang Bo Yang OCL 129 17 0 05 Oct 2022
Centralized Feature Pyramid for Object Detection Yu Quan Dong Zhang Liyan Zhang Jinhui Tang ObjD 38 158 0 05 Oct 2022
MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation Hanwei Zhang Hideaki Uchiyama Shintaro Ono Hiroshi Kawasaki 28 15 0 05 Oct 2022
Centerpoints Are All You Need in Overhead Imagery James Mason Inder Mark Lowell A. J. Maltenfort 3DPC 44 2 0 04 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models Chenglin Yang Siyuan Qiao Qihang Yu Xiaoding Yuan Yukun Zhu Alan Yuille Hartwig Adam Liang-Chieh Chen ViT MoE 59 60 0 04 Oct 2022
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions Bruno Berenguel-Baeta J. Bermudez-Cameo Jose J. Guerrero MDE 67 15 0 04 Oct 2022
Open-source High-precision Autonomous Suturing Framework With Visual Guidance Hongbin Lin Bin Li Yunhui Liu K. W. S. Au 52 4 0 04 Oct 2022
Generative Category-Level Shape and Pose Estimation with Semantic Primitives Guanglin Li Yifeng Li Zhichao Ye Qihang Zhang Tao Kong Zhaopeng Cui Guofeng Zhang 49 24 0 03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance Susung Hong Gyuseong Lee Wooseok Jang Seung Wook Kim DiffM 40 97 0 03 Oct 2022
Unbiased Scene Graph Generation using Predicate Similarities Misaki Ohashi Yusuke Matsui 58 1 0 03 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying Wenguan Wang James Liang Dongfang Liu ISeg 52 48 0 03 Oct 2022
Heatmap Distribution Matching for Human Pose Estimation Haoxuan Qu Li Xu Yujun Cai Lin Geng Foo Jun Liu 37 14 0 03 Oct 2022
DFA: Dynamic Feature Aggregation for Efficient Video Object Detection Yiming Cui 50 8 0 02 Oct 2022
Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation Xinhang Liu Jiaben Chen Huai Yu Yu-Wing Tai Chi-Keung Tang 97 28 0 02 Oct 2022
Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 A. Eslamian M. Ahmadzadeh 37 6 0 01 Oct 2022
Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement Zirui Zhao W. Lee David Hsu OOD 37 9 0 01 Oct 2022
An In-depth Study of Stochastic Backpropagation J. Fang Ming Xu Hao Chen Bing Shuai Zhuowen Tu Joseph Tighe BDL 43 2 0 30 Sep 2022
BayesFT: Bayesian Optimization for Fault Tolerant Neural Network Architecture Nanyang Ye Jingbiao Mei Zhicheng Fang Yuwen Zhang Ziqing Zhang Huaying Wu Xiaoyao Liang OOD 33 5 0 30 Sep 2022
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models Weicheng Kuo Huayu Chen Xiuye Gu A. Piergiovanni A. Angelova MLLM VLM ObjD 93 135 0 30 Sep 2022
Automated Characterization of Catalytically Active Inclusion Body Production in Biotechnological Screening Systems Karina Ruzaeva K. Küsters W. Wiechert B. Berkels M. Oldiges K. Nöh 18 0 0 30 Sep 2022
Slimmable Networks for Contrastive Self-supervised Learning Shuai Zhao Xiaohan Wang Linchao Zhu Yi Yang 40 1 0 30 Sep 2022
NBV-SC: Next Best View Planning based on Shape Completion for Fruit Mapping and Reconstruction Rohit Menon Tobias Zaenker Nils Dengler Maren Bennewitz 44 27 0 30 Sep 2022
Dilated Neighborhood Attention Transformer Ali Hassani Humphrey Shi ViT MedIm 35 69 0 29 Sep 2022
GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions Sanket Kalwar Dhruv V. Patel Aakash Aanegola Krishna Reddy Konda Sourav Garg K. M. Krishna 61 37 0 29 Sep 2022
4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation Lars Kreuzberg Idil Esen Zulfikar Sabarinath Mahadevan Francis Engelmann Bastian Leibe 3DPC 71 34 0 29 Sep 2022
DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators Mohammad Imrul Jubair Arafat Ibne Yousuf Tashfiq Ahmed Hasanath Jamy Foisal Reza Mohsena Ashraf 3DV 25 0 0 29 Sep 2022
View-Invariant Localization using Semantic Objects in Changing Environments Jacqueline Ankenbauer Kaveh Fathian Jonathan P. How 20 2 0 28 Sep 2022
A Survey on Physical Adversarial Attack in Computer Vision Donghua Wang Wen Yao Tingsong Jiang Guijian Tang Xiaoqian Chen AAML 75 38 0 28 Sep 2022
Road Rutting Detection using Deep Learning on Images Poonam Kumari Saha Deeksha M. Arya Ashutosh Kumar Hiroya Maeda Y. Sekimoto 31 9 0 28 Sep 2022
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding Fengyuan Shi Ruopeng Gao Weilin Huang Limin Wang 37 25 0 28 Sep 2022
EgoSpeed-Net: Forecasting Speed-Control in Driver Behavior from Egocentric Video Data Yichen Ding Ziming Zhang Jun Luo Xun Zhou 44 3 0 27 Sep 2022
A Pathologist-Informed Workflow for Classification of Prostate Glands in Histopathology Alessandro Ferrero Beatrice Knudsen Deepika Sirohi Ross T. Whitaker 38 0 0 27 Sep 2022
A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective Chaoqi Chen Yushuang Wu Qiyuan Dai Hong-Yu Zhou Mutian Xu Sibei Yang Xiaoguang Han Yizhou Yu ViT MedIm AI4CE 34 74 0 27 Sep 2022
Diversified Dynamic Routing for Vision Tasks Botos Csaba Adel Bibi Yanwei Li Philip Torr Ser-Nam Lim MoE 73 0 0 26 Sep 2022
DeepFusion: A Robust and Modular 3D Object Detector for Lidars, Cameras and Radars F. Drews Di Feng F. Faion Lars Rosenbaum Michael Ulrich Claudius Gläser 3DPC 38 21 0 26 Sep 2022
TAD: A Large-Scale Benchmark for Traffic Accidents Detection from Video Surveillance Yajun Xu Chuwen Huang Yibing Nan Kai Wang 77 8 0 26 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video A. Athar Jonathon Luiten P. Voigtlaender Tarasha Khurana Achal Dave Bastian Leibe Deva Ramanan VOS VLM 29 58 0 25 Sep 2022
CryptoGCN: Fast and Scalable Homomorphically Encrypted Graph Convolutional Network Inference Ran Ran Nuo Xu Wei Wang Quan Gang Jieming Yin Wujie Wen GNN 36 20 0 24 Sep 2022
Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes J. Kim M. Urschler Patricia J. Riddle Jörg Simon Wicker 45 6 0 24 Sep 2022
Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs Youya Xia Josephine Monica Wei-Lun Chao Bharath Hariharan Kilian Q. Weinberger Mark E. Campbell 42 12 0 23 Sep 2022
Query-based Hard-Image Retrieval for Object Detection at Test Time Edward W. Ayers Jonathan Sadeghi John Redford Romain Mueller P. Dokania 30 2 0 23 Sep 2022
Google Coral-based edge computing person reidentification using human parsing combined with analytical method N. Gabdullin A. Raskovalov 40 4 0 22 Sep 2022
Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization Xue Yang Gefan Zhang Xiaojiang Yang Yue Zhou Wentao Wang Jin Tang Tao He Junchi Yan 34 89 0 22 Sep 2022
SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation Xiaoye Han L. Yang 64 10 0 22 Sep 2022