Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.06870
Cited By
Mask R-CNN
20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mask R-CNN"
50 / 3,806 papers shown
Title
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection
Hwanjun Song
Jihwan Bang
VLM
ObjD
34
14
0
25 Mar 2023
DoNet: Deep De-overlapping Network for Cytology Instance Segmentation
Hao Jiang
Rushan Zhang
Yanning Zhou
Yumeng Wang
Hao Chen
ISeg
35
21
0
25 Mar 2023
Lay Text Summarisation Using Natural Language Processing: A Narrative Literature Review
Oliver Vinzelberg
M. Jenkins
Gordon Morison
David McMinn
Z. Tieges
39
6
0
24 Mar 2023
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
Pavan Kumar Anasosalu Vasu
J. Gabriel
Jeff J. Zhu
Oncel Tuzel
Anurag Ranjan
ViT
42
155
0
24 Mar 2023
OPDMulti: Openable Part Detection for Multiple Objects
Xiaohao Sun
Hanxiao Jiang
Manolis Savva
Angel X. Chang
AI4CE
35
16
0
24 Mar 2023
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
Jiefeng Ma
Jun Du
Pengfei Hu
Zhenrong Zhang
Jianshu Zhang
Huihui Zhu
Cong Liu
29
15
0
24 Mar 2023
Three ways to improve feature alignment for open vocabulary detection
Relja Arandjelović
A. Andonian
A. Mensch
Olivier J. Hénaff
Jean-Baptiste Alayrac
Andrew Zisserman
VLM
ObjD
55
19
0
23 Mar 2023
RGB-D-Inertial SLAM in Indoor Dynamic Environments with Long-term Large Occlusion
Ran Long
C. Rauch
V. Ivan
Tin Lun Lam
S. Vijayakumar
45
5
0
23 Mar 2023
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels
Zhendong Yang
Ailing Zeng
Zhe Li
Tianke Zhang
Chun Yuan
Yu Li
40
74
0
23 Mar 2023
Rigidity-Aware Detection for 6D Object Pose Estimation
Yang Hai
Rui Song
Jiaojiao Li
Mathieu Salzmann
Yinlin Hu
3DPC
39
15
0
22 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
50
140
0
21 Mar 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
43
47
0
21 Mar 2023
Equiangular Basis Vectors
Yang Shen
Xuhao Sun
Xiuying Wei
41
7
0
21 Mar 2023
Detecting the open-world objects with the help of the Brain
Shuailei Ma
Yuefeng Wang
Ying-yu Wei
Peihao Chen
Zhixiang Ye
Jiaqi Fan
Enming Zhang
Thomas H. Li
VLM
ObjD
29
2
0
21 Mar 2023
Agave crop segmentation and maturity classification with deep learning data-centric strategies using very high-resolution satellite imagery
Abraham Sánchez
Raul Nanclares
A. Quevedo
Ulises Pelagio
Alejandra Aguilar
Gabriela Calvario
E. U. Moya-Sánchez
21
2
0
21 Mar 2023
Texture Learning Domain Randomization for Domain Generalized Segmentation
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
23
18
0
21 Mar 2023
Learning to Explore Informative Trajectories and Samples for Embodied Perception
Ya Jing
Tao Kong
29
5
0
20 Mar 2023
A Region-Prompted Adapter Tuning for Visual Abductive Reasoning
Hao Zhang
Yeo Keat Ee
Basura Fernando
VLM
34
3
0
18 Mar 2023
Multi-Semantic Interactive Learning for Object Detection
Shuxin Wang
Zhichao Zheng
Yanhui Gu
Junsheng Zhou
Yi Chen
ObjD
21
0
0
18 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
31
3
0
16 Mar 2023
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He
Pengyu Li
Yifeng Geng
Xuansong Xie
ISeg
VLM
41
44
0
15 Mar 2023
Active Teacher for Semi-Supervised Object Detection
Peng Mi
Jianghang Lin
Yiyi Zhou
Yunhang Shen
Gen Luo
Xiaoshuai Sun
Liujuan Cao
Rongrong Fu
Qiang Xu
Rongrong Ji
57
61
0
15 Mar 2023
DynaMask: Dynamic Mask Selection for Instance Segmentation
Ruihuang Li
Chenhang He
Shuai Li
Yabin Zhang
Lei Zhang
ISeg
32
16
0
14 Mar 2023
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Tianlin Li
Ying Wang
Ziwei Xuan
Guo-Jun Qi
ViT
48
3
0
14 Mar 2023
Synthesizing Realistic Image Restoration Training Pairs: A Diffusion Approach
Tao Yang
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
37
15
0
13 Mar 2023
Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling
Yongshuai Huang
Ning Lu
Dapeng Chen
Yibo Li
Zecheng Xie
Shenggao Zhu
Liangcai Gao
Wei Peng
37
27
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
37
1
0
13 Mar 2023
Object-Centric Multi-Task Learning for Human Instances
Hyeongseok Son
Sang-Il Jung
Solae Lee
Seong-heum Kim
Seungsang Park
ByungIn Yoo
3DH
36
0
0
13 Mar 2023
Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark
Jiayang Ao
Qiuhong Ke
Krista A. Ehinger
42
1
0
12 Mar 2023
DECOMPL: Decompositional Learning with Attention Pooling for Group Activity Recognition from a Single Volleyball Image
Berker Demirel
Huseyin Ozkan
33
2
0
11 Mar 2023
Uncovering the Handwritten Text in the Margins: End-to-end Handwritten Text Detection and Recognition
Liang Cheng
Jonas Frankemölle
Adam Axelsson
Ekta Vats
26
3
0
10 Mar 2023
HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining
Shixiang Tang
Cheng Chen
Qingsong Xie
Meilin Chen
Yizhou Wang
...
Feng Zhu
Haiyang Yang
Li Yi
Rui Zhao
Wanli Ouyang
VLM
37
36
0
10 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
124
1,831
0
09 Mar 2023
Rethinking Self-Supervised Visual Representation Learning in Pre-training for 3D Human Pose and Shape Estimation
Hongsuk Choi
Hyeongjin Nam
T. Lee
Gyeongsik Moon
Kyoung Mu Lee
44
7
0
09 Mar 2023
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset
Md. Istiak Hossain Shihab
Md. Rakibul Hasan
Mahfuzur Rahman Emon
Syed Mobassir Hossen
Md. Nazmuddoha Ansary
...
Sayma Sultana Chowdhury
Farig Sadeque
Tahsin Reasat
Ahmed Imtiaz Humayun
Asif Sushmit
29
13
0
09 Mar 2023
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Minh-Quan Le
Tam V. Nguyen
Trung-Nghia Le
Thanh-Toan Do
Minh N. Do
M. Tran
DiffM
48
13
0
09 Mar 2023
Imbalanced Open Set Domain Adaptation via Moving-threshold Estimation and Gradual Alignment
Jinghan Ru
Jun Tian
Zhekai Du
Chengwei Xiao
Jingjing Li
H. Shen
40
12
0
08 Mar 2023
A Method for Animating Children's Drawings of the Human Figure
H. Smith
Qingyuan Zheng
Yifei Li
Somya Jain
Jessica Hodgins
40
25
0
07 Mar 2023
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Yanxin Long
Youpeng Wen
Jianhua Han
Hang Xu
Pengzhen Ren
Wei Zhang
Sheng Zhao
Xiaodan Liang
ObjD
VLM
20
31
0
04 Mar 2023
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling
Yuan Liu
Songyang Zhang
Jiacheng Chen
Kai-xiang Chen
Dahua Lin
75
28
0
04 Mar 2023
X
3
^3
3
KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection
Marvin Klingner
Shubhankar Borse
V. Kumar
B. Rezaei
V. Narayanan
S. Yogamani
Fatih Porikli
47
21
0
03 Mar 2023
Depth-based 6DoF Object Pose Estimation using Swin Transformer
Zhujun Li
I. Stamos
ViT
43
11
0
03 Mar 2023
BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap
You Shen
Yunzhou Zhang
Yanmin Wu
Zhenyu Wang
Linghao Yang
Sonya A. Coleman
D. Kerr
3DPC
35
2
0
03 Mar 2023
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Xiwen Liang
Minzhe Niu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
VLM
36
14
0
03 Mar 2023
Feature Completion Transformer for Occluded Person Re-identification
Tao Wang
Mengyuan Liu
Hong Liu
Wenhao Li
Miaoju Ban
Tuanyu Guo
Yidi Li
ViT
39
13
0
03 Mar 2023
Alexa Arena: A User-Centric Interactive Platform for Embodied AI
Qiaozi Gao
Govind Thattai
Suhaila Shakiah
Xiaofeng Gao
Shreyas Pansare
...
Michael Johnston
R. Ghanadan
Arindam Mandal
Dilek Z. Hakkani-Tür
Premkumar Natarajan
6
27
0
02 Mar 2023
Dropout Reduces Underfitting
Zhuang Liu
Zhi-Qin John Xu
Joseph Jin
Zhiqiang Shen
Trevor Darrell
52
36
0
02 Mar 2023
Predicting Motion Plans for Articulating Everyday Objects
Arjun Gupta
Max E. Shepherd
Saurabh Gupta
35
8
0
02 Mar 2023
3D generation on ImageNet
Ivan Skorokhodov
Aliaksandr Siarohin
Yinghao Xu
Jian Ren
Hsin-Ying Lee
Peter Wonka
Sergey Tulyakov
69
55
0
02 Mar 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models
Austin Stone
Ted Xiao
Yao Lu
K. Gopalakrishnan
Kuang-Huei Lee
...
Sean Kirmani
Brianna Zitkovich
F. Xia
Chelsea Finn
Karol Hausman
LM&Ro
156
145
0
02 Mar 2023
Previous
1
2
3
...
14
15
16
...
75
76
77
Next