ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 3,898 papers shown
Title
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised
  Framework with Spatio-Temporal Collaboration
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration
Liqi Yan
Qifan Wang
Siqi Ma
Jingang Wang
Changbin (Brad) Yu
VOS
37
38
0
15 Dec 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with
  Visual Queries
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
Jinjie Mai
Abdullah Hamdi
Silvio Giancola
Chen Zhao
Guohao Li
EgoV
45
14
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object
  Segmentation
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
41
39
0
13 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group
  Propagation
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
Xinyu Wang
ViT
48
22
0
13 Dec 2022
Learning 3D Representations from 2D Pre-trained Models via
  Image-to-Point Masked Autoencoders
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
Renrui Zhang
Liuhui Wang
Yu Qiao
Peng Gao
Hongsheng Li
3DPC
46
126
0
13 Dec 2022
FastMIM: Expediting Masked Image Modeling Pre-training for Vision
FastMIM: Expediting Masked Image Modeling Pre-training for Vision
Jianyuan Guo
Kai Han
Han Wu
Yehui Tang
Yunhe Wang
Chang Xu
38
9
0
13 Dec 2022
3rd Continual Learning Workshop Challenge on Egocentric Category and
  Instance Level Object Understanding
3rd Continual Learning Workshop Challenge on Egocentric Category and Instance Level Object Understanding
Lorenzo Pellegrini
Chenchen Zhu
Fanyi Xiao
Zhicheng Yan
Antonio Carta
Matthias De Lange
Vincenzo Lomonaco
Roshan Sumbaly
Pau Rodríguez López
David Vazquez
CLL
32
6
0
13 Dec 2022
CAT: Learning to Collaborate Channel and Spatial Attention from
  Multi-Information Fusion
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
19
3
0
13 Dec 2022
Adversarially Robust Video Perception by Seeing Motion
Adversarially Robust Video Perception by Seeing Motion
Lingyu Zhang
Chengzhi Mao
Junfeng Yang
Carl Vondrick
VGen
AAML
49
2
0
13 Dec 2022
Accidental Turntables: Learning 3D Pose by Watching Objects Turn
Accidental Turntables: Learning 3D Pose by Watching Objects Turn
Zezhou Cheng
Matheus Gadelha
Subhransu Maji
3DPC
32
1
0
13 Dec 2022
Test-time Adaptation vs. Training-time Generalization: A Case Study in
  Human Instance Segmentation using Keypoints Estimation
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation
K. Azarian
Debasmit Das
Hyojin Park
Fatih Porikli
3DH
OOD
26
3
0
12 Dec 2022
NMS Strikes Back
NMS Strikes Back
Jeffrey Ouyang-Zhang
Jang Hyun Cho
Xingyi Zhou
Philipp Krahenbuhl
35
38
0
12 Dec 2022
Robust Perception through Equivariance
Robust Perception through Equivariance
Chengzhi Mao
Lingyu Zhang
Abhishek Joshi
Junfeng Yang
Hongya Wang
Carl Vondrick
BDL
AAML
36
7
0
12 Dec 2022
Using Multiple Instance Learning to Build Multimodal Representations
Using Multiple Instance Learning to Build Multimodal Representations
Peiqi Wang
W. Wells
Seth Berkowitz
Steven Horng
Polina Golland
SSL
29
6
0
11 Dec 2022
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection
Shaoqing Xu
Fang Li
Dingfu Zhou
Jin Fang
Sifen Wang
Liangjun Zhang
3DPC
40
9
0
10 Dec 2022
PIVOT: Prompting for Video Continual Learning
PIVOT: Prompting for Video Continual Learning
Andrés Villa
Juan Carlos León Alcázar
Motasem Alfarra
Kumail Alhamoud
J. Hurtado
Fabian Caba Heilbron
Alvaro Soto
Guohao Li
VLM
CLL
45
45
0
09 Dec 2022
Category-Level 6D Object Pose Estimation with Flexible Vector-Based
  Rotation Representation
Category-Level 6D Object Pose Estimation with Flexible Vector-Based Rotation Representation
Wei Chen
Xi Jia
Zhongqun Zhang
H. Chang
Lin Shen
Jinming Duan
A. Leonardis
32
1
0
09 Dec 2022
Contrastive View Design Strategies to Enhance Robustness to Domain
  Shifts in Downstream Object Detection
Contrastive View Design Strategies to Enhance Robustness to Domain Shifts in Downstream Object Detection
Kyle Buettner
Adriana Kovashka
24
2
0
09 Dec 2022
VideoDex: Learning Dexterity from Internet Videos
VideoDex: Learning Dexterity from Internet Videos
Kenneth Shaw
Shikhar Bahl
Deepak Pathak
40
89
0
08 Dec 2022
Latent Graph Representations for Critical View of Safety Assessment
Latent Graph Representations for Critical View of Safety Assessment
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
31
19
0
08 Dec 2022
Relationship Quantification of Image Degradations
Relationship Quantification of Image Degradations
Wenxin Wang
Boyun Li
Yuanbiao Gou
Peng Hu
Wangmeng Zuo
Xiaocui Peng
37
5
0
08 Dec 2022
Deep Incubation: Training Large Models by Divide-and-Conquering
Deep Incubation: Training Large Models by Divide-and-Conquering
Zanlin Ni
Yulin Wang
Jiangwei Yu
Haojun Jiang
Yu Cao
Gao Huang
VLM
30
11
0
08 Dec 2022
Surround-view Fisheye BEV-Perception for Valet Parking: Dataset,
  Baseline and Distortion-insensitive Multi-task Framework
Surround-view Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-insensitive Multi-task Framework
Zizhang Wu
Yuanzhu Gan
Xianzhi Li
Yunzhe Wu
Xiaoquan Wang
Tianhao Xu
Fan Wang
31
9
0
08 Dec 2022
Generating and Weighting Semantically Consistent Sample Pairs for
  Ultrasound Contrastive Learning
Generating and Weighting Semantically Consistent Sample Pairs for Ultrasound Contrastive Learning
Yixiong Chen
Chunhui Zhang
C. Ding
Li Liu
39
14
0
08 Dec 2022
GAMMA: Generative Augmentation for Attentive Marine Debris Detection
GAMMA: Generative Augmentation for Attentive Marine Debris Detection
Vaishnavi Khindkar
Janhavi Khindkar
ViT
30
1
0
07 Dec 2022
Towards Automatic Cetacean Photo-Identification: A Framework for
  Fine-Grain, Few-Shot Learning in Marine Ecology
Towards Automatic Cetacean Photo-Identification: A Framework for Fine-Grain, Few-Shot Learning in Marine Ecology
Cameron Trotter
Nick Wright
A. Mcgough
Matt Sharpe
Barbara J. Cheney
Mònica Arso Civil
R. T. Moore
Jason B. Allen
Per Berggren
17
2
0
07 Dec 2022
AsyInst: Asymmetric Affinity with DepthGrad and Color for Box-Supervised
  Instance Segmentation
AsyInst: Asymmetric Affinity with DepthGrad and Color for Box-Supervised Instance Segmentation
Si-Jia Yang
Longlong Jing
Junfei Xiao
Hang Zhao
Alan Yuille
Yingwei Li
ISeg
27
2
0
07 Dec 2022
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for
  Autonomous Driving
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving
Xiang Li
Junbo Yin
Botian Shi
Yikang Li
Ruigang Yang
Jianbing Shen
ISeg
29
12
0
07 Dec 2022
MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality
  Microscopy
MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy
Gihun Lee
Sangmook Kim
Joonkee Kim
Se-Young Yun
MedIm
19
18
0
07 Dec 2022
UI Layers Group Detector: Grouping UI Layers via Text Fusion and Box
  Attention
UI Layers Group Detector: Grouping UI Layers via Text Fusion and Box Attention
Shuhong Xiao
Tingting Zhou
Yunnong Chen
Dengming Zhang
Liuqing Chen
Lingyun Sun
Shiyu Yue
26
4
0
07 Dec 2022
Learning Action-Effect Dynamics from Pairs of Scene-graphs
Learning Action-Effect Dynamics from Pairs of Scene-graphs
Shailaja Keyur Sampat
Pratyay Banerjee
Yezhou Yang
Chitta Baral
GNN
23
0
0
07 Dec 2022
SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields
SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields
Siddhant Ranade
Christoph Lassner
Keqin Li
Christian Haene
Shen-Chi Chen
Jean-Charles Bazin
Sofien Bouaziz
28
9
0
07 Dec 2022
InternVideo: General Video Foundation Models via Generative and
  Discriminative Learning
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang
Kunchang Li
Yizhuo Li
Yinan He
Bingkun Huang
...
Junting Pan
Jiashuo Yu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
62
313
0
06 Dec 2022
Iterative Next Boundary Detection for Instance Segmentation of Tree
  Rings in Microscopy Images of Shrub Cross Sections
Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections
Alexander Gillert
Giulia Resente
A. Anadon‐Rosell
M. Wilmking
U. V. Lukas
42
8
0
06 Dec 2022
Quantification of geogrid lateral restraint using transparent sand and
  deep learning-based image segmentation
Quantification of geogrid lateral restraint using transparent sand and deep learning-based image segmentation
D. Marx
Krishna Kumar
J. Zornberg
21
1
0
06 Dec 2022
Multimodal Tree Decoder for Table of Contents Extraction in Document
  Images
Multimodal Tree Decoder for Table of Contents Extraction in Document Images
Pengfei Hu
Zhenrong Zhang
Jianshu Zhang
Jun Du
Jiajia Wu
25
12
0
06 Dec 2022
Multi-Task Edge Prediction in Temporally-Dynamic Video Graphs
Multi-Task Edge Prediction in Temporally-Dynamic Video Graphs
Osman Ulger
Julian Wiederer
Mohsen Ghafoorian
Vasileios Belagiannis
Pascal Mettes
48
0
0
06 Dec 2022
Video Object of Interest Segmentation
Video Object of Interest Segmentation
Siyuan Zhou
Chunru Zhan
Biao Wang
T. Ge
Yuning Jiang
Li Niu
VOS
28
0
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
26
66
0
06 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
32
2
0
06 Dec 2022
PEANUT: Predicting and Navigating to Unseen Targets
PEANUT: Predicting and Navigating to Unseen Targets
Albert J. Zhai
Shenlong Wang
35
20
0
05 Dec 2022
Framework for 2D Ad placements in LinearTV
Framework for 2D Ad placements in LinearTV
D. Bhargavi
Karan Sindwani
Sia Gholami
DiffM
VGen
38
0
0
05 Dec 2022
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text
  Detection
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Xi Zhao
Wei Feng
Zheng Zhang
Jing Lv
Xin Zhu
Zhangang Lin
Jin Hu
Jingping Shao
52
5
0
05 Dec 2022
2D Human Pose Estimation with Explicit Anatomical Keypoints Structure
  Constraints
2D Human Pose Estimation with Explicit Anatomical Keypoints Structure Constraints
Zhangjian Ji
Zilong Wang
Ming Zhang
Yapeng Chen
Yuhua Qian
3DH
47
1
0
05 Dec 2022
Med-Query: Steerable Parsing of 9-DoF Medical Anatomies with Query
  Embedding
Med-Query: Steerable Parsing of 9-DoF Medical Anatomies with Query Embedding
Heng Guo
Jianfeng Zhang
K. Yan
Le Lu
Minfeng Xu
MedIm
24
2
0
05 Dec 2022
ObjectMatch: Robust Registration using Canonical Object Correspondences
ObjectMatch: Robust Registration using Canonical Object Correspondences
Can Gümeli
Angela Dai
Matthias Nießner
3DPC
40
12
0
05 Dec 2022
A PM2.5 concentration prediction framework with vehicle tracking system:
  From cause to effect
A PM2.5 concentration prediction framework with vehicle tracking system: From cause to effect
Chuong Dinh Le
H. V. Pham
Duy A. Pham
A. D. Le
H. Vo
14
2
0
04 Dec 2022
Visual Question Answering From Another Perspective: CLEVR Mental
  Rotation Tests
Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests
Christopher Beckham
Martin Weiss
Florian Golemo
S. Honari
Derek Nowrouzezahrai
C. Pal
28
7
0
03 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual
  Representation
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
27
13
0
03 Dec 2022
Make RepVGG Greater Again: A Quantization-aware Approach
Make RepVGG Greater Again: A Quantization-aware Approach
Xiangxiang Chu
Liang Li
Bo Zhang
MQ
53
48
0
03 Dec 2022
Previous
123...181920...767778
Next