ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 3,770 papers shown
Title
Processing and Segmentation of Human Teeth from 2D Images using Weakly
  Supervised Learning
Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning
Tomáš Kunzo
Viktor Kocur
Lukás Gajdosech
Martin Madaras
29
0
0
13 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
50
144
0
10 Nov 2023
Efficient Segmentation with Texture in Ore Images Based on
  Box-supervised Approach
Efficient Segmentation with Texture in Ore Images Based on Box-supervised Approach
Guodong Sun
Delong Huang
Yuting Peng
Lei Cheng
Bo Wu
Yang Zhang
32
2
0
10 Nov 2023
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond
  Examples
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples
Shashanka Venkataramanan
Ewa Kijak
Laurent Amsaleg
Yannis Avrithis
33
4
0
09 Nov 2023
Multi-Modal Gaze Following in Conversational Scenarios
Multi-Modal Gaze Following in Conversational Scenarios
Yuqi Hou
Zhongqun Zhang
Nora Horanyi
Jaewon Moon
Yihua Cheng
Hyung Jin Chang
27
5
0
09 Nov 2023
Visually Guided Model Predictive Robot Control via 6D Object Pose
  Localization and Tracking
Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking
Médéric Fourmy
Vojtech Priban
Jan Kristof Behrens
Nicolas Mansard
Josef Sivic
Vladimir Petrik
38
1
0
09 Nov 2023
Towards a Unified Transformer-based Framework for Scene Graph Generation
  and Human-object Interaction Detection
Towards a Unified Transformer-based Framework for Scene Graph Generation and Human-object Interaction Detection
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
ViT
39
11
0
03 Nov 2023
CADSim: Robust and Scalable in-the-wild 3D Reconstruction for
  Controllable Sensor Simulation
CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation
Jingkang Wang
S. Manivasagam
Yun Chen
Ze Yang
Ioan Andrei Bârsan
A. Yang
Wei-Chiu Ma
R. Urtasun
3DV
50
24
0
02 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
62
6
0
01 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
49
36
0
30 Oct 2023
DynPoint: Dynamic Neural Point For View Synthesis
DynPoint: Dynamic Neural Point For View Synthesis
Kaichen Zhou
Jia-Xing Zhong
Sangyun Shin
Kai Lu
Yiyuan Yang
Andrew Markham
A. Trigoni
21
18
0
29 Oct 2023
Audio-Visual Instance Segmentation
Audio-Visual Instance Segmentation
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
...
Wenzhen Yue
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM
VOS
36
2
0
28 Oct 2023
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
Xingrui Wang
Wufei Ma
Zhuowan Li
Adam Kortylewski
Alan Yuille
CoGe
27
12
0
27 Oct 2023
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
Zijie Song
Zhenzhen Hu
Richang Hong
SSL
51
0
0
27 Oct 2023
Feature Extraction and Classification from Planetary Science Datasets
  enabled by Machine Learning
Feature Extraction and Classification from Planetary Science Datasets enabled by Machine Learning
Conor A. Nixon
Zachary Yahn
Ethan Duncan
Ian Neidel
Alyssa Mills
...
K. Gansler
Charles Liles
C. Walker
Douglas Trent
John Santerre
11
1
0
26 Oct 2023
Automating lichen monitoring in ecological studies using instance
  segmentation of time-lapse images
Automating lichen monitoring in ecological studies using instance segmentation of time-lapse images
Safwen Naimi
Olfa Koubaa
W. Bouachir
Guillaume-Alexandre Bilodeau
Gregory Jeddore
Patricia Baines
David Correia
Andre Arsenault
11
0
0
26 Oct 2023
S$^3$-TTA: Scale-Style Selection for Test-Time Augmentation in
  Biomedical Image Segmentation
S3^33-TTA: Scale-Style Selection for Test-Time Augmentation in Biomedical Image Segmentation
Kangxian Xie
Siyu Huang
Sebastian Cajas Ordone
Hanspeter Pfister
D. Wei
MedIm
21
0
0
25 Oct 2023
Gramian Attention Heads are Strong yet Efficient Vision Learners
Gramian Attention Heads are Strong yet Efficient Vision Learners
Jongbin Ryu
Dongyoon Han
J. Lim
38
1
0
25 Oct 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
53
1
0
25 Oct 2023
FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal
  Consistency and Correlation Debiasing
FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal Consistency and Correlation Debiasing
Anant Khandelwal
40
2
0
24 Oct 2023
M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal
  Aspect-based Sentiment Analysis
M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis
Fei Zhao
Chunhui Li
Zhen Wu
Yawen Ouyang
Jianbing Zhang
Xinyu Dai
57
16
0
23 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain
  Translation of Dotted Arabic Expiration
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
21
0
0
21 Oct 2023
Semi-supervised multimodal coreference resolution in image narrations
Semi-supervised multimodal coreference resolution in image narrations
A. Goel
Basura Fernando
Frank Keller
Hakan Bilen
52
4
0
20 Oct 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
27
3
0
20 Oct 2023
Learning Object Permanence from Videos via Latent Imaginations
Learning Object Permanence from Videos via Latent Imaginations
Manuel Traub
Frederic Becker
S. Otte
Martin Volker Butz
38
1
0
16 Oct 2023
MAC: ModAlity Calibration for Object Detection
MAC: ModAlity Calibration for Object Detection
Yutian Lei
Jun Liu
Dong Huang
ObjD
20
0
0
14 Oct 2023
UniParser: Multi-Human Parsing with Unified Correlation Representation
  Learning
UniParser: Multi-Human Parsing with Unified Correlation Representation Learning
Jiaming Chu
Lei Jin
Junliang Xing
Jian-jun Zhao
39
0
0
13 Oct 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
...
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
SSL
77
43
0
12 Oct 2023
CrIBo: Self-Supervised Learning via Cross-Image Object-Level
  Bootstrapping
CrIBo: Self-Supervised Learning via Cross-Image Object-Level Bootstrapping
Tim Lebailly
Thomas Stegmüller
Behzad Bozorgtabar
Jean-Philippe Thiran
Tinne Tuytelaars
SSL
54
6
0
11 Oct 2023
Causal Unsupervised Semantic Segmentation
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
41
18
0
11 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
39
4
0
10 Oct 2023
3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic
  Indoor Environments
3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments
G. S. Krishna
Kundrapu Supriya
S. Baidya
3DPC
37
3
0
10 Oct 2023
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Zhaofeng Shi
Qingbo Wu
Fanman Meng
Linfeng Xu
Hongliang Li
VOS
33
3
0
10 Oct 2023
Joint object detection and re-identification for 3D obstacle
  multi-camera systems
Joint object detection and re-identification for 3D obstacle multi-camera systems
Irene Cortés
Jorge Beltrán
A. D. L. Escalera
Fernando García
3DPC
27
0
0
09 Oct 2023
Uni3DETR: Unified 3D Detection Transformer
Uni3DETR: Unified 3D Detection Transformer
Zhenyu Wang
Yali Li
Xi Chen
Hengshuang Zhao
Shengjin Wang
3DPC
54
18
0
09 Oct 2023
Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for
  Accurate Object Detection
Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object Detection
Yilong Lv
Min Li
Yujie He
Shaopeng Li
Zhuzhen He
Aitao Yang
29
1
0
09 Oct 2023
Cross-Task Data Augmentation by Pseudo-label Generation for Region Based
  Coronary Artery Instance Segmentation
Cross-Task Data Augmentation by Pseudo-label Generation for Region Based Coronary Artery Instance Segmentation
Sandesh Pokhrel
Sanjay Bhandari
Eduard Vazquez
Yash Raj Shrestha
Binod Bhattarai
25
0
0
08 Oct 2023
MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from
  Chest X-ray Images
MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from Chest X-ray Images
Huyen Tran
D. Nguyen
John Yearwood
MedIm
ViT
40
0
0
30 Sep 2023
Two-Step Active Learning for Instance Segmentation with Uncertainty and
  Diversity Sampling
Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling
Ke Yu
Yuanmin Tang
Giulia DeSalvo
Suraj Kothawade
Abdullah Rashwan
S. Tavakkol
Kayhan Batmanghelich
Xiaoqi Yin
ISeg
32
0
0
28 Sep 2023
InfraParis: A multi-modal and multi-task autonomous driving dataset
InfraParis: A multi-modal and multi-task autonomous driving dataset
Gianni Franchi
Marwane Hariat
Xuanlong Yu
Nacim Belkhir
Antoine Manzanera
David Filliat
19
9
0
27 Sep 2023
Domain generalization across tumor types, laboratories, and species --
  insights from the 2022 edition of the Mitosis Domain Generalization Challenge
Domain generalization across tumor types, laboratories, and species -- insights from the 2022 edition of the Mitosis Domain Generalization Challenge
Marc Aubreville
N. Stathonikos
T. Donovan
R. Klopfleisch
J. Ganz
...
Yongbing Zhang
Sen Yang
Xiyue Wang
Katharina Breininger
C. Bertram
39
16
0
27 Sep 2023
Hashing Neural Video Decomposition with Multiplicative Residuals in
  Space-Time
Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time
Cheng-Hung Chan
Chengtao Yuan
Cheng Sun
Hwann-Tzong Chen
37
5
0
25 Sep 2023
Autonomous Apple Fruitlet Sizing with Next Best View Planning
Autonomous Apple Fruitlet Sizing with Next Best View Planning
Harry Freeman
George Kantor
27
3
0
24 Sep 2023
Being Aware of Localization Accuracy By Generating Predicted-IoU-Guided
  Quality Scores
Being Aware of Localization Accuracy By Generating Predicted-IoU-Guided Quality Scores
Peng Liu
Weibo Wang
Yuhan Guo
Jiubin Tan
35
1
0
23 Sep 2023
UniHead: Unifying Multi-Perception for Detection Heads
UniHead: Unifying Multi-Perception for Detection Heads
Hantao Zhou
Rui Yang
Yachao Zhang
Haoran Duan
Yawen Huang
R. Hu
Xiu Li
Yefeng Zheng
33
12
0
23 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary
  Instance Segmentation
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
72
35
0
22 Sep 2023
Detect Everything with Few Examples
Detect Everything with Few Examples
Xinyu Zhang
Yuting Wang
Abdeslam Boularias
ObjD
VLM
37
13
0
22 Sep 2023
SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on
  Scene Graphs
SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs
Guangyao Zhai
Xiaoni Cai
Dianye Huang
Yan Di
Fabian Manhardt
Federico Tombari
Nassir Navab
Benjamin Busam
LM&Ro
32
27
0
21 Sep 2023
ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average
  Texture and Mesh Encoding
ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding
Yu Cheng
Bo Wang
R. Tan
3DH
42
0
0
21 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
43
12
0
21 Sep 2023
Previous
123...8910...747576
Next