ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 3,770 papers shown
Title
Sketch and Refine: Towards Fast and Accurate Lane Detection
Sketch and Refine: Towards Fast and Accurate Lane Detection
Chao Chen
Jie Liu
Chang Zhou
Jie Tang
Gangshan Wu
36
3
0
26 Jan 2024
Learning to Manipulate Artistic Images
Learning to Manipulate Artistic Images
Wei Guo
Yuqi Zhang
De Ma
Qian Zheng
41
0
0
25 Jan 2024
Collaborative Position Reasoning Network for Referring Image
  Segmentation
Collaborative Position Reasoning Network for Referring Image Segmentation
Jianjian Cao
Beiya Dai
Yulin Li
Xiameng Qin
Jingdong Wang
33
0
0
22 Jan 2024
Bridging the gap between image coding for machines and humans
Bridging the gap between image coding for machines and humans
Nam Le
Honglei Zhang
Francesco Cricri
Ramin Ghaznavi-Youvalari
Hamed R. Tavakoli
Emre B. Aksu
M. Hannuksela
Esa Rahtu
29
5
0
19 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask
  Inpainting
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
46
11
0
18 Jan 2024
P2Seg: Pointly-supervised Segmentation via Mutual Distillation
P2Seg: Pointly-supervised Segmentation via Mutual Distillation
Zipeng Wang
Xuehui Yu
Xumeng Han
Wenwen Yu
Zhixun Huang
Jianbin Jiao
Zhenjun Han
36
0
0
18 Jan 2024
Exploring the Role of Convolutional Neural Networks (CNN) in Dental
  Radiography Segmentation: A Comprehensive Systematic Literature Review
Exploring the Role of Convolutional Neural Networks (CNN) in Dental Radiography Segmentation: A Comprehensive Systematic Literature Review
Walid Brahmi
Imen Jdey
Fadoua Drira
25
16
0
17 Jan 2024
Trapped in texture bias? A large scale comparison of deep instance
  segmentation
Trapped in texture bias? A large scale comparison of deep instance segmentation
J. Theodoridis
Jessica Hofmann
J. Maucher
A. Schilling
SSeg
37
5
0
17 Jan 2024
Object-Oriented Semantic Mapping for Reliable UAVs Navigation
Object-Oriented Semantic Mapping for Reliable UAVs Navigation
Thanh Nguyen Canh
A. Elibol
N. Chong
Xiem HoangVan
44
7
0
16 Jan 2024
Semantic Scene Segmentation for Robotics
Semantic Scene Segmentation for Robotics
Juana Valeria Hurtado
Abhinav Valada
VLM
SSeg
44
27
0
15 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
77
1
0
15 Jan 2024
Evaluating Data Augmentation Techniques for Coffee Leaf Disease
  Classification
Evaluating Data Augmentation Techniques for Coffee Leaf Disease Classification
Adrian Gheorghiu
Iulian-Marius Taiatu
Dumitru-Clementin Cercel
Iuliana Marin
Florin-Catalin Pop
57
2
0
11 Jan 2024
GloTSFormer: Global Video Text Spotting Transformer
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
42
0
0
08 Jan 2024
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
Haopeng Li
Andong Deng
Qiuhong Ke
Jun Liu
Hossein Rahmani
Yulan Guo
Mohammed Bennamoun
Chen Chen
61
17
0
03 Jan 2024
PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
Zheng Chen
Qingan Yan
Huangying Zhan
Changjiang Cai
Xiangyu Xu
Yuzhong Huang
Weihan Wang
Ziyue Feng
Lantao Liu
Yi Tian Xu
3DV
64
3
0
30 Dec 2023
Bin-picking of novel objects through category-agnostic-segmentation: RGB
  matters
Bin-picking of novel objects through category-agnostic-segmentation: RGB matters
Prem Raj
S. Bhadang
Gaurav Chaudhary
Laxmidhar Behera
Tushar Sandhan
41
1
0
27 Dec 2023
Adaptive Depth Networks with Skippable Sub-Paths
Adaptive Depth Networks with Skippable Sub-Paths
Woochul Kang
48
1
0
27 Dec 2023
Semantic-aware SAM for Point-Prompted Instance Segmentation
Semantic-aware SAM for Point-Prompted Instance Segmentation
Zhaoyang Wei
Pengfei Chen
Xuehui Yu
Guorong Li
Jianbin Jiao
Zhenjun Han
VLM
40
6
0
26 Dec 2023
A graph-based multimodal framework to predict gentrification
A graph-based multimodal framework to predict gentrification
Javad Eshtiyagh
Baotong Zhang
Yujing Sun
Linhui Wu
Zhao Wang
22
0
0
25 Dec 2023
Domain Similarity-Perceived Label Assignment for Domain Generalized
  Underwater Object Detection
Domain Similarity-Perceived Label Assignment for Domain Generalized Underwater Object Detection
Xisheng Li
Wei Li
Pinhao Song
Mingjun Zhang
Jie-Gui Zhou
29
0
0
20 Dec 2023
Cached Transformers: Improving Transformers with Differentiable Memory
  Cache
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Zhaoyang Zhang
Wenqi Shao
Yixiao Ge
Xiaogang Wang
Liang Feng
Ping Luo
19
2
0
20 Dec 2023
Zero-shot Building Attribute Extraction from Large-Scale Vision and
  Language Models
Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models
Fei Pan
Sangryul Jeon
Brian Wang
Frank Mckenna
Stella X. Yu
44
2
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg
ObjD
50
15
0
19 Dec 2023
Learning Subject-Aware Cropping by Outpainting Professional Photos
Learning Subject-Aware Cropping by Outpainting Professional Photos
James Hong
Lu Yuan
Michael Gharbi
Matthew Fisher
Kayvon Fatahalian
35
2
0
19 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
40
3
0
18 Dec 2023
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition
Peng Shen
Xugang Lu
Hisashi Kawai
35
1
0
18 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
44
18
0
13 Dec 2023
Artificial Intelligence for Digital and Computational Pathology
Artificial Intelligence for Digital and Computational Pathology
Andrew H. Song
Guillaume Jaume
Drew F. K. Williamson
Ming Y. Lu
Anurag J. Vaidya
Tiffany R. Miller
Faisal Mahmood
AI4CE
30
130
0
13 Dec 2023
MaxQ: Multi-Axis Query for N:M Sparsity Network
MaxQ: Multi-Axis Query for N:M Sparsity Network
Jingyang Xiang
Siqi Li
Junhao Chen
Zhuangzhi Chen
Tianxin Huang
Linpeng Peng
Yong-Jin Liu
18
0
0
12 Dec 2023
4M: Massively Multimodal Masked Modeling
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
55
64
0
11 Dec 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance
  Segmentation
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
31
2
0
11 Dec 2023
Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops
Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops
Aditya Prakash
Arjun Gupta
Saurabh Gupta
28
3
0
11 Dec 2023
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for
  Cell Instance Segmentation
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation
Ming Kang
Chee-Ming Ting
F. F. Ting
Raphaël C.-W. Phan
29
137
0
11 Dec 2023
Investigating YOLO Models Towards Outdoor Obstacle Detection For
  Visually Impaired People
Investigating YOLO Models Towards Outdoor Obstacle Detection For Visually Impaired People
Chenhao He
Pramit Saha
41
3
0
10 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage
  Multi-Person Multi-Task Human-Centric Perception
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
42
5
0
09 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
51
83
0
06 Dec 2023
Automated Multimodal Data Annotation via Calibration With Indoor
  Positioning System
Automated Multimodal Data Annotation via Calibration With Indoor Positioning System
Ryan Rubel
Andrew Dudash
Mohammad Goli
James O'Hara
Karl Wunderlich
18
0
0
06 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment
  Anything
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
42
141
0
01 Dec 2023
JPPF: Multi-task Fusion for Consistent Panoptic-Part Segmentation
JPPF: Multi-task Fusion for Consistent Panoptic-Part Segmentation
Shishir Muralidhara
Sravan Kumar Jagadeesh
René Schuster
Didier Stricker
29
1
0
30 Nov 2023
ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs
ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs
Violeta Menéndez González
Andrew Gilbert
Graeme Phillipson
Stephen Jolly
Simon Hadfield
30
0
0
30 Nov 2023
Marine$\mathcal{X}$: Design and Implementation of Unmanned Surface
  Vessel for Vision Guided Navigation
MarineX\mathcal{X}X: Design and Implementation of Unmanned Surface Vessel for Vision Guided Navigation
Muhayy ud Din
Ahmed Humais
Waseem Akram
Mohamed Alblooshi
L. Saad Saoud
Abdelrahman Alblooshi
Lakmal Seneviratne
Irfan Hussain
18
0
0
28 Nov 2023
Continual Referring Expression Comprehension via Dual Modular
  Memorization
Continual Referring Expression Comprehension via Dual Modular Memorization
Hengtao Shen
Cheng Chen
Peng Wang
Lianli Gao
Ming Wang
Jingkuan Song
ObjD
43
3
0
25 Nov 2023
Maximizing Discrimination Capability of Knowledge Distillation with Energy Function
Maximizing Discrimination Capability of Knowledge Distillation with Energy Function
Seonghak Kim
Gyeongdo Ham
Suin Lee
Donggon Jang
Daeshik Kim
36
4
0
24 Nov 2023
GigaPose: Fast and Robust Novel Object Pose Estimation via One
  Correspondence
GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence
Van Nguyen Nguyen
Thibault Groueix
Mathieu Salzmann
Vincent Lepetit
3DPC
3DH
43
50
0
23 Nov 2023
Labeling Neural Representations with Inverse Recognition
Labeling Neural Representations with Inverse Recognition
Kirill Bykov
Laura Kopf
Shinichi Nakajima
Marius Kloft
Marina M.-C. Höhne
BDL
41
15
0
22 Nov 2023
ADriver-I: A General World Model for Autonomous Driving
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
48
63
0
22 Nov 2023
DoubleAUG: Single-domain Generalized Object Detector in Urban via Color
  Perturbation and Dual-style Memory
DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory
Lei Qi
Peng Dong
Tan Xiong
Hui Xue
Xin Geng
44
4
0
22 Nov 2023
MaskFlow: Object-Aware Motion Estimation
MaskFlow: Object-Aware Motion Estimation
Aria Ahmadi
D. R. M. Walton
Tim Atherton
Çağatay Dikici
3DPC
18
0
0
21 Nov 2023
AR Visualization System for Ship Detection and Recognition Based on AI
AR Visualization System for Ship Detection and Recognition Based on AI
Ziqi Ye
Limin Huang
Yongji Wu
Min Hu
11
0
0
21 Nov 2023
On the Importance of Large Objects in CNN Based Object Detection
  Algorithms
On the Importance of Large Objects in CNN Based Object Detection Algorithms
Ahmed Ben Saad
Gabriele Facciolo
Axel Davy
ObjD
27
2
0
20 Nov 2023
Previous
123...789...747576
Next