ResearchTrend.AI
Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD

Papers citing "Mask R-CNN"

50 / 3,768 papers shown
Title
Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis
Brian K. S. Isaac-Medina
Yona Falinie A. Gaus
Neelanjan Bhowmik
T. Breckon
28
2
0
22 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
31
3
0
20 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
38
6
0
18 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
81
0
0
17 Jul 2024
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao-Li Li
Yining Liu
Na Dong
Sitian Qin
Xiaolin Hu
41
3
0
15 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
51
1
0
12 Jul 2024
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
Jan Kautz
Mamba
47
59
0
10 Jul 2024
AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Rui Jin
Derun Li
Dehui Xiang
Lei Zhang
Hailing Zhou
Fei Shi
Weifang Zhu
Jing Cai
Tao Peng
Xinjian Chen
44
0
0
09 Jul 2024
Dynamic Neural Radiance Field From Defocused Monocular Video
Xianrui Luo
Huiqiang Sun
Juewen Peng
Zhiguo Cao
VGen
54
2
0
08 Jul 2024
Towards Reflected Object Detection: A Benchmark
Zhongtian Wang
You Wu
Hui Zhou
Shuiwang Li
ObjD
34
2
0
08 Jul 2024
CountGD: Multi-Modal Open-World Counting
Niki Amini-Naieni
Tengda Han
Andrew Zisserman
ObjD
64
10
0
05 Jul 2024
Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation
Mengmeng Cui
Kunbo Zhang
Zhenan Sun
ViT
36
0
0
03 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
33
1
0
02 Jul 2024
Multi-View Black-Box Physical Attacks on Infrared Pedestrian Detectors Using Adversarial Infrared Grid
Kalibinuer Tiliwalidi
Chengyin Hu
Weiwen Shi
AAML
36
1
0
01 Jul 2024
GMT: A Robust Global Association Model for Multi-Target Multi-Camera Tracking
Huijie Fan
Tinghui Zhao
Qiang Wang
Baojie Fan
Yandong Tang
Lianqing Liu
51
2
0
01 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
66
2
0
01 Jul 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
72
26
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
59
3
0
28 Jun 2024
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
34
1
0
24 Jun 2024
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Oluwatosin O. Alabi
K. Toe
Zijian Zhou
C. Budd
Nicholas Raison
Miaojing Shi
Tom Kamiel Magda Vercauteren
ISeg
64
1
0
23 Jun 2024
Towards Timely Video Analytics Services at the Network Edge
Xishuo Li
Shan Zhang
Yuejiao Huang
Xiao Ma
Zhiyuan Wang
Hongbin Luo
46
1
0
21 Jun 2024
TraceNet: Segment one thing efficiently
Mingyuan Wu
Zichuan Liu
Haozhen Zheng
Hongpeng Guo
Bo Chen
Xin Lu
Klara Nahrstedt
39
0
0
21 Jun 2024
Autonomous Robotic Drilling System for Mice Cranial Window Creation
Enduo Zhao
M. M. Marinho
K. Harada
21
5
0
20 Jun 2024
Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Chenxi Wang
Hao-Shu Fang
Minghao Gou
Hongjie Fang
Jin Gao
Cewu Lu
55
113
0
17 Jun 2024
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
Jordy Van Landeghem
Subhajit Maity
Ayan Banerjee
Matthew Blaschko
Marie-Francine Moens
Josep Lladós
Sanket Biswas
52
2
0
12 Jun 2024
ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh
R. Tamburo
Shen Zheng
Juan R. Alvarez-Padilla
Hailiang Zhu
Michael Cardei
Nicholas Dunn
Christoph Mertz
Srinivasa G. Narasimhan
54
1
0
11 Jun 2024
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
3DV
MDE
37
7
0
10 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
31
14
0
10 Jun 2024
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Hou-I Liu
Yu-Wen Tseng
Kai-Cheng Chang
Pin-Jyun Wang
Hong-Han Shuai
Wen-Huang Cheng
ViT
ObjD
48
24
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
80
12
0
09 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
45
22
0
06 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
93
1
0
06 Jun 2024
Reconstructing training data from document understanding models
Jérémie Dentan
Arnaud Paran
A. Shabou
AAML
SyDa
54
1
0
05 Jun 2024
GrootVL: Tree Topology is All You Need in State Space Model
Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
Mamba
47
11
0
04 Jun 2024
DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation
Ron Keuth
Lasse Hansen
Maren Balks
Ronja Jäger
Anne-Nele Schröder
Ludger Tüshaus
Mattias P. Heinrich
35
0
0
30 May 2024
YotoR-You Only Transform One Representation
José Ignacio Díaz Villa
P. Loncomilla
Javier Ruiz-del-Solar
ViT
46
0
0
30 May 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
40
0
0
29 May 2024
A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic
Ioanna C. Gogou
Dimitrios A. Koutsomitropoulos
3DH
33
2
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
62
4
0
28 May 2024
A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions
N. Neis
Juergen Beyerer
13
6
0
27 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
50
1
0
25 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Brendan Park
Madeline Janecek
Naser Ezzati-Jivan
Yifeng Li
Ali Emami
40
0
0
25 May 2024
Retro: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning
Khanh-Binh Nguyen
Chae Jung Park
39
0
0
24 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
45
0
23 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once
Theodore Zhao
Yu Gu
Jianwei Yang
Naoto Usuyama
Ho Hin Lee
...
B. Piening
Carlo Bifulco
Mu-Hsin Wei
Hoifung Poon
Sheng Wang
MedIm
41
23
0
21 May 2024
Learning Spatial Similarity Distribution for Few-shot Object Counting
Yuanwu Xu
Feifan Song
Haofeng Zhang
48
3
0
20 May 2024
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
Jialong Guo
Xinghao Chen
Yehui Tang
Yunhe Wang
ViT
49
9
0
19 May 2024
Beyond Traditional Single Object Tracking: A Survey
Omar Abdelaziz
Mohamed Shehata
Mohamed Mohamed
39
0
0
16 May 2024
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
47
12
0
16 May 2024