ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 4,126 papers shown
Title
Prior-enhanced Temporal Action Localization using Subject-aware Spatial
  Attention
Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention
Yifan Liu
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Haoqian Wang
59
0
0
10 Nov 2022
Deep Learning based Computer Vision Methods for Complex Traffic
  Environments Perception: A Review
Deep Learning based Computer Vision Methods for Complex Traffic Environments Perception: A Review
Talha Azfar
Jinlong Li
Hongkai Yu
R. Cheu
Yisheng Lv
Ruimin Ke
43
21
0
09 Nov 2022
Reinforcement Learning with Stepwise Fairness Constraints
Reinforcement Learning with Stepwise Fairness Constraints
Zhun Deng
He Sun
Zhiwei Steven Wu
Linjun Zhang
David C. Parkes
FaML
OffRL
43
11
0
08 Nov 2022
ReLoc: A Restoration-Assisted Framework for Robust Image Tampering
  Localization
ReLoc: A Restoration-Assisted Framework for Robust Image Tampering Localization
Peiyu Zhuang
Haodong Li
Rui Yang
Jiwu Huang
30
6
0
08 Nov 2022
Facial Tic Detection in Untrimmed Videos of Tourette Syndrome Patients
Facial Tic Detection in Untrimmed Videos of Tourette Syndrome Patients
Yu-Ching Tang
Benjamín Béjar
J. K. Essoe
J. McGuire
René Vidal
32
4
0
07 Nov 2022
Exploration of Convolutional Neural Network Architectures for Large
  Region Map Automation
Exploration of Convolutional Neural Network Architectures for Large Region Map Automation
Rostyslav-Mykola Tsenov
Christopher J. Henry
J. Storie
C. Storie
Brent Murray
Mikhail Sokolov
32
2
0
07 Nov 2022
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking
  in Real-Time
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
Haoshu Fang
Jiefeng Li
Hongyang Tang
Chaoshun Xu
Haoyi Zhu
Yuliang Xiu
Yong-Lu Li
Cewu Lu
3DH
54
410
0
07 Nov 2022
Prompter: Utilizing Large Language Model Prompting for a Data Efficient
  Embodied Instruction Following
Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following
Y. Inoue
Hiroki Ohashi
LM&Ro
35
44
0
07 Nov 2022
EdgeVision: Towards Collaborative Video Analytics on Distributed Edges
  for Performance Maximization
EdgeVision: Towards Collaborative Video Analytics on Distributed Edges for Performance Maximization
Guanyu Gao
Yuqian Dong
Ran A. Wang
Xin Zhou
45
8
0
06 Nov 2022
Evaluating Novel Mask-RCNN Architectures for Ear Mask Segmentation
Evaluating Novel Mask-RCNN Architectures for Ear Mask Segmentation
Saurav K. Aryal
Teanna Barrett
Gloria J. Washington
30
2
0
05 Nov 2022
Contrastive Learning for Diverse Disentangled Foreground Generation
Contrastive Learning for Diverse Disentangled Foreground Generation
Yuheng Li
Yijun Li
Jingwan Lu
Eli Shechtman
Yong Jae Lee
Krishna Kumar Singh
43
7
0
04 Nov 2022
UV R-CNN: Stable and Efficient Dense Human Pose Estimation
UV R-CNN: Stable and Efficient Dense Human Pose Estimation
Wenhe Jia
Yilin Zhou
Xuhan Zhu
Mengjie Hu
Chun Liu
Qing-Huang Song
3DH
41
3
0
04 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for
  Vision-Language Models
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLM
VPVLM
54
44
0
04 Nov 2022
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object
  Detection
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection
Huayi Zhou
Fei Jiang
Hongtao Lu
ObjD
48
73
0
04 Nov 2022
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
39
1
0
03 Nov 2022
PolyBuilding: Polygon Transformer for End-to-End Building Extraction
PolyBuilding: Polygon Transformer for End-to-End Building Extraction
Yuan Hu
Zhibin Wang
Zhou Huang
Yu Liu
3DV
ViT
37
9
0
03 Nov 2022
CircleSnake: Instance Segmentation with Circle Representation
CircleSnake: Instance Segmentation with Circle Representation
Ethan H. Nguyen
Haichun Yang
Zuhayr Asad
Ruining Deng
Agnes B. Fogo
Yuankai Huo
22
5
0
02 Nov 2022
Spatial Reasoning for Few-Shot Object Detection
Spatial Reasoning for Few-Shot Object Detection
Geonuk Kim
Ho-Choul Jung
Seong-Whan Lee
ObjD
35
34
0
02 Nov 2022
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary
  Object Detection
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Yanxin Long
Jianhua Han
Runhu Huang
Xu Hang
Yi Zhu
Chunjing Xu
Xiaodan Liang
VLM
ObjD
48
18
0
02 Nov 2022
Beyond Instance Discrimination: Relation-aware Contrastive
  Self-supervised Learning
Beyond Instance Discrimination: Relation-aware Contrastive Self-supervised Learning
Yifei Zhang
Chang-rui Liu
Yu Zhou
Weiping Wang
QiXiang Ye
Xiangyang Ji
SSL
ISeg
BDL
32
7
0
02 Nov 2022
Pixel-Wise Contrastive Distillation
Pixel-Wise Contrastive Distillation
Junqiang Huang
Zichao Guo
60
4
0
01 Nov 2022
Tree Detection and Diameter Estimation Based on Deep Learning
Tree Detection and Diameter Estimation Based on Deep Learning
Vincent Grondin
Jean-Michel Fortin
F. Pomerleau
Philippe Giguère
3DPC
43
29
0
31 Oct 2022
Real-time Mapping of Physical Scene Properties with an Autonomous Robot
  Experimenter
Real-time Mapping of Physical Scene Properties with an Autonomous Robot Experimenter
I. Haughton
Edgar Sucar
A. Mouton
Edward Johns
Andrew J. Davison
22
4
0
31 Oct 2022
Scoliosis Detection using Deep Neural Network
Scoliosis Detection using Deep Neural Network
Y. Nguyen
36
1
0
31 Oct 2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Ç. S. Çoban
Oguzhan Keskin
Jordi Pont-Tuset
Fatma Guney
VOS
40
0
0
30 Oct 2022
SearchTrack: Multiple Object Tracking with Object-Customized Search and
  Motion-Aware Features
SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware Features
Zhong-Min Tsai
Yu-Ju Tsai
Chien-Yao Wang
H. Liao
Y. Lin
Yung-Yu Chuang
VOT
53
0
0
29 Oct 2022
Joint Sub-component Level Segmentation and Classification for Anomaly
  Detection within Dual-Energy X-Ray Security Imagery
Joint Sub-component Level Segmentation and Classification for Anomaly Detection within Dual-Energy X-Ray Security Imagery
Neelanjan Bhowmik
T. Breckon
24
3
0
29 Oct 2022
U-Net-based Models for Skin Lesion Segmentation: More Attention and
  Augmentation
U-Net-based Models for Skin Lesion Segmentation: More Attention and Augmentation
P. M. Kazaj
MohammadHossein Koosheshi
Alireza Shahedi
A. V. Sadr
SSeg
26
7
0
28 Oct 2022
Benchmarking performance of object detection under image distortions in
  an uncontrolled environment
Benchmarking performance of object detection under image distortions in an uncontrolled environment
Ayman Beghdadi
Malik Mallem
Lotfi Beji
65
5
0
28 Oct 2022
Grafting Vision Transformers
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
36
3
0
28 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via
  Differentiable Physics-Based Simulation and Rendering
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
25
24
0
27 Oct 2022
Predicting Visual Attention and Distraction During Visual Search Using
  Convolutional Neural Networks
Predicting Visual Attention and Distraction During Visual Search Using Convolutional Neural Networks
Manoosh Samiei
James J. Clark
27
0
0
27 Oct 2022
End-to-end Tracking with a Multi-query Transformer
End-to-end Tracking with a Multi-query Transformer
Bruno Korbar
Andrew Zisserman
VOT
34
6
0
26 Oct 2022
Search for Concepts: Discovering Visual Concepts Using Direct
  Optimization
Search for Concepts: Discovering Visual Concepts Using Direct Optimization
P. Reddy
Paul Guerrero
Niloy J. Mitra
OCL
26
4
0
25 Oct 2022
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object
  Reconstruction with Self-supervision
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision
Ahmed Tawfik Aboukhadra
J. Malik
Ahmed Elhayek
Nadia Robertini
D. Stricker
3DH
26
12
0
25 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
52
163
0
24 Oct 2022
Gallery Filter Network for Person Search
Gallery Filter Network for Person Search
Lucas Jaffe
A. Zakhor
29
12
0
24 Oct 2022
Holistic Interaction Transformer Network for Action Detection
Holistic Interaction Transformer Network for Action Detection
Gueter Josmy Faure
Min-Hung Chen
S. Lai
42
37
0
23 Oct 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing
  Data
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
93
112
0
23 Oct 2022
Deep Learning in Single-Cell Analysis
Deep Learning in Single-Cell Analysis
Dylan Molho
Jiayuan Ding
Zhaoheng Li
Haifang Wen
Wenzhuo Tang
...
P. Danaher
Robert Yang
Y. Lei
Yuying Xie
Jiliang Tang
43
23
0
22 Oct 2022
Stochastic Adaptive Activation Function
Stochastic Adaptive Activation Function
Kyungsu Lee
Jaeseung Yang
Haeyun Lee
J. Y. Hwang
35
3
0
21 Oct 2022
Monotonic Risk Relationships under Distribution Shifts for Regularized
  Risk Minimization
Monotonic Risk Relationships under Distribution Shifts for Regularized Risk Minimization
Daniel LeJeune
Jiayu Liu
Reinhard Heckel
33
0
0
20 Oct 2022
Object Goal Navigation Based on Semantics and RGB Ego View
Object Goal Navigation Based on Semantics and RGB Ego View
Snehasis Banerjee
Brojeshwar Bhowmick
R. Roychoudhury
20
3
0
20 Oct 2022
Self-Supervised Learning via Maximum Entropy Coding
Self-Supervised Learning via Maximum Entropy Coding
Xin Liu
Zhongdao Wang
Yali Li
Shengjin Wang
SSL
55
41
0
20 Oct 2022
Large-batch Optimization for Dense Visual Predictions
Large-batch Optimization for Dense Visual Predictions
Zeyue Xue
Jianming Liang
Guanglu Song
Zhuofan Zong
Liang Chen
Yu Liu
Ping Luo
VLM
57
9
0
20 Oct 2022
MMRNet: Improving Reliability for Multimodal Object Detection and
  Segmentation for Bin Picking via Multimodal Redundancy
MMRNet: Improving Reliability for Multimodal Object Detection and Segmentation for Bin Picking via Multimodal Redundancy
Yuhao Chen
Hayden Gunraj
E. Z. Zeng
Robbie Meyer
Maximilian Gilles
Alexander Wong
48
1
0
19 Oct 2022
Learning to Discover and Detect Objects
Learning to Discover and Detect Objects
V. Fomenko
Ismail Elezi
Deva Ramanan
Laura Leal-Taixé
Aljosa Osep
ObjD
44
10
0
19 Oct 2022
Visual SLAM: What are the Current Trends and What to Expect?
Visual SLAM: What are the Current Trends and What to Expect?
Ali Tourani
Hriday Bavle
Jose Luis Sanchez-Lopez
Holger Voos
32
81
0
19 Oct 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A
  Comprehensive Survey
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
64
5
0
19 Oct 2022
A Real-Time Wrong-Way Vehicle Detection Based on YOLO and Centroid
  Tracking
A Real-Time Wrong-Way Vehicle Detection Based on YOLO and Centroid Tracking
Zillur Rahman
Amit Mazumder Ami
M. A. Ullah
13
43
0
19 Oct 2022
Previous
123...222324...818283
Next