ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 3,768 papers shown
Title
Scalable Image Coding for Humans and Machines Using Feature Fusion
  Network
Scalable Image Coding for Humans and Machines Using Feature Fusion Network
Takahiro Shindo
Taiju Watanabe
Yui Tatsumi
Hiroshi Watanabe
34
5
0
15 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
71
0
0
15 May 2024
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving
Abdul Hannan Khan
Syed Tahseen Raza Rizvi
Dheeraj Varma Chittari Macharavtu
Andreas Dengel
41
0
0
13 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
57
0
0
11 May 2024
Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning
  Process
Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process
Tong Xiao
Jia-Yin Liu
Zhenya Huang
Jinze Wu
Jing Sha
Shijin Wang
Enhong Chen
AI4CE
42
3
0
10 May 2024
DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image
  Segmentation Incorporating Feature Similarity and Spatial Continuity
DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image Segmentation Incorporating Feature Similarity and Spatial Continuity
Boujemaa Guermazi
Naimul Khan
30
2
0
09 May 2024
Unsupervised Skin Feature Tracking with Deep Neural Networks
Unsupervised Skin Feature Tracking with Deep Neural Networks
J. Chang
Torbjörn E. M. Nordling
34
0
0
08 May 2024
Exploring Text-based Realistic Building Facades Editing Applicaiton
Exploring Text-based Realistic Building Facades Editing Applicaiton
Jing Wang
Xin Zhang
AI4CE
42
1
0
05 May 2024
Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic
  Segmentation
Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation
G. Abati
J. C. V. Soares
V. S. Medeiros
M. Meggiolaro
Claudio Semini
34
3
0
03 May 2024
Sports Analysis and VR Viewing System Based on Player Tracking and Pose
  Estimation with Multimodal and Multiview Sensors
Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors
Wenxuan Guo
Zhiyu Pan
Ziheng Xi
Alapati Tuerxun
Jianjiang Feng
Jie Zhou
29
2
0
02 May 2024
CrossMPT: Cross-attention Message-Passing Transformer for Error
  Correcting Codes
CrossMPT: Cross-attention Message-Passing Transformer for Error Correcting Codes
Seong-Joon Park
Heeyoul Kwak
Sang-Hyo Kim
Yongjune Kim
Jong-Seon No
31
4
0
02 May 2024
UniFS: Universal Few-shot Instance Perception with Point Representations
UniFS: Universal Few-shot Instance Perception with Point Representations
Sheng Jin
Ruijie Yao
Lumin Xu
Wentao Liu
Chao Qian
Ji Wu
Ping Luo
50
2
0
30 Apr 2024
Audio-Visual Traffic Light State Detection for Urban Robots
Audio-Visual Traffic Light State Detection for Urban Robots
Sagar Gupta
Akansel Cosgun
21
0
0
30 Apr 2024
Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and
  Semi-supervised Training
Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training
Xingyu Song
Zhan Li
Shi Chen
K. Demachi
3DH
33
3
0
30 Apr 2024
C2FDrone: Coarse-to-Fine Drone-to-Drone Detection using Vision
  Transformer Networks
C2FDrone: Coarse-to-Fine Drone-to-Drone Detection using Vision Transformer Networks
Sairam VC Rebbapragada
Pranoy Panda
Vineeth N. Balasubramanian
ViT
46
5
0
30 Apr 2024
VIEW: Visual Imitation Learning with Waypoints
VIEW: Visual Imitation Learning with Waypoints
Ananth Jonnavittula
Sagar Parekh
Dylan P. Losey
SSL
91
10
0
27 Apr 2024
A Hybrid Approach for Document Layout Analysis in Document images
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
37
5
0
27 Apr 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
30
1
0
26 Apr 2024
Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer
  Learning for Skin Disease Classification in Long-Tail Distribution
Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution
Zeynep Özdemir
H. Keles
Ö. Ö. Tanriöver
53
0
0
25 Apr 2024
Self-Balanced R-CNN for Instance Segmentation
Self-Balanced R-CNN for Instance Segmentation
L. Rossi
Akbar Karimi
Andrea Prati
ISeg
SSeg
50
9
0
25 Apr 2024
Boosting Architectural Generation via Prompts: Report
Boosting Architectural Generation via Prompts: Report
Xin Zhang
Wenwen Liu
AI4CE
40
1
0
24 Apr 2024
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster
  Pre-training on Web-scale Image-Text Data
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Sachin Mehta
Maxwell Horton
Fartash Faghri
Mohammad Hossein Sekhavat
Mahyar Najibi
Mehrdad Farajtabar
Oncel Tuzel
Mohammad Rastegari
VLM
CLIP
49
6
0
24 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
Ye Zhang
Yifeng Wang
Zijie Fang
Hao Bian
Linghan Cai
Ziyue Wang
Yongbing Zhang
43
4
0
23 Apr 2024
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Michal Nazarczuk
Jan Kristof Behrens
Karla Stepanova
Matej Hoffmann
K. Mikolajczyk
LM&Ro
LRM
60
1
0
23 Apr 2024
FisheyeDetNet: 360° Surround view Fisheye Camera based Object
  Detection System for Autonomous Driving
FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving
Ganesh Sistu
S. Yogamani
49
0
0
20 Apr 2024
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and
  Drawer Manipulation in Point Clouds
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds
Oliver Lemke
Z. Bauer
René Zurbrugg
Marc Pollefeys
Francis Engelmann
Hermann Blum
3DPC
29
11
0
18 Apr 2024
Unifying Global and Local Scene Entities Modelling for Precise Action
  Spotting
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
38
4
0
15 Apr 2024
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation
  in Operating Rooms
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo
Manxi Lin
Jialun Pei
He Tang
Yueming Jin
Pheng-Ann Heng
42
2
0
14 Apr 2024
Self-Supervised Multi-Object Tracking with Path Consistency
Self-Supervised Multi-Object Tracking with Path Consistency
Zijia Lu
Bing Shuai
Yanbei Chen
Zhenlin Xu
Davide Modolo
VOT
49
10
0
08 Apr 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
51
1
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
71
2
0
04 Apr 2024
Unsegment Anything by Simulating Deformation
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
38
4
0
03 Apr 2024
Resource-Aware Collaborative Monte Carlo Localization with Distribution
  Compression
Resource-Aware Collaborative Monte Carlo Localization with Distribution Compression
Nicky Zimmerman
Alessandro Giusti
Jérôme Guzzi
36
1
0
02 Apr 2024
Task Integration Distillation for Object Detectors
Task Integration Distillation for Object Detectors
Hai Su
ZhenWen Jian
Songsen Yu
46
1
0
02 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
28
2
0
02 Apr 2024
Instance-Aware Group Quantization for Vision Transformers
Instance-Aware Group Quantization for Vision Transformers
Jaehyeon Moon
Dohyung Kim
Junyong Cheon
Bumsub Ham
MQ
ViT
34
7
0
01 Apr 2024
Deep Instruction Tuning for Segment Anything Model
Deep Instruction Tuning for Segment Anything Model
Xiaorui Huang
Gen Luo
Chaoyang Zhu
Bo Tong
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
VLM
57
1
0
31 Mar 2024
VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly
  Supervised 3D Object Detection
VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection
Zihua Liu
Hiroki Sakuma
Masatoshi Okutomi
51
3
0
29 Mar 2024
FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint
  Textual and Visual Clues
FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues
Shuang Li
Jiahua Wang
Lijie Wen
LRM
33
0
0
29 Mar 2024
Efficient Modulation for Vision Networks
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
45
17
0
29 Mar 2024
Illicit object detection in X-ray images using Vision Transformers
Illicit object detection in X-ray images using Vision Transformers
Jorgen Cani
Ioannis Mademlis
Adamantia Anna Rebolledo Chrysochoou
Georgios Th. Papadopoulos
ViT
38
2
0
27 Mar 2024
Tiny Models are the Computational Saver for Large Models
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
49
2
0
26 Mar 2024
PECI-Net: Bolus segmentation from video fluoroscopic swallowing study
  images using preprocessing ensemble and cascaded inference
PECI-Net: Bolus segmentation from video fluoroscopic swallowing study images using preprocessing ensemble and cascaded inference
Dougho Park
Younghun Kim
Harim Kang
Junmyeoung Lee
Jinyoung Choi
Taeyeon Kim
Sangeok Lee
Seokil Son
Minsol Kim
Injung Kim
26
2
0
21 Mar 2024
Mask-based Invisible Backdoor Attacks on Object Detection
Mask-based Invisible Backdoor Attacks on Object Detection
Jeongjin Shin
AAML
33
0
0
20 Mar 2024
Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across
  Varied Bowl Configurations and Food Types
Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types
Rui Liu
Amisha Bhaskar
Pratap Tokekar
40
3
0
19 Mar 2024
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile
  Industrial Anomaly Detection
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection
Chengjie Wang
Wenbing Zhu
Bin-Bin Gao
Zhenye Gan
Jianning Zhang
Zhihao Gu
Shuguang Qian
Mingang Chen
Lizhuang Ma
55
46
0
19 Mar 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
36
15
0
18 Mar 2024
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Zhuoyuan Li
Zikun Yuan
Li Li
Dong Liu
Xiaohu Tang
Feng Wu
VOS
39
8
0
18 Mar 2024
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal
  Instance Segmentation
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation
Minh-Triet Tran
Winston Bounsavy
Khoa T. Vo
Anh Nguyen
Tri Minh Nguyen
Ngan Le
ViT
32
2
0
18 Mar 2024
Previous
123...567...747576
Next