ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09886
  4. Cited By
SimMIM: A Simple Framework for Masked Image Modeling

SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
ArXivPDFHTML

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown
Title
Information Flow in Self-Supervised Learning
Information Flow in Self-Supervised Learning
Zhiyuan Tan
Jingqin Yang
Weiran Huang
Yang Yuan
Yifan Zhang
SSL
36
14
0
29 Sep 2023
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image
  Understanding
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
Mingming Zhang
Qingjie Liu
Yunhong Wang
32
5
0
28 Sep 2023
Visual In-Context Learning for Few-Shot Eczema Segmentation
Visual In-Context Learning for Few-Shot Eczema Segmentation
Monitirtha Dey
S. K. Bhandari
Venugopal Vasudevan
22
1
0
28 Sep 2023
Towards Foundation Models Learned from Anatomy in Medical Imaging via
  Self-Supervision
Towards Foundation Models Learned from Anatomy in Medical Imaging via Self-Supervision
M. Taher
Michael B. Gotway
Jianming Liang
MedIm
19
11
0
27 Sep 2023
M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for
  2D image and video understanding
M3^{3}33D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
24
1
0
26 Sep 2023
MUTEX: Learning Unified Policies from Multimodal Task Specifications
MUTEX: Learning Unified Policies from Multimodal Task Specifications
Rutav Shah
Roberto Martín-Martín
Yuke Zhu
OffRL
44
54
0
25 Sep 2023
Regress Before Construct: Regress Autoencoder for Point Cloud
  Self-supervised Learning
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning
Yang Liu
Cheng Chen
Can Wang
Xulin King
Mengyuan Liu
3DPC
40
7
0
25 Sep 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Guoxi Huang
Hongtao Fu
A. Bors
34
7
0
25 Sep 2023
LMC: Large Model Collaboration with Cross-assessment for Training-Free
  Open-Set Object Recognition
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition
Haoxuan Qu
Xiaofei Hui
Yujun Cai
Jun Liu
49
10
0
22 Sep 2023
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and
  Saliency Tells You Where
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where
Zhi-Yi Chin
Chieh-Ming Jiang
Ching-Chun Huang
Pin-Yu Chen
Wei-Chen Chiu
SSL
29
0
0
22 Sep 2023
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism
Chengcheng Wang
Wei He
Ying Nie
Jianyuan Guo
Chuanjian Liu
Kai Han
Yunhe Wang
ObjD
29
207
0
20 Sep 2023
Self-supervised TransUNet for Ultrasound regional segmentation of the
  distal radius in children
Self-supervised TransUNet for Ultrasound regional segmentation of the distal radius in children
Yuyue Zhou
Jessica Knight
B. Felfeliyan
Christopher Keen
A. Hareendranathan
Jacob L. Jaremko
22
0
0
18 Sep 2023
FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised
  Pretraining
FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pretraining
Shaheera Mohamed
Maryam Haghighat
Tharindu Fernando
Sridha Sridharan
Clinton Fookes
Peyman Moghadam
ViT
30
12
0
18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with
  CNN-Transformer Hybrid Framework
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
24
4
0
16 Sep 2023
Viewpoint Integration and Registration with Vision Language Foundation
  Model for Image Change Understanding
Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding
Xiaonan Lu
Jianlong Yuan
Ruigang Niu
Yuan Hu
Fan Wang
21
1
0
15 Sep 2023
BROW: Better featuRes fOr Whole slide image based on self-distillation
BROW: Better featuRes fOr Whole slide image based on self-distillation
Yuan Wu
Shaojie Li
Zhiqiang Du
Wentao Zhu
28
4
0
15 Sep 2023
Virchow: A Million-Slide Digital Pathology Foundation Model
Virchow: A Million-Slide Digital Pathology Foundation Model
Eugene Vorontsov
Alican Bozkurt
Adam Casson
George Shaikovski
Michal Zelechowski
...
Razik Yousfi
Christopher Kanan
David Klimstra
B. Rothrock
Thomas J. Fuchs
MedIm
13
82
0
14 Sep 2023
Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image
  Translation for Histopathology Images
Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images
Zhiyun Song
Penghui Du
Junpeng Yan
Keqin Li
Jianzhong Shou
Maode Lai
Yubo Fan
Yan Xu
34
7
0
14 Sep 2023
SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Yiran Qin
Chaoqun Wang
Zijian Kang
Ningning Ma
Zhen Li
Ruimao Zhang
3DPC
40
10
0
13 Sep 2023
Temporal Action Localization with Enhanced Instant Discriminability
Temporal Action Localization with Enhanced Instant Discriminability
Ding Shi
Qiong Cao
Yujie Zhong
Shan An
Jian Cheng
Haogang Zhu
Dacheng Tao
39
9
0
11 Sep 2023
Self-Supervised Transformer with Domain Adaptive Reconstruction for
  General Face Forgery Video Detection
Self-Supervised Transformer with Domain Adaptive Reconstruction for General Face Forgery Video Detection
Daichi Zhang
Zihao Xiao
Jianmin Li
Shiming Ge
CVBM
ViT
30
2
0
09 Sep 2023
BiLMa: Bidirectional Local-Matching for Text-based Person
  Re-identification
BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
T. Fujii
Shuhei Tarashima
49
8
0
09 Sep 2023
Video and Synthetic MRI Pre-training of 3D Vision Architectures for
  Neuroimage Analysis
Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis
Nikhil J. Dhinagar
Amit Singh
Saket Ozarkar
Ketaki Buwa
Sophia I Thomopoulos
...
Corey McMillan
Chih-Chien Tsai
Jiun-Jie Wang
Yih-Ru Wu
Paul M. Thompson
MedIm
24
2
0
09 Sep 2023
AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image
  Segmentation
AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation
Xiang-Fei Wang
Ruizhi Wang
Jie Zhou
Thomas Lukasiewicz
Zhenghua Xu
39
0
0
08 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped
  Positions
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
29
19
0
07 Sep 2023
Toward High Quality Facial Representation Learning
Toward High Quality Facial Representation Learning
Yue Wang
Jinlong Peng
Jiangning Zhang
Ran Yi
L. Liu
Yabiao Wang
Chengjie Wang
CVBM
SSL
52
7
0
07 Sep 2023
Towards Efficient Training with Negative Samples in Visual Tracking
Towards Efficient Training with Negative Samples in Visual Tracking
Qingmao Wei
Bi Zeng
Guotian Zeng
AAML
37
1
0
06 Sep 2023
Gene-induced Multimodal Pre-training for Image-omic Classification
Gene-induced Multimodal Pre-training for Image-omic Classification
Ting Jin
Xingran Xie
Renjie Wan
Qingli Li
Yan Wang
AI4CE
47
11
0
06 Sep 2023
Efficient Training for Visual Tracking with Deformable Transformer
Efficient Training for Visual Tracking with Deformable Transformer
Qingmao Wei
Guotian Zeng
Bi Zeng
ViT
30
4
0
06 Sep 2023
A Survey of the Impact of Self-Supervised Pretraining for Diagnostic
  Tasks with Radiological Images
A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images
Blake Vanberlo
Jesse Hoey
Alexander Wong
SSL
LM&MA
18
2
0
05 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
29
24
0
04 Sep 2023
Leveraging Self-Supervised Vision Transformers for Segmentation-based
  Transfer Function Design
Leveraging Self-Supervised Vision Transformers for Segmentation-based Transfer Function Design
Dominik Engel
Leon Sick
Timo Ropinski
ViT
19
0
0
04 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image
  Modeling
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Qi Han
Yuxuan Cai
Xiangyu Zhang
41
7
0
02 Sep 2023
Masked Transformer for Electrocardiogram Classification
Masked Transformer for Electrocardiogram Classification
Ya Zhou
Xiaolin Diao
Yanni Huo
Yang Liu
Xiaohan Fan
Wei-Ye Zhao
MedIm
30
2
0
31 Aug 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
CL-MAE: Curriculum-Learned Masked Autoencoders
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
19
10
0
31 Aug 2023
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object
  Detection
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Yifan Xu
Mengdan Zhang
Xiaoshan Yang
Changsheng Xu
ObjD
32
5
0
30 Aug 2023
MetaWeather: Few-Shot Weather-Degraded Image Restoration
MetaWeather: Few-Shot Weather-Degraded Image Restoration
Youngrae Kim
Younggeol Cho
Thanh-Tung Nguyen
Seunghoon Hong
Dongman Lee
27
0
0
28 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised
  Pre-training of a Graph Attention Network Block for Bottom-up Video Event
  Recognition
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
38
0
0
24 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic
  Understanding
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
36
4
0
22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
39
30
0
21 Aug 2023
Point Contrastive Prediction with Semantic Clustering for
  Self-Supervised Learning on Point Cloud Videos
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos
Xiaoxiao Sheng
Zhiqiang Shen
Gang Xiao
Longguang Wang
Y. Guo
Hehe Fan
3DPC
SSL
33
7
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning
  on Point Cloud Videos
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Y. Guo
Qiong Liu
Hao-Kai Wen
Xiaoping Zhou
3DPC
20
14
0
18 Aug 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
Chenxin Xu
R. Tan
Yuhong Tan
Siheng Chen
Xinchao Wang
Yanfeng Wang
3DH
51
20
0
17 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
Zhiming Wang
Lin Gu
Feng Lu
30
0
0
17 Aug 2023
Learning to In-paint: Domain Adaptive Shape Completion for 3D Organ
  Segmentation
Learning to In-paint: Domain Adaptive Shape Completion for 3D Organ Segmentation
Mingjin Chen
Yongkang He
Yongyi Lu
Zhi-Yi Yang
MedIm
19
1
0
17 Aug 2023
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based
  Unsupervised Representation Learning Problems
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems
Momojit Biswas
Himanshu Buckchash
Dilip K. Prasad
SSL
26
7
0
14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images
  with Free Attention Masks
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang
Mutian Xu
Chuhui Xue
Wenqing Zhang
Xiaoguang Han
Song Bai
Mike Zheng Shou
DiffM
56
6
0
13 Aug 2023
Self-Supervised Pre-Training with Contrastive and Masked Autoencoder
  Methods for Dealing with Small Datasets in Deep Learning for Medical Imaging
Self-Supervised Pre-Training with Contrastive and Masked Autoencoder Methods for Dealing with Small Datasets in Deep Learning for Medical Imaging
Daniel Wolf
Tristan Payer
C. Lisson
C. Lisson
Meinrad Beer
Michael Götz
Timo Ropinski
35
15
0
12 Aug 2023
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced
  Transformer
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced Transformer
Shengsheng Lin
Weiwei Lin
Wentai Wu
Song Wang
Yongxiang Wang
AI4TS
40
19
0
09 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Yi Zhang
Chengyi Wu
28
3
0
08 Aug 2023
Previous
123...8910...151617
Next