SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021

Jianmin Bao

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown

Title
Information Flow in Self-Supervised Learning Zhiyuan Tan Jingqin Yang Weiran Huang Yang Yuan Yifan Zhang SSL 36 14 0 29 Sep 2023
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding Mingming Zhang Qingjie Liu Yunhong Wang 32 5 0 28 Sep 2023
Visual In-Context Learning for Few-Shot Eczema Segmentation Monitirtha Dey S. K. Bhandari Venugopal Vasudevan 22 1 0 28 Sep 2023
Towards Foundation Models Learned from Anatomy in Medical Imaging via Self-Supervision M. Taher Michael B. Gotway Jianming Liang MedIm 19 11 0 27 Sep 2023
$M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding$ M $^{3}$ 3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding Muhammad Abdullah Jamal Omid Mohareri 3DPC 24 1 0 26 Sep 2023
MUTEX: Learning Unified Policies from Multimodal Task Specifications Rutav Shah Roberto Martín-Martín Yuke Zhu OffRL 44 54 0 25 Sep 2023
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning Yang Liu Cheng Chen Can Wang Xulin King Mengyuan Liu 3DPC 40 7 0 25 Sep 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers Guoxi Huang Hongtao Fu A. Bors 34 7 0 25 Sep 2023
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition Haoxuan Qu Xiaofei Hui Yujun Cai Jun Liu 49 10 0 22 Sep 2023
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where Zhi-Yi Chin Chieh-Ming Jiang Ching-Chun Huang Pin-Yu Chen Wei-Chen Chiu SSL 29 0 0 22 Sep 2023
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism Chengcheng Wang Wei He Ying Nie Jianyuan Guo Chuanjian Liu Kai Han Yunhe Wang ObjD 29 207 0 20 Sep 2023
Self-supervised TransUNet for Ultrasound regional segmentation of the distal radius in children Yuyue Zhou Jessica Knight B. Felfeliyan Christopher Keen A. Hareendranathan Jacob L. Jaremko 22 0 0 18 Sep 2023
FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pretraining Shaheera Mohamed Maryam Haghighat Tharindu Fernando Sridha Sridharan Clinton Fookes Peyman Moghadam ViT 30 12 0 18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework Yuelei Wang Ting Zhang Liangjin Zhao Lin Hu Zhechao Wang ... Kaiqiang Chen Xuan Zeng Zhirui Wang Hongqi Wang Xian Sun 24 4 0 16 Sep 2023
Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding Xiaonan Lu Jianlong Yuan Ruigang Niu Yuan Hu Fan Wang 21 1 0 15 Sep 2023
BROW: Better featuRes fOr Whole slide image based on self-distillation Yuan Wu Shaojie Li Zhiqiang Du Wentao Zhu 28 4 0 15 Sep 2023
Virchow: A Million-Slide Digital Pathology Foundation Model Eugene Vorontsov Alican Bozkurt Adam Casson George Shaikovski Michal Zelechowski ... Razik Yousfi Christopher Kanan David Klimstra B. Rothrock Thomas J. Fuchs MedIm 13 82 0 14 Sep 2023
Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images Zhiyun Song Penghui Du Junpeng Yan Keqin Li Jianzhong Shou Maode Lai Yubo Fan Yan Xu 34 7 0 14 Sep 2023
SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection Yiran Qin Chaoqun Wang Zijian Kang Ningning Ma Zhen Li Ruimao Zhang 3DPC 40 10 0 13 Sep 2023
Temporal Action Localization with Enhanced Instant Discriminability Ding Shi Qiong Cao Yujie Zhong Shan An Jian Cheng Haogang Zhu Dacheng Tao 39 9 0 11 Sep 2023
Self-Supervised Transformer with Domain Adaptive Reconstruction for General Face Forgery Video Detection Daichi Zhang Zihao Xiao Jianmin Li Shiming Ge CVBM ViT 30 2 0 09 Sep 2023
BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification T. Fujii Shuhei Tarashima 49 8 0 09 Sep 2023
Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis Nikhil J. Dhinagar Amit Singh Saket Ozarkar Ketaki Buwa Sophia I Thomopoulos ... Corey McMillan Chih-Chien Tsai Jiun-Jie Wang Yih-Ru Wu Paul M. Thompson MedIm 24 2 0 09 Sep 2023
AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation Xiang-Fei Wang Ruizhi Wang Jie Zhou Thomas Lukasiewicz Zhenghua Xu 39 0 0 08 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions Haochen Wang Junsong Fan Yuxi Wang Kaiyou Song Tong Wang Zhaoxiang Zhang 29 19 0 07 Sep 2023
Toward High Quality Facial Representation Learning Yue Wang Jinlong Peng Jiangning Zhang Ran Yi L. Liu Yabiao Wang Chengjie Wang CVBM SSL 52 7 0 07 Sep 2023
Towards Efficient Training with Negative Samples in Visual Tracking Qingmao Wei Bi Zeng Guotian Zeng AAML 37 1 0 06 Sep 2023
Gene-induced Multimodal Pre-training for Image-omic Classification Ting Jin Xingran Xie Renjie Wan Qingli Li Yan Wang AI4CE 47 11 0 06 Sep 2023
Efficient Training for Visual Tracking with Deformable Transformer Qingmao Wei Guotian Zeng Bi Zeng ViT 30 4 0 06 Sep 2023
A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images Blake Vanberlo Jesse Hoey Alexander Wong SSL LM&MA 18 2 0 05 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention Zhuofan Xia Xuran Pan Shiji Song Li Erran Li Gao Huang ViT 29 24 0 04 Sep 2023
Leveraging Self-Supervised Vision Transformers for Segmentation-based Transfer Function Design Dominik Engel Leon Sick Timo Ropinski ViT 19 0 0 04 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling Qi Han Yuxuan Cai Xiangyu Zhang 41 7 0 02 Sep 2023
Masked Transformer for Electrocardiogram Classification Ya Zhou Xiaolin Diao Yanni Huo Yang Liu Xiaohan Fan Wei-Ye Zhao MedIm 30 2 0 31 Aug 2023
CL-MAE: Curriculum-Learned Masked Autoencoders Neelu Madan Nicolae-Cătălin Ristea Kamal Nasrollahi T. Moeslund Radu Tudor Ionescu 19 10 0 31 Aug 2023
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection Yifan Xu Mengdan Zhang Xiaoshan Yang Changsheng Xu ObjD 32 5 0 30 Aug 2023
MetaWeather: Few-Shot Weather-Degraded Image Restoration Youngrae Kim Younggeol Cho Thanh-Tung Nguyen Seunghoon Hong Dongman Lee 27 0 0 28 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition Dimitrios Daskalakis Nikolaos Gkalelis Vasileios Mezaris 38 0 0 24 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding Jiantao Wu Shentong Mo Muhammad Awais Sara Atito Zhenhua Feng J. Kittler VLM 36 4 0 22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding Bingkun Huang Zhiyu Zhao Guozhen Zhang Yu Qiao Limin Wang 39 30 0 21 Aug 2023
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos Xiaoxiao Sheng Zhiqiang Shen Gang Xiao Longguang Wang Y. Guo Hehe Fan 3DPC SSL 33 7 0 18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos Zhiqiang Shen Xiaoxiao Sheng Hehe Fan Longguang Wang Y. Guo Qiong Liu Hao-Kai Wen Xiaoping Zhou 3DPC 20 14 0 18 Aug 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction Chenxin Xu R. Tan Yuhong Tan Siheng Chen Xinchao Wang Yanfeng Wang 3DH 51 20 0 17 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations Zhiming Wang Lin Gu Feng Lu 30 0 0 17 Aug 2023
Learning to In-paint: Domain Adaptive Shape Completion for 3D Organ Segmentation Mingjin Chen Yongkang He Yongyi Lu Zhi-Yi Yang MedIm 19 1 0 17 Aug 2023
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems Momojit Biswas Himanshu Buckchash Dilip K. Prasad SSL 26 7 0 14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks David Junhao Zhang Mutian Xu Chuhui Xue Wenqing Zhang Xiaoguang Han Song Bai Mike Zheng Shou DiffM 56 6 0 13 Aug 2023
Self-Supervised Pre-Training with Contrastive and Masked Autoencoder Methods for Dealing with Small Datasets in Deep Learning for Medical Imaging Daniel Wolf Tristan Payer C. Lisson C. Lisson Meinrad Beer Michael Götz Timo Ropinski 35 15 0 12 Aug 2023
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced Transformer Shengsheng Lin Weiwei Lin Wentai Wu Song Wang Yongxiang Wang AI4TS 40 19 0 09 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation Yi Zhang Chengyi Wu 28 3 0 08 Aug 2023