SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021

Jianmin Bao

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown

Title
Rescuing referral failures during automated diagnosis of domain-shifted medical images Anuj Srivastava Karm Patel Pradeep Shenoy D. Sridharan OOD 26 0 0 28 Nov 2023
Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding Hoang-Quan Nguyen Thanh-Dat Truong Xuan-Bac Nguyen Ashley Dowling Xin Li Khoa Luu VLM 24 19 0 26 Nov 2023
Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture Wei-Jang Li Yang Wei Tianpeng Liu Yuenan Hou Yuxuan Li Zhen Liu Yongxiang Liu Li Liu 31 18 0 26 Nov 2023
Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation Paul Engstler Luke Melas-Kyriazi Christian Rupprecht Iro Laina SSL 25 3 0 24 Nov 2023
Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder Xiaohao Xu 38 0 0 23 Nov 2023
SegVol: Universal and Interactive Volumetric Medical Image Segmentation Yuxin Du Fan Bai Tiejun Huang Bo Zhao VLM 42 38 0 22 Nov 2023
Masked Autoencoders Are Robust Neural Architecture Search Learners Yiming Hu Xiangxiang Chu Bo-Wen Zhang OOD 40 0 0 20 Nov 2023
Event Camera Data Dense Pre-training Yan Yang Liyuan Pan Liu Liu 30 4 0 20 Nov 2023
Deep Tensor Network Yifan Zhang 32 0 0 18 Nov 2023
Shifting to Machine Supervision: Annotation-Efficient Semi and Self-Supervised Learning for Automatic Medical Image Segmentation and Classification P. Singh Raviteja Chukkapalli Shravan Chaudhari Luoyao Chen Mei Chen Jinqian Pan Craig Smuda Jacopo Cirrone 4 7 0 17 Nov 2023
From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning Jiansong Zhang Linlin Shen Peizhong Liu SSL 32 0 0 16 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Bin Xiao Haiping Wu Weijian Xu Xiyang Dai Houdong Hu Yumao Lu Michael Zeng Ce Liu Lu Yuan VLM 45 143 0 10 Nov 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings Daniel Bolya Chaitanya K. Ryali Judy Hoffman Christoph Feichtenhofer 43 10 0 09 Nov 2023
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples Shashanka Venkataramanan Ewa Kijak Laurent Amsaleg Yannis Avrithis 20 4 0 09 Nov 2023
PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders Hezhen Hu Xiaoyi Dong Jianmin Bao Dongdong Chen Lu Yuan Dong Chen Houqiang Li 31 3 0 08 Nov 2023
Asymmetric Masked Distillation for Pre-Training Small Foundation Models Zhiyu Zhao Bingkun Huang Sen Xing Gangshan Wu Yu Qiao Limin Wang 42 5 0 06 Nov 2023
ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation Xing Di Yiyu Zheng Xiaoming Liu Yu Cheng 18 3 0 03 Nov 2023
Concatenated Masked Autoencoders as Spatial-Temporal Learner Zhouqiang Jiang Bowen Wang Tong Xiang Zhaofeng Niu Hong Tang Guangshun Li Liangzhi Li 22 2 0 02 Nov 2023
CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders A. Fuller K. Millard James R. Green 29 60 0 01 Nov 2023
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders Srijan Das Tanmay Jain Dominick Reilly P. Balaji Soumyajit Karmakar Shyam Marjit Xiang Li Abhijit Das Michael S. Ryoo 39 16 0 31 Oct 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception Junkun Yuan Xinyu Zhang Hao Zhou Jian Wang Zhongwei Qiu ... Junyu Han Errui Ding Lanfen Lin Fei Wu Jingdong Wang 38 18 0 31 Oct 2023
Pre-training with Random Orthogonal Projection Image Modeling Maryam Haghighat Peyman Moghadam Shaheer Mohamed Piotr Koniusz VLM 28 8 0 28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing Yi Wang Hugo Hernández Hernández C. Albrecht Xiao Xiang Zhu 45 30 0 28 Oct 2023
FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model for Fault Recognition Zeren Zhang Ran Chen Jinwen Ma ViT 13 0 0 27 Oct 2023
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation Mengcheng Lan Xinjiang Wang Yiping Ke Jiaxing Xu Xue Jiang Wayne Zhang 43 11 0 27 Oct 2023
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning Fengyuan Shi Limin Wang ViT 38 0 0 26 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA Asmar Nadeem Adrian Hilton R. Dawes Graham A. Thomas A. Mustafa 33 9 0 25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder Huiwon Jang Jihoon Tack Daewon Choi Jongheon Jeong Jinwoo Shin 21 2 0 25 Oct 2023
SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling Benjamin Missaoui Chongbin Yuan VLM 31 1 0 23 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners Taekyung Kim Sanghyuk Chun Byeongho Heo Dongyoon Han SSL 42 1 0 20 Oct 2023
WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets Alzayat Saleh A. Olsen Jake Wood B. Philippa M. R. Azghadi 22 0 0 19 Oct 2023
LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising Dayang Wang Yongshun Xu Shuo Han Zhan Wu Li Zhou Bahareh Morovati Hengyong Yu MedIm 46 2 0 19 Oct 2023
Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes Yifei Qian Xiaopeng Hong Zhongliang Guo Ognjen Arandjelović Carl R. Donovan 35 8 0 16 Oct 2023
Foundation Ark: Accruing and Reusing Knowledge for Superior and Robust Performance Dongao Ma Jiaxuan Pang Michael B. Gotway Jianming Liang MedIm OOD 25 8 0 14 Oct 2023
Self-supervised Representation Learning From Random Data Projectors Yi Sui Tongzi Wu Jesse C. Cresswell Ga Wu George Stein Xiao Shi Huang Xiaochen Zhang M. Volkovs 32 10 0 11 Oct 2023
Survey on Imbalanced Data, Representation Learning and SEP Forecasting Josias Moukpe AI4TS 27 0 0 11 Oct 2023
Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning Zhiming Qian VLM SSL 22 0 0 11 Oct 2023
Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features Xiaochen Ma Jizhe Zhou Xiong Xu Zhuohang Jiang Chi-Man Pun 31 0 0 10 Oct 2023
Antenna Response Consistency Driven Self-supervised Learning for WIFI-based Human Activity Recognition Ke Xu Jiangtao Wang Erik Cambria Dingchang Zheng 8 0 0 10 Oct 2023
Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox Yubiao Yue Zhenzhang Li MedIm 21 0 0 10 Oct 2023
DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised Transformers for Weakly Supervised Object Localization Shakeeb Murtaza Soufiane Belharbi M. Pedersoli Aydin Sarraf Eric Granger WSOL 45 9 0 09 Oct 2023
Adaptive Multi-head Contrastive Learning Lei Wang Piotr Koniusz Tom Gedeon Liang Zheng 41 4 0 09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning Zhongyu Li Bo-Wen Yin Yongxiang Liu Li Liu Ming-Ming Cheng SSL 28 2 0 08 Oct 2023
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation Muhammad Osama Khan Junbang Liang Chun-Kai Wang Shan Yang Yu Lou MDE 49 4 0 06 Oct 2023
TiC: Exploring Vision Transformer in Convolution Song Zhang Qingzhong Wang Jiang Bian Haoyi Xiong ViT 34 1 0 06 Oct 2023
3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation Chen Zhao Tong Zhang Mathieu Salzmann 3DH 28 9 0 05 Oct 2023
Understanding Masked Autoencoders From a Local Contrastive Perspective Xiaoyu Yue Lei Bai Meng Wei Jiangmiao Pang Xihui Liu Luping Zhou Wanli Ouyang SSL 67 4 0 03 Oct 2023
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis Jue Jiang Aneesh Rangnekar Chloe Choi Harini Veeraraghavan MedIm 24 0 0 02 Oct 2023
Can Pre-trained Networks Detect Familiar Out-of-Distribution Data? Atsuyuki Miyai Qing Yu Go Irie Kiyoharu Aizawa OODD 148 6 0 02 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows Vincent Leroy Jérôme Revaud Thomas Lucas Philippe Weinzaepfel ViT 42 2 0 01 Oct 2023