ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09886
  4. Cited By
SimMIM: A Simple Framework for Masked Image Modeling

SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
ArXivPDFHTML

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown
Title
Rescuing referral failures during automated diagnosis of domain-shifted
  medical images
Rescuing referral failures during automated diagnosis of domain-shifted medical images
Anuj Srivastava
Karm Patel
Pradeep Shenoy
D. Sridharan
OOD
26
0
0
28 Nov 2023
Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for
  Visual Insect Understanding
Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding
Hoang-Quan Nguyen
Thanh-Dat Truong
Xuan-Bac Nguyen
Ashley Dowling
Xin Li
Khoa Luu
VLM
24
19
0
26 Nov 2023
Predicting Gradient is Better: Exploring Self-Supervised Learning for
  SAR ATR with a Joint-Embedding Predictive Architecture
Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture
Wei-Jang Li
Yang Wei
Tianpeng Liu
Yuenan Hou
Yuxuan Li
Zhen Liu
Yongxiang Liu
Li Liu
31
18
0
26 Nov 2023
Understanding Self-Supervised Features for Learning Unsupervised
  Instance Segmentation
Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation
Paul Engstler
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
SSL
25
3
0
24 Nov 2023
Towards Transferable Multi-modal Perception Representation Learning for
  Autonomy: NeRF-Supervised Masked AutoEncoder
Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder
Xiaohao Xu
38
0
0
23 Nov 2023
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
Yuxin Du
Fan Bai
Tiejun Huang
Bo Zhao
VLM
42
38
0
22 Nov 2023
Masked Autoencoders Are Robust Neural Architecture Search Learners
Masked Autoencoders Are Robust Neural Architecture Search Learners
Yiming Hu
Xiangxiang Chu
Bo-Wen Zhang
OOD
40
0
0
20 Nov 2023
Event Camera Data Dense Pre-training
Event Camera Data Dense Pre-training
Yan Yang
Liyuan Pan
Liu Liu
30
4
0
20 Nov 2023
Deep Tensor Network
Deep Tensor Network
Yifan Zhang
32
0
0
18 Nov 2023
Shifting to Machine Supervision: Annotation-Efficient Semi and
  Self-Supervised Learning for Automatic Medical Image Segmentation and
  Classification
Shifting to Machine Supervision: Annotation-Efficient Semi and Self-Supervised Learning for Automatic Medical Image Segmentation and Classification
P. Singh
Raviteja Chukkapalli
Shravan Chaudhari
Luoyao Chen
Mei Chen
Jinqian Pan
Craig Smuda
Jacopo Cirrone
4
7
0
17 Nov 2023
From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning
From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning
Jiansong Zhang
Linlin Shen
Peizhong Liu
SSL
32
0
0
16 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
45
143
0
10 Nov 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings
Window Attention is Bugged: How not to Interpolate Position Embeddings
Daniel Bolya
Chaitanya K. Ryali
Judy Hoffman
Christoph Feichtenhofer
43
10
0
09 Nov 2023
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond
  Examples
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples
Shashanka Venkataramanan
Ewa Kijak
Laurent Amsaleg
Yannis Avrithis
20
4
0
09 Nov 2023
PersonMAE: Person Re-Identification Pre-Training with Masked
  AutoEncoders
PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders
Hezhen Hu
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Lu Yuan
Dong Chen
Houqiang Li
31
3
0
08 Nov 2023
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Zhiyu Zhao
Bingkun Huang
Sen Xing
Gangshan Wu
Yu Qiao
Limin Wang
42
5
0
06 Nov 2023
ProS: Facial Omni-Representation Learning via Prototype-based
  Self-Distillation
ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation
Xing Di
Yiyu Zheng
Xiaoming Liu
Yu Cheng
18
3
0
03 Nov 2023
Concatenated Masked Autoencoders as Spatial-Temporal Learner
Concatenated Masked Autoencoders as Spatial-Temporal Learner
Zhouqiang Jiang
Bowen Wang
Tong Xiang
Zhaofeng Niu
Hong Tang
Guangshun Li
Liangzhi Li
22
2
0
02 Nov 2023
CROMA: Remote Sensing Representations with Contrastive Radar-Optical
  Masked Autoencoders
CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders
A. Fuller
K. Millard
James R. Green
29
60
0
01 Nov 2023
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked
  Autoencoders
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
39
16
0
31 Oct 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan
Xinyu Zhang
Hao Zhou
Jian Wang
Zhongwei Qiu
...
Junyu Han
Errui Ding
Lanfen Lin
Fei Wu
Jingdong Wang
38
18
0
31 Oct 2023
Pre-training with Random Orthogonal Projection Image Modeling
Pre-training with Random Orthogonal Projection Image Modeling
Maryam Haghighat
Peyman Moghadam
Shaheer Mohamed
Piotr Koniusz
VLM
28
8
0
28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote
  Sensing
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
Yi Wang
Hugo Hernández Hernández
C. Albrecht
Xiao Xiang Zhu
45
30
0
28 Oct 2023
FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model for Fault Recognition
Zeren Zhang
Ran Chen
Jinwen Ma
ViT
13
0
0
27 Oct 2023
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
43
11
0
27 Oct 2023
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked
  Fine-tuning
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning
Fengyuan Shi
Limin Wang
ViT
38
0
0
26 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
33
9
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked
  Auto-Encoder
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
21
2
0
25 Oct 2023
SAMCLR: Contrastive pre-training on complex scenes using SAM for view
  sampling
SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling
Benjamin Missaoui
Chongbin Yuan
VLM
31
1
0
23 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim
Sanghyuk Chun
Byeongho Heo
Dongyoon Han
SSL
42
1
0
20 Oct 2023
WeedCLR: Weed Contrastive Learning through Visual Representations with
  Class-Optimized Loss in Long-Tailed Datasets
WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets
Alzayat Saleh
A. Olsen
Jake Wood
B. Philippa
M. R. Azghadi
22
0
0
19 Oct 2023
LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising
LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising
Dayang Wang
Yongshun Xu
Shuo Han
Zhan Wu
Li Zhou
Bahareh Morovati
Hengyong Yu
MedIm
46
2
0
19 Oct 2023
Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating
  Holistic Understanding of Crowd Scenes
Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes
Yifei Qian
Xiaopeng Hong
Zhongliang Guo
Ognjen Arandjelović
Carl R. Donovan
35
8
0
16 Oct 2023
Foundation Ark: Accruing and Reusing Knowledge for Superior and Robust
  Performance
Foundation Ark: Accruing and Reusing Knowledge for Superior and Robust Performance
Dongao Ma
Jiaxuan Pang
Michael B. Gotway
Jianming Liang
MedIm
OOD
25
8
0
14 Oct 2023
Self-supervised Representation Learning From Random Data Projectors
Self-supervised Representation Learning From Random Data Projectors
Yi Sui
Tongzi Wu
Jesse C. Cresswell
Ga Wu
George Stein
Xiao Shi Huang
Xiaochen Zhang
M. Volkovs
32
10
0
11 Oct 2023
Survey on Imbalanced Data, Representation Learning and SEP Forecasting
Survey on Imbalanced Data, Representation Learning and SEP Forecasting
Josias Moukpe
AI4TS
27
0
0
11 Oct 2023
Heuristic Vision Pre-Training with Self-Supervised and Supervised
  Multi-Task Learning
Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning
Zhiming Qian
VLM
SSL
22
0
0
11 Oct 2023
Perceptual MAE for Image Manipulation Localization: A High-level Vision
  Learner Focusing on Low-level Features
Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Xiaochen Ma
Jizhe Zhou
Xiong Xu
Zhuohang Jiang
Chi-Man Pun
31
0
0
10 Oct 2023
Antenna Response Consistency Driven Self-supervised Learning for
  WIFI-based Human Activity Recognition
Antenna Response Consistency Driven Self-supervised Learning for WIFI-based Human Activity Recognition
Ke Xu
Jiangtao Wang
Erik Cambria
Dingchang Zheng
8
0
0
10 Oct 2023
Adversarial Masked Image Inpainting for Robust Detection of Mpox and
  Non-Mpox
Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox
Yubiao Yue
Zhenzhang Li
MedIm
21
0
0
10 Oct 2023
DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised
  Transformers for Weakly Supervised Object Localization
DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised Transformers for Weakly Supervised Object Localization
Shakeeb Murtaza
Soufiane Belharbi
M. Pedersoli
Aydin Sarraf
Eric Granger
WSOL
45
9
0
09 Oct 2023
Adaptive Multi-head Contrastive Learning
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
41
4
0
09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
Enhancing Representations through Heterogeneous Self-Supervised Learning
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Li Liu
Ming-Ming Cheng
SSL
28
2
0
08 Oct 2023
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth
  Estimation
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Muhammad Osama Khan
Junbang Liang
Chun-Kai Wang
Shan Yang
Yu Lou
MDE
49
4
0
06 Oct 2023
TiC: Exploring Vision Transformer in Convolution
TiC: Exploring Vision Transformer in Convolution
Song Zhang
Qingzhong Wang
Jiang Bian
Haoyi Xiong
ViT
34
1
0
06 Oct 2023
3D-Aware Hypothesis & Verification for Generalizable Relative Object
  Pose Estimation
3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation
Chen Zhao
Tong Zhang
Mathieu Salzmann
3DH
28
9
0
05 Oct 2023
Understanding Masked Autoencoders From a Local Contrastive Perspective
Understanding Masked Autoencoders From a Local Contrastive Perspective
Xiaoyu Yue
Lei Bai
Meng Wei
Jiangmiao Pang
Xihui Liu
Luping Zhou
Wanli Ouyang
SSL
67
4
0
03 Oct 2023
Self-distilled Masked Attention guided masked image modeling with noise
  Regularized Teacher (SMART) for medical image analysis
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis
Jue Jiang
Aneesh Rangnekar
Chloe Choi
Harini Veeraraghavan
MedIm
24
0
0
02 Oct 2023
Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
Atsuyuki Miyai
Qing Yu
Go Irie
Kiyoharu Aizawa
OODD
148
6
0
02 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows
Win-Win: Training High-Resolution Vision Transformers from Two Windows
Vincent Leroy
Jérôme Revaud
Thomas Lucas
Philippe Weinzaepfel
ViT
42
2
0
01 Oct 2023
Previous
123...789...151617
Next