ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09886
  4. Cited By
SimMIM: A Simple Framework for Masked Image Modeling

SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
ArXivPDFHTML

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown
Title
Not All Prompts Are Secure: A Switchable Backdoor Attack Against
  Pre-trained Vision Transformers
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Shengyuan Yang
Jiawang Bai
Kuofeng Gao
Yong-Liang Yang
Yiming Li
Shu-Tao Xia
AAML
SILM
35
5
0
17 May 2024
Harmonizing Generalization and Personalization in Federated Prompt
  Learning
Harmonizing Generalization and Personalization in Federated Prompt Learning
Tianyu Cui
Hongxia Li
Jingya Wang
Ye-ling Shi
FedML
VLM
34
8
0
16 May 2024
Self-supervised vision-langage alignment of deep learning
  representations for bone X-rays analysis
Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis
A. Englebert
Anne-Sophie Collin
O. Cornu
Christophe De Vleeschouwer
34
1
0
14 May 2024
Efficient Vision-Language Pre-training by Cluster Masking
Efficient Vision-Language Pre-training by Cluster Masking
Zihao Wei
Zixuan Pan
Andrew Owens
VLM
29
8
0
14 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards
  Biomedical Healthcare
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CE
MedIm
LM&MA
71
5
0
10 May 2024
MaskMatch: Boosting Semi-Supervised Learning Through Mask
  Autoencoder-Driven Feature Learning
MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning
Wenjin Zhang
Keyi Li
Sen Yang
Chenyang Gao
Wanzhao Yang
Sifan Yuan
I. Marsic
36
1
0
10 May 2024
Efficient Pretraining Model based on Multi-Scale Local Visual Field
  Feature Reconstruction for PCB CT Image Element Segmentation
Efficient Pretraining Model based on Multi-Scale Local Visual Field Feature Reconstruction for PCB CT Image Element Segmentation
Chen Chen
Kai Qiao
Jie Yang
Jian Chen
Bin Yan
30
1
0
09 May 2024
Class-relevant Patch Embedding Selection for Few-Shot Image
  Classification
Class-relevant Patch Embedding Selection for Few-Shot Image Classification
Weihao Jiang
Haoyang Cui
Kun He
VLM
44
0
0
06 May 2024
Intra-task Mutual Attention based Vision Transformer for Few-Shot
  Learning
Intra-task Mutual Attention based Vision Transformer for Few-Shot Learning
Weihao Jiang
Chang-Shu Liu
Kun He
ViT
64
0
0
06 May 2024
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial
  Representation Learning
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning
Vishal Nedungadi
A. Kariryaa
Stefan Oehmcke
Serge J. Belongie
Christian Igel
Nico Lang
42
25
0
04 May 2024
Self-Supervised Learning for Interventional Image Analytics: Towards
  Robust Device Trackers
Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers
Saahil Islam
Venkatesh N. Murthy
Dominik Neumann
Badhan Kumar Das
Puneet Sharma
Andreas Maier
Dorin Comaniciu
Florin-Cristian Ghesu
34
1
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
39
6
0
02 May 2024
Masked Multi-Query Slot Attention for Unsupervised Object Discovery
Masked Multi-Query Slot Attention for Unsupervised Object Discovery
Rishav Pramanik
José-Fabian Villa-Vásquez
M. Pedersoli
OCL
40
0
0
30 Apr 2024
ConPro: Learning Severity Representation for Medical Images using
  Contrastive Learning and Preference Optimization
ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization
Hong Nguyen
H. Nguyen
Melinda Y. Chang
Hieu H. Pham
Shrikanth Narayanan
Michael Pazzani
27
0
0
29 Apr 2024
Representing Part-Whole Hierarchies in Foundation Models by Learning
  Localizability, Composability, and Decomposability from Anatomy via
  Self-Supervision
Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision
M. Taher
Michael B. Gotway
Jianming Liang
MedIm
31
5
0
24 Apr 2024
MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis
MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis
Jiaxin Zhuang
Linshan Wu
Qiong Wang
V. Vardhanabhuti
Lin Luo
Hao Chen
Hao Chen
57
4
0
24 Apr 2024
HybridFlow: Infusing Continuity into Masked Codebook for Extreme
  Low-Bitrate Image Compression
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression
Lei Lu
Yanyue Xie
Wei Jiang
Wei Wang
Xue Lin
Yanzhi Wang
45
4
0
20 Apr 2024
An Experimental Study on Exploring Strong Lightweight Vision
  Transformers via Masked Image Modeling Pre-Training
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Jin Gao
Shubo Lin
Shaoru Wang
Yutong Kou
Zeming Li
Liang Li
Congxuan Zhang
Xiaoqin Zhang
Yizheng Wang
Weiming Hu
47
1
0
18 Apr 2024
MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training
MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training
Jiayang Li
Junjun Jiang
Pengwei Liang
Jiayi Ma
Liqiang Nie
42
1
0
17 Apr 2024
Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn
  Classification without Labels
Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification without Labels
Amaya Dharmasiri
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
VLM
3DPC
49
1
0
15 Apr 2024
XoFTR: Cross-modal Feature Matching Transformer
XoFTR: Cross-modal Feature Matching Transformer
Önder Tuzcuoglu
Aybora Köksal
Bugra Sofu
Sinan Kalkan
Aydin Alatan
ViT
50
10
0
15 Apr 2024
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model
Han Gu
Haoyu Dong
Jichen Yang
Maciej Mazurowski
MedIm
VLM
80
14
0
15 Apr 2024
Masked Image Modeling as a Framework for Self-Supervised Learning across
  Eye Movements
Masked Image Modeling as a Framework for Self-Supervised Learning across Eye Movements
Robin Weiler
Matthias Brucklacher
C. Pennartz
Sander M. Bohté
38
0
0
12 Apr 2024
OmniSat: Self-Supervised Modality Fusion for Earth Observation
OmniSat: Self-Supervised Modality Fusion for Earth Observation
Guillaume Astruc
Nicolas Gonthier
Clement Mallet
Loic Landrieu
38
25
0
12 Apr 2024
Emerging Property of Masked Token for Effective Pre-training
Emerging Property of Masked Token for Effective Pre-training
Hyesong Choi
Hunsang Lee
Seyoung Joung
Hyejin Park
Jiyeong Kim
Dongbo Min
36
9
0
12 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced
  Pre-training
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
39
9
0
12 Apr 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia
  Sensor Event Analysis
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
36
3
0
12 Apr 2024
Any2Point: Empowering Any-modality Large Models for Efficient 3D
  Understanding
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Yiwen Tang
Ray Zhang
Jiaming Liu
Zoey Guo
Dong Wang
...
Bin Zhao
Shanghang Zhang
Peng Gao
Hongsheng Li
Xuelong Li
40
12
0
11 Apr 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
Jihao Liu
Jinliang Zheng
Yu Liu
Hongsheng Li
VLM
29
3
0
11 Apr 2024
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep
  Stage Classification Using Single-Channel EEG
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG
Cheol-Hui Lee
Hakseung Kim
Hyun-jee Han
Min-Kyung Jung
Byung C. Yoon
Dong-Joo Kim
37
5
0
10 Apr 2024
Unified Physical-Digital Attack Detection Challenge
Unified Physical-Digital Attack Detection Challenge
Haocheng Yuan
Ajian Liu
Junze Zheng
Jun Wan
Jiankang Deng
Sergio Escalera
Hugo Jair Escalante
Isabelle M Guyon
Zhen Lei
AAML
CVBM
35
2
0
09 Apr 2024
Social-MAE: Social Masked Autoencoder for Multi-person Motion
  Representation Learning
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning
Mahsa Ehsanpour
Ian Reid
Hamid Rezatofighi
ViT
34
0
0
08 Apr 2024
D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive
  Segmentation
D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation
Xuan Sun
Zhanfu An
Yuyu Liu
38
0
0
07 Apr 2024
Dissecting Query-Key Interaction in Vision Transformers
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
39
1
0
04 Apr 2024
Foundation Model for Advancing Healthcare: Challenges, Opportunities,
  and Future Directions
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions
Yuting He
Fuxiang Huang
Xinrui Jiang
Yuxiang Nie
Minghao Wang
Jiguang Wang
Hao Chen
LM&MA
AI4CE
71
27
0
04 Apr 2024
Cross-Modal Conditioned Reconstruction for Language-guided Medical Image
  Segmentation
Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Xiaoshuang Huang
Hongxiang Li
Meng Cao
Long Chen
Chenyu You
Dong An
VLM
41
5
0
03 Apr 2024
A Unified Membership Inference Method for Visual Self-supervised Encoder
  via Part-aware Capability
A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
37
6
0
03 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation
  Learning for Neural Radiance Fields
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakahrov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
42
12
0
01 Apr 2024
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
Boran Han
Shuai Zhang
Xingjian Shi
Markus Reichstein
31
22
0
01 Apr 2024
SyncMask: Synchronized Attentional Masking for Fashion-centric
  Vision-Language Pretraining
SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining
Chull Hwan Song
Taebaek Hwang
Jooyoung Yoon
Shunghyun Choi
Yeong Hyeon Gu
23
4
0
01 Apr 2024
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Yang Luo
Zhineng Chen
Peng Zhou
Zuxuan Wu
Xieping Gao
Yu-Gang Jiang
SSL
27
1
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information
  Loss
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
Dongdong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
68
11
0
31 Mar 2024
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
Jiantao Wu
Shentong Mo
Sara Atito
Zhenhua Feng
Josef Kittler
Muhammad Awais
35
3
0
31 Mar 2024
MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck
MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck
Liangjiang Wen
Xiasi Wang
Jianzhuang Liu
Zenglin Xu
28
2
0
28 Mar 2024
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
Alexandre Eymaël
Renaud Vandeghen
A. Cioppa
Silvio Giancola
Guohao Li
Marc Van Droogenbroeck
ViT
43
6
0
26 Mar 2024
Masked Autoencoders are PDE Learners
Masked Autoencoders are PDE Learners
Anthony Y. Zhou
A. Farimani
AI4CE
38
6
0
26 Mar 2024
Adversarially Masked Video Consistency for Unsupervised Domain
  Adaptation
Adversarially Masked Video Consistency for Unsupervised Domain Adaptation
Xiaoyu Zhu
Junwei Liang
Po-Yao Huang
Alex Hauptmann
32
1
0
24 Mar 2024
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal
  Visual Object Tracking
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Xiaojun Hou
Jiazheng Xing
Yijie Qian
Yaowei Guo
Shuo Xin
...
Kai Tang
Mengmeng Wang
Zhengkai Jiang
Liang Liu
Yong-Jin Liu
30
23
0
24 Mar 2024
Once for Both: Single Stage of Importance and Sparsity Search for Vision
  Transformer Compression
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Hancheng Ye
Chong Yu
Peng Ye
Renqiu Xia
Yansong Tang
Jiwen Lu
Tao Chen
Bo-Wen Zhang
53
3
0
23 Mar 2024
Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact
  Subproblem Solver for Training Structured Neural Network
Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural Network
Zih-Syuan Huang
Ching-pei Lee
25
0
0
21 Mar 2024
Previous
123456...151617
Next