ResearchTrend.AI
SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown
Title
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding
Dilxat Muhtar
Xue-liang Zhang
Pengfeng Xiao
Zhenshi Li
Feng-Xue Gu
SSL
45
50
0
19 Apr 2023
DCELANM-Net:Medical Image Segmentation based on Dual Channel Efficient Layer Aggregation Network with Learner
Cheng Lu
Z. Xia
Krzysztof Przystupa
Orest Kochan
J. Su
MedIm
23
10
0
19 Apr 2023
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment
Kai Zhao
Kun Yuan
Ming-Ting Sun
Xingsen Wen
21
20
0
13 Apr 2023
Hard Patches Mining for Masked Image Modeling
Haochen Wang
Kaiyou Song
Junsong Fan
Yuxi Wang
Jin Xie
Zhaoxiang Zhang
37
59
0
12 Apr 2023
Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Li-Na Bao
Longhui Wei
Xiaoyu Qiu
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
42
5
0
12 Apr 2023
MoMo: A shared encoder Model for text, image and multi-Modal representations
Rakesh Chada
Zhao-Heng Zheng
P. Natarajan
ViT
21
4
0
11 Apr 2023
A Billion-scale Foundation Model for Remote Sensing Images
Keumgang Cha
Junghoon Seo
Taekyung Lee
38
64
0
11 Apr 2023
Mask-Based Modeling for Neural Radiance Fields
Ganlin Yang
Guoqiang Wei
Zhizheng Zhang
Yan Lu
Dong Liu
AI4CE
21
1
0
11 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
36
48
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
48
11
0
06 Apr 2023
Disentangled Pre-training for Image Matting
Yan-Da Li
Zilong Huang
Gang Yu
Ling-Hao Chen
Yunchao Wei
Jianbo Jiao
28
0
0
03 Apr 2023
Multi-Modal Representation Learning with Text-Driven Soft Masks
Jaeyoo Park
Bohyung Han
SSL
30
4
0
03 Apr 2023
Mask Hierarchical Features For Self-Supervised Learning
Fenggang Liu
Yangguang Li
Feng Liang
Jilan Xu
Bin Huang
Jing Shao
21
0
0
01 Apr 2023
DIME-FM: DIstilling Multimodal and Efficient Foundation Models
Ximeng Sun
Pengchuan Zhang
Peizhao Zhang
Hardik Shah
Kate Saenko
Xide Xia
VLM
30
20
0
31 Mar 2023
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision Transformers
Zijun Long
Zaiqiao Meng
Gerardo Aragon Camarasa
R. McCreadie
VLM
42
5
0
31 Mar 2023
Whether and When does Endoscopy Domain Pretraining Make Sense?
Dominik Batić
Felix Holm
Ege Özsoy
Tobias Czempiel
Nassir Navab
20
7
0
30 Mar 2023
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Weihua Chen
Xianzhe Xu
Jian Jia
Haowen Luo
Yaohua Wang
F. Wang
Rong Jin
Xiuyu Sun
SSL
39
94
0
30 Mar 2023
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation
Qi-jun Zhao
Ce Zheng
Mengyuan Liu
Pichao Wang
Chong Chen
ViT
25
87
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
32
20
0
30 Mar 2023
PMatch: Paired Masked Image Modeling for Dense Geometric Matching
Shengjie Zhu
Xiaoming Liu
40
24
0
30 Mar 2023
Masked Autoencoders as Image Processors
Huiyu Duan
Wei Shen
Xiongkuo Min
Danyang Tu
Long Teng
Jia Wang
Guangtao Zhai
ViT
38
11
0
30 Mar 2023
Mixed Autoencoder for Self-supervised Visual Representation Learning
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
SSL
30
43
0
30 Mar 2023
ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing
Xiaodan Li
YueFeng Chen
Yao Zhu
Shuhui Wang
Rong Zhang
Hui Xue
34
24
0
30 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
73
329
0
29 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
57
156
0
28 Mar 2023
Mask and Restore: Blind Backdoor Defense at Test Time with Masked Autoencoder
Tao Sun
Lu Pang
Chao Chen
Haibin Ling
AAML
45
9
0
27 Mar 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
27
8
0
26 Mar 2023
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh
Hyeji Hwang
Hee-young Lee
Yongtaek Lim
Geunyoung Jung
Jiyoung Jung
Hosik Choi
Kyungwoo Song
VLM
VPVLM
85
57
0
26 Mar 2023
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu
Xin Wen
Xihui Liu
Hengshuang Zhao
3DPC
122
40
0
24 Mar 2023
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Runsen Xu
Tai Wang
Wenwei Zhang
Runjian Chen
Jinkun Cao
Jiangmiao Pang
Dahua Lin
3DPC
39
30
0
23 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
126
63
0
23 Mar 2023
Masked Image Training for Generalizable Deep Image Denoising
Haoyu Chen
Jinjin Gu
Yihao Liu
Salma Abdel Magid
Chao Dong
Qiong Wang
Hanspeter Pfister
Lei Zhu
27
63
0
23 Mar 2023
Test-time Detection and Repair of Adversarial Samples via Masked Autoencoder
Yun-Yun Tsai
Ju-Chin Chao
Albert Wen
Zhaoyuan Yang
Chengzhi Mao
Tapan Shah
Junfeng Yang
AAML
21
1
0
22 Mar 2023
Correlational Image Modeling for Self-Supervised Visual Pre-Training
Wei Li
Jiahao Xie
Chen Change Loy
SSL
40
10
0
22 Mar 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
43
47
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
42
261
0
20 Mar 2023
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
Jihao Liu
Tai Wang
Boxiao Liu
Qihang Zhang
Yu Liu
Hongsheng Li
38
16
0
20 Mar 2023
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation
Zhaohu Xing
Lei Zhu
Lequan Yu
Zhiheng Xing
Liang Wan
39
8
0
18 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
21
37
0
17 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
30
71
0
17 Mar 2023
MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling
Xuzhe Zhang
Yu-Hsun Wu
S. Entringer
H. Simhan
Jia Guo
...
A. Jackowski
Haifeng Li
J. Posner
Andrew F. Laine
Yun Wang
OOD
20
11
0
16 Mar 2023
Real Face Foundation Representation Learning for Generalized Deepfake Detection
Liang Shi
Jie Zhang
Shiguang Shan
CVBM
43
7
0
15 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
70
61
0
14 Mar 2023
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Tianlin Li
Ying Wang
Ziwei Xuan
Guo-Jun Qi
ViT
48
3
0
14 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
28
1
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
37
1
0
13 Mar 2023
Improving Masked Autoencoders by Learning Where to Mask
Haijia Chen
Wendong Zhang
Yunbo Wang
Xiaokang Yang
SSL
20
20
0
12 Mar 2023
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Ibrahim Ethem Hamamci
Sezgin Er
Enis Simsar
Anjany Sekuboyina
M. Gundogar
B. Stadlinger
A. Mehl
Bjoern H. Menze
DiffM
MedIm
23
26
0
11 Mar 2023
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
Peng Gao
Renrui Zhang
Rongyao Fang
Ziyi Lin
Hongyang Li
Hongsheng Li
Qiao Yu
27
18
0
09 Mar 2023
Masked Image Modeling with Local Multi-Scale Reconstruction
Haoqing Wang
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhiwei Deng
Kai Han
64
46
0
09 Mar 2023