Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09886
Cited By
SimMIM: A Simple Framework for Masked Image Modeling
18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SimMIM: A Simple Framework for Masked Image Modeling"
50 / 849 papers shown
Title
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding
Dilxat Muhtar
Xue-liang Zhang
Pengfeng Xiao
Zhenshi Li
Feng-Xue Gu
SSL
45
50
0
19 Apr 2023
DCELANM-Net:Medical Image Segmentation based on Dual Channel Efficient Layer Aggregation Network with Learner
Cheng Lu
Z. Xia
Krzysztof Przystupa
Orest Kochan
J. Su
MedIm
23
10
0
19 Apr 2023
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment
Kai Zhao
Kun Yuan
Ming-Ting Sun
Xingsen Wen
21
20
0
13 Apr 2023
Hard Patches Mining for Masked Image Modeling
Haochen Wang
Kaiyou Song
Junsong Fan
Yuxi Wang
Jin Xie
Zhaoxiang Zhang
37
59
0
12 Apr 2023
Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Li-Na Bao
Longhui Wei
Xiaoyu Qiu
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
42
5
0
12 Apr 2023
MoMo: A shared encoder Model for text, image and multi-Modal representations
Rakesh Chada
Zhao-Heng Zheng
P. Natarajan
ViT
21
4
0
11 Apr 2023
A Billion-scale Foundation Model for Remote Sensing Images
Keumgang Cha
Junghoon Seo
Taekyung Lee
38
64
0
11 Apr 2023
Mask-Based Modeling for Neural Radiance Fields
Ganlin Yang
Guoqiang Wei
Zhizheng Zhang
Yan Lu
Dong Liu
AI4CE
21
1
0
11 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
36
48
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
48
11
0
06 Apr 2023
Disentangled Pre-training for Image Matting
Yan-Da Li
Zilong Huang
Gang Yu
Ling-Hao Chen
Yunchao Wei
Jianbo Jiao
28
0
0
03 Apr 2023
Multi-Modal Representation Learning with Text-Driven Soft Masks
Jaeyoo Park
Bohyung Han
SSL
30
4
0
03 Apr 2023
Mask Hierarchical Features For Self-Supervised Learning
Fenggang Liu
Yangguang Li
Feng Liang
Jilan Xu
Bin Huang
Jing Shao
21
0
0
01 Apr 2023
DIME-FM: DIstilling Multimodal and Efficient Foundation Models
Ximeng Sun
Pengchuan Zhang
Peizhao Zhang
Hardik Shah
Kate Saenko
Xide Xia
VLM
30
20
0
31 Mar 2023
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision Transformers
Zijun Long
Zaiqiao Meng
Gerardo Aragon Camarasa
R. McCreadie
VLM
42
5
0
31 Mar 2023
Whether and When does Endoscopy Domain Pretraining Make Sense?
Dominik Batić
Felix Holm
Ege Özsoy
Tobias Czempiel
Nassir Navab
20
7
0
30 Mar 2023
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Weihua Chen
Xianzhe Xu
Jian Jia
Haowen Luo
Yaohua Wang
F. Wang
Rong Jin
Xiuyu Sun
SSL
39
94
0
30 Mar 2023
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation
Qi-jun Zhao
Ce Zheng
Mengyuan Liu
Pichao Wang
Chong Chen
ViT
25
87
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
32
20
0
30 Mar 2023
PMatch: Paired Masked Image Modeling for Dense Geometric Matching
Shengjie Zhu
Xiaoming Liu
40
24
0
30 Mar 2023
Masked Autoencoders as Image Processors
Huiyu Duan
Wei Shen
Xiongkuo Min
Danyang Tu
Long Teng
Jia Wang
Guangtao Zhai
ViT
38
11
0
30 Mar 2023
Mixed Autoencoder for Self-supervised Visual Representation Learning
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
SSL
30
43
0
30 Mar 2023
ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing
Xiaodan Li
YueFeng Chen
Yao Zhu
Shuhui Wang
Rong Zhang
Hui Xue
34
24
0
30 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
73
329
0
29 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
57
156
0
28 Mar 2023
Mask and Restore: Blind Backdoor Defense at Test Time with Masked Autoencoder
Tao Sun
Lu Pang
Chao Chen
Haibin Ling
AAML
45
9
0
27 Mar 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
27
8
0
26 Mar 2023
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh
Hyeji Hwang
Hee-young Lee
Yongtaek Lim
Geunyoung Jung
Jiyoung Jung
Hosik Choi
Kyungwoo Song
VLM
VPVLM
85
57
0
26 Mar 2023
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu
Xin Wen
Xihui Liu
Hengshuang Zhao
3DPC
122
40
0
24 Mar 2023
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Runsen Xu
Tai Wang
Wenwei Zhang
Runjian Chen
Jinkun Cao
Jiangmiao Pang
Dahua Lin
3DPC
39
30
0
23 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
126
63
0
23 Mar 2023
Masked Image Training for Generalizable Deep Image Denoising
Haoyu Chen
Jinjin Gu
Yihao Liu
Salma Abdel Magid
Chao Dong
Qiong Wang
Hanspeter Pfister
Lei Zhu
27
63
0
23 Mar 2023
Test-time Detection and Repair of Adversarial Samples via Masked Autoencoder
Yun-Yun Tsai
Ju-Chin Chao
Albert Wen
Zhaoyuan Yang
Chengzhi Mao
Tapan Shah
Junfeng Yang
AAML
21
1
0
22 Mar 2023
Correlational Image Modeling for Self-Supervised Visual Pre-Training
Wei Li
Jiahao Xie
Chen Change Loy
SSL
40
10
0
22 Mar 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
43
47
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
42
261
0
20 Mar 2023
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
Jihao Liu
Tai Wang
Boxiao Liu
Qihang Zhang
Yu Liu
Hongsheng Li
38
16
0
20 Mar 2023
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation
Zhaohu Xing
Lei Zhu
Lequan Yu
Zhiheng Xing
Liang Wan
39
8
0
18 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
21
37
0
17 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
30
71
0
17 Mar 2023
MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling
Xuzhe Zhang
Yu-Hsun Wu
S. Entringer
H. Simhan
Jia Guo
...
A. Jackowski
Haifeng Li
J. Posner
Andrew F. Laine
Yun Wang
OOD
20
11
0
16 Mar 2023
Real Face Foundation Representation Learning for Generalized Deepfake Detection
Liang Shi
Jie Zhang
Shiguang Shan
CVBM
43
7
0
15 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
70
61
0
14 Mar 2023
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Tianlin Li
Ying Wang
Ziwei Xuan
Guo-Jun Qi
ViT
48
3
0
14 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
28
1
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
37
1
0
13 Mar 2023
Improving Masked Autoencoders by Learning Where to Mask
Haijia Chen
Wendong Zhang
Yunbo Wang
Xiaokang Yang
SSL
20
20
0
12 Mar 2023
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Ibrahim Ethem Hamamci
Sezgin Er
Enis Simsar
Anjany Sekuboyina
M. Gundogar
B. Stadlinger
A. Mehl
Bjoern H. Menze
DiffM
MedIm
23
26
0
11 Mar 2023
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
Peng Gao
Renrui Zhang
Rongyao Fang
Ziyi Lin
Hongyang Li
Hongsheng Li
Qiao Yu
27
18
0
09 Mar 2023
Masked Image Modeling with Local Multi-Scale Reconstruction
Haoqing Wang
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhiwei Deng
Kai Han
64
46
0
09 Mar 2023
Previous
1
2
3
...
11
12
13
...
15
16
17
Next