2111.09886
SimMIM: A Simple Framework for Masked Image Modeling
18 November 2021
Zhenda Xie
Zheng Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
Papers citing
"SimMIM: A Simple Framework for Masked Image Modeling"
50 / 849 papers shown
Title
Large-scale Dataset Pruning with Dynamic Uncertainty
Muyang He
Shuo Yang
Tiejun Huang
Bo Zhao
36
25
0
08 Jun 2023
FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Yijin Li
Hongwei Qin
Jifeng Dai
Xiaogang Wang
Hongsheng Li
33
4
0
08 Jun 2023
Improving Visual Prompt Tuning for Self-supervised Vision Transformers
S. Yoo
Eunji Kim
Dahuin Jung
Jungbeom Lee
Sung-Hoon Yoon
VLM
6
38
0
08 Jun 2023
Understanding Masked Autoencoders via Hierarchical Latent Variable Models
Lingjing Kong
Martin Q. Ma
Guan-Hong Chen
Eric Xing
Yuejie Chi
Louis-Philippe Morency
Kun Zhang
19
30
0
08 Jun 2023
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
Yanan Sun
Zi-Qi Zhong
Qi Fan
Chi-Keung Tang
Yu-Wing Tai
VLM
35
4
0
07 Jun 2023
Asymmetric Patch Sampling for Contrastive Learning
Chen Shen
Jianzhong Chen
Shu Wang
Hulin Kuang
Jin Liu
Jianxin Wang
SSL
38
4
0
05 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
43
0
0
02 Jun 2023
Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal
Xavier F. Cadet
Ranya Aloufi
A. Miranville
S. Ahmadi-Abhari
Hamed Haddadi
26
0
0
02 Jun 2023
Masked Autoencoder for Unsupervised Video Summarization
Minho Shim
Taeoh Kim
Jinhyung Kim
Dongyoon Wee
33
1
0
02 Jun 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
45
160
0
01 Jun 2023
MOSAIC: Masked Optimisation with Selective Attention for Image Reconstruction
Pamuditha Somarathne
Tharindu Wickremasinghe
Amashi Niwarthana
A. Thieshanthan
Chamira U. S. Edussooriya
D. Wadduwage
23
0
0
01 Jun 2023
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners
Sarthak Yadav
Sergios Theodoridis
Lars Kai Hansen
Zheng-Hua Tan
28
7
0
01 Jun 2023
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling
Yingzhi Zhang
Taiguo Li
Chong Li
Xinghong Zhou
40
10
0
01 Jun 2023
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
Baohao Liao
Shaomu Tan
Christof Monz
KELM
23
29
0
01 Jun 2023
Augmentation-aware Self-supervised Learning with Conditioned Projector
Marcin Przewięźlikowski
Mateusz Pyla
Bartosz Zieliński
Bartłomiej Twardowski
Jacek Tabor
Marek Śmieja
SSL
43
2
0
31 May 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
25
9
0
31 May 2023
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
Guo Fan
Zekun Qi
Wenkai Shi
Kaisheng Ma
3DPC
SSL
35
9
0
31 May 2023
Robust Lane Detection through Self Pre-training with Masked Sequential Autoencoders and Fine-tuning with Customized PolyLoss
Ruohan Li
Yongqi Dong
34
4
0
26 May 2023
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities
Jingyuan Sun
Mingxiao Li
Zijiao Chen
Yunhao Zhang
Shaonan Wang
Marie-Francine Moens
DiffM
47
30
0
26 May 2023
LANISTR: Multimodal Learning from Structured and Unstructured Data
Sayna Ebrahimi
Sercan Ö. Arik
Yihe Dong
Tomas Pfister
20
4
0
26 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
28
2
0
25 May 2023
VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Zhiwei Hao
Jianyuan Guo
Kai Han
Han Hu
Chang Xu
Yunhe Wang
38
16
0
25 May 2023
Delving Deeper into Data Scaling in Masked Image Modeling
Cheng Lu
Xiaojie Jin
Qibin Hou
Jun Hao Liew
Ming-Ming Cheng
Jiashi Feng
38
4
0
24 May 2023
Siamese Masked Autoencoders
Agrim Gupta
Jiajun Wu
Jia Deng
Li Fei-Fei
46
49
0
23 May 2023
A multimodal method based on cross-attention and convolution for postoperative infection diagnosis
Xianjie Liu
Hon-Yi Shi
23
0
0
23 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
28
36
0
23 May 2023
A Dive into SAM Prior in Image Restoration
Zeyu Xiao
Jiawang Bai
Zhihe Lu
Zhiwei Xiong
29
16
0
23 May 2023
Contrastive Predictive Autoencoders for Dynamic Point Cloud Self-Supervised Learning
Xiaoxiao Sheng
Zhiqiang Shen
Gang Xiao
3DPC
SSL
28
6
0
22 May 2023
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Wenxuan Wang
Jing Liu
Xingjian He
Yisi Zhang
Cheng Chen
Jiachen Shen
Yan Zhang
Jiangyun Li
25
11
0
19 May 2023
HMSN: Hyperbolic Self-Supervised Learning by Clustering with Ideal Prototypes
A. Durrant
Georgios Leontidis
SSL
38
4
0
18 May 2023
A Survey on Time-Series Pre-Trained Models
Qianli Ma
Ziqiang Liu
Zhenjing Zheng
Ziyang Huang
Siying Zhu
Zhongzhong Yu
James T. Kwok
AI4TS
31
50
0
18 May 2023
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding
ShuWei Feng
Tianyang Zhan
Zhanming Jie
Trung Quoc Luong
Xiaoran Jin
27
1
0
16 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPC
ViT
29
38
0
15 May 2023
Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation
Fangwen Wu
Jingxuan He
Yufei Yin
Y. Hao
Gang Huang
Lechao Cheng
ISeg
26
5
0
15 May 2023
PLIP: Language-Image Pre-training for Person Representation Learning
Jia-li Zuo
Jiahao Hong
Feng Zhang
Changqian Yu
Hanyu Zhou
Changxin Gao
Nong Sang
Jingdong Wang
VLM
MLLM
39
32
0
15 May 2023
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval
Han Fang
Zhifei Yang
Xianghao Zang
Chao Ban
Hao Sun
VGen
34
2
0
13 May 2023
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information
Vladimir Araujo
Alvaro Soto
Marie-Francine Moens
KELM
22
2
0
12 May 2023
Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression
Yixin Gao
Runsen Feng
Zongyu Guo
Zhibo Chen
37
4
0
12 May 2023
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Jia-ju Mao
Shu-Hua Guo
Yuan Chang
Xuesong Yin
Binling Nie
28
2
0
10 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
24
72
0
09 May 2023
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding
Li Jiang
Zetong Yang
Shaoshuai Shi
Vladislav Golyanik
Dengxin Dai
Bernt Schiele
3DPC
37
13
0
08 May 2023
PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen
Xiaoxiao Sheng
Longguang Wang
Y. Guo
Qiong Liu
Xiaoping Zhou
3DPC
SSL
35
14
0
06 May 2023
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
81
6
0
05 May 2023
What Do Self-Supervised Vision Transformers Learn?
Namuk Park
Wonjae Kim
Byeongho Heo
Taekyung Kim
Sangdoo Yun
SSL
88
76
1
01 May 2023
Objectives Matter: Understanding the Impact of Self-Supervised Objectives on Vision Transformer Representations
Shashank Shekhar
Florian Bordes
Pascal Vincent
Ari S. Morcos
29
10
0
25 Apr 2023
Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Heng Pan
Chenyang Liu
Wenxiao Wang
Liejie Yuan
Hongfa Wang
Zhifeng Li
Wei Liu
VLM
35
3
0
25 Apr 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
50
275
0
24 Apr 2023
Self-supervised Learning by View Synthesis
Shaoteng Liu
Xiangyu Zhang
T. Hu
Jiaya Jia
3DV
ViT
40
1
0
22 Apr 2023
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation
Wenxuan Wang
Jing Wang
Chia-Ju Chen
Jianbo Jiao
Yuanxiu Cai
Shanshan Song
Jiangyun Li
MedIm
31
18
0
21 Apr 2023
Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget
Johannes Lehner
Benedikt Alkin
Andreas Fürst
Elisabeth Rumetshofer
Lukas Miklautz
Sepp Hochreiter
29
18
0
20 Apr 2023