Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09886
Cited By
SimMIM: A Simple Framework for Masked Image Modeling
18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SimMIM: A Simple Framework for Masked Image Modeling"
50 / 849 papers shown
Title
Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning
Jiahang Zhang
Lilang Lin
Jiaying Liu
SSL
23
15
0
08 Aug 2023
Feature-Suppressed Contrast for Self-Supervised Food Pre-training
Xinda Liu
Yaohui Zhu
Linhu Liu
Jiang Tian
Lili Wang
SSL
33
2
0
07 Aug 2023
DETR Doesn't Need Multi-Scale or Locality Design
Yutong Lin
Yuhui Yuan
Zheng-Wei Zhang
Chen Li
Nanning Zheng
Han Hu
37
5
0
03 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
16
8
0
01 Aug 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability
Yuan Liu
Songyang Zhang
Jiacheng Chen
Zhaohui Yu
Kai-xiang Chen
Dahua Lin
27
29
0
01 Aug 2023
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training
Jeya Maria Jose Valanarasu
Yucheng Tang
Dong Yang
Ziyue Xu
Can Zhao
...
Vishal M. Patel
Bennett Landman
Daguang Xu
Yufan He
V. Nath
MedIm
23
13
0
31 Jul 2023
Stochastic positional embeddings improve masked image modeling
Amir Bar
Florian Bordes
Assaf Shocher
Mahmoud Assran
Pascal Vincent
Nicolas Ballas
Trevor Darrell
Amir Globerson
Yann LeCun
36
3
0
31 Jul 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
19
0
0
29 Jul 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
34
13
0
27 Jul 2023
Pre-Training with Diffusion models for Dental Radiography segmentation
Jérémy Rousseau
C. Alaka
E. Covili
H. Mayard
L. Misrachi
Willy Au
DiffM
MedIm
AI4CE
41
4
0
26 Jul 2023
Controllable Guide-Space for Generalizable Face Forgery Detection
Yingjie Guo
Cheng Zhen
Pengfei Yan
CVBM
AAML
38
21
0
26 Jul 2023
CLIP-KD: An Empirical Study of CLIP Model Distillation
Chuanguang Yang
Zhulin An
Libo Huang
Junyu Bi
Xinqiang Yu
Hansheng Yang
Boyu Diao
Yongjun Xu
VLM
29
27
0
24 Jul 2023
Global k-Space Interpolation for Dynamic MRI Reconstruction using Masked Image Modeling
Jia-Yu Pan
Suprosanna Shit
Özgün Turgut
Wenqi Huang
Hongwei Bran Li
Nil Stolt Ansó
Thomas Kustner
Kerstin Hammernik
Daniel Rueckert
33
9
0
24 Jul 2023
AlignDet: Aligning Pre-training and Fine-tuning in Object Detection
Ming Li
Jie Wu
Xionghui Wang
Chen Chen
Jie Qin
Xu Xiao
Rui Wang
Min Zheng
Xin Pan
ObjD
VLM
32
18
0
20 Jul 2023
A Holistic Assessment of the Reliability of Machine Learning Systems
Anthony Corso
David Karamadian
Romeo Valentin
Mary Cooper
Mykel J. Kochenderfer
30
6
0
20 Jul 2023
Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Guangzhi Wang
Yangyang Guo
Mohan S. Kankanhalli
28
0
0
19 Jul 2023
CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation
Lizhao Liu
Zhuangwei Zhuang
Shan Huang
Xu Xiao
Tian-Zhu Xiang
Cen Chen
Jingdong Wang
Mingkui Tan
3DPC
50
17
0
19 Jul 2023
Domain Adaptation based Object Detection for Autonomous Driving in Foggy and Rainy Weather
Jinlong Li
Runsheng Xu
Xinyu Liu
Jin Ma
Baolu Li
Q. Zou
Jiaqi Ma
Hongkai Yu
32
7
0
18 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL
ViT
24
3
0
18 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
28
39
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
22
27
0
14 Jul 2023
DenseMP: Unsupervised Dense Pre-training for Few-shot Medical Image Segmentation
Zhaoxin Fan
Puquan Pan
Zeren Zhang
C. Chen
Tianyang Wang
Si Zheng
Min Xu
VLM
44
0
0
13 Jul 2023
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Alexei A. Efros
Xiaolong Wang
TTA
ViT
3DGS
47
16
0
11 Jul 2023
Towards Cross-Table Masked Pretraining for Web Data Mining
Chaonan Ye
Guoshan Lu
Haobo Wang
Liyao Li
Sai Wu
Gang Chen
Jun Zhao
LMTD
39
13
0
10 Jul 2023
Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D Semantic Segmentation
Boxiang Zhang
Zunran Wang
Yonggen Ling
Yuanyuan Guan
Shenghao Zhang
Wenhui Li
37
6
0
09 Jul 2023
Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers
Zhiyu Zhu
Junhui Hou
Dapeng Wu
ViT
24
28
0
09 Jul 2023
AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images
Ao Cheng
Guoqiang Zhao
Lirong Wang
Ruobing Zhang
24
3
0
05 Jul 2023
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization
Yumeng Li
Dan Zhang
M. Keuper
Anna Khoreva
46
10
0
02 Jul 2023
Stitched ViTs are Flexible Vision Backbones
Zizheng Pan
Jing Liu
Haoyu He
Jianfei Cai
Bohan Zhuang
20
2
0
30 Jun 2023
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Yun-Hao Bai
Xintao Wang
Yanpei Cao
Yixiao Ge
Chun Yuan
Ying Shan
DiffM
30
51
0
29 Jun 2023
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Bowen Shi
Xiaopeng Zhang
Yaoming Wang
Jin Li
Wenrui Dai
Junni Zou
H. Xiong
Qi Tian
51
4
0
28 Jun 2023
You Can Mask More For Extremely Low-Bitrate Image Compression
Anqi Li
Feng Li
Jiaxin Han
H. Bai
Runmin Cong
Chunjie Zhang
Ming Wang
Weisi Lin
Yao-Min Zhao
36
2
0
27 Jun 2023
ParameterNet: Parameters Are All You Need
Kai Han
Yunhe Wang
Jianyuan Guo
Enhua Wu
VLM
AI4CE
35
25
0
26 Jun 2023
Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning
Shaofeng Zhang
Feng Zhu
Rui Zhao
Junchi Yan
27
17
0
23 Jun 2023
Inter-Instance Similarity Modeling for Contrastive Learning
Chen Shen
Dawei Liu
Hao Tang
Zhe Qu
Jianxin Wang
SSL
31
4
0
21 Jun 2023
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
Dezhi Peng
Chongyu Liu
Yuliang Liu
Lianwen Jin
DiffM
24
9
0
21 Jun 2023
Task-Robust Pre-Training for Worst-Case Downstream Adaptation
Jianghui Wang
Cheng Yang
Xingyu Xie
Cong Fang
Zhouchen Lin
OOD
32
0
0
21 Jun 2023
Continual Learners are Incremental Model Generalizers
Jaehong Yoon
Sung Ju Hwang
Yu Cao
CLL
33
5
0
21 Jun 2023
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
F. Liu
Delong Chen
Zhan-Rong Guan
Xiaocong Zhou
Jiale Zhu
Qiaolin Ye
Liyong Fu
Jun Zhou
VLM
71
193
0
19 Jun 2023
Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs
A. Almalki
Longin Jan Latecki
MedIm
23
4
0
18 Jun 2023
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Dequan Wang
Xiaosong Wang
Lilong Wang
Mengzhang Li
Q. Da
...
Qi Duan
Jie Zhao
Kang Li
Yu Qiao
Shaoting Zhang
VLM
MedIm
33
33
0
16 Jun 2023
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
You-Chen Liu
Lingdong Kong
Jun Cen
Runnan Chen
Wenwei Zhang
Liang Pan
Kai-xiang Chen
Ziwei Liu
37
83
0
15 Jun 2023
Advancing Volumetric Medical Image Segmentation via Global-Local Masked Autoencoder
Jiafan Zhuang
Luyang Luo
Hao Chen
28
11
0
15 Jun 2023
ViP: A Differentially Private Foundation Model for Computer Vision
Yaodong Yu
Maziar Sanjabi
Yi Ma
Kamalika Chaudhuri
Chuan Guo
18
12
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
38
7
0
14 Jun 2023
MOFI: Learning Image Representations from Noisy Entity Annotated Images
Wentao Wu
Aleksei Timofeev
Chen Chen
Bowen Zhang
Kun Duan
...
Yantao Zheng
Jonathon Shlens
Xianzhi Du
Zhe Gan
Yinfei Yang
VLM
26
7
0
13 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
38
2
0
12 Jun 2023
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
Greg Heinrich
Hongxu Yin
Andrew Tao
J. Álvarez
Jan Kautz
Pavlo Molchanov
ViT
28
67
0
09 Jun 2023
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Lin Liu
Mingming Zhao
Shanxin Yuan
Wenlong Lyu
Wen-gang Zhou
Houqiang Li
Yanfeng Wang
Qi Tian
16
3
0
09 Jun 2023
R-MAE: Regions Meet Masked Autoencoders
Duy-Kien Nguyen
Vaibhav Aggarwal
Yanghao Li
Martin R. Oswald
Alexander Kirillov
Cees G. M. Snoek
Xinlei Chen
TPM
34
11
0
08 Jun 2023
Previous
1
2
3
...
9
10
11
...
15
16
17
Next