Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09886
Cited By
SimMIM: A Simple Framework for Masked Image Modeling
18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SimMIM: A Simple Framework for Masked Image Modeling"
50 / 849 papers shown
Title
Exploring scalable medical image encoders beyond text supervision
Fernando Pérez-García
Harshita Sharma
Sam Bond-Taylor
Kenza Bouzid
Valentina Salvatelli
...
Maria T. A. Wetscherek
Noel C. F. Codella
Stephanie L. Hyland
Javier Alvarez-Valle
Ozan Oktay
LM&MA
MedIm
50
26
0
19 Jan 2024
Reconstructing the Invisible: Video Frame Restoration through Siamese Masked Conditional Variational Autoencoder
Yongchen Zhou
Richard Jiang
24
0
0
18 Jan 2024
Adversarial Masking Contrastive Learning for vein recognition
Huafeng Qin
Yiquan Wu
M. El-Yacoubi
Jun Wang
Guangxiang Yang
MedIm
34
0
0
16 Jan 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
56
17
0
16 Jan 2024
Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing
Jakob Hackstein
Gencer Sumbul
Kai Norman Clasen
Begüm Demir
38
7
0
15 Jan 2024
Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
34
5
0
15 Jan 2024
MIMIC: Mask Image Pre-training with Mix Contrastive Fine-tuning for Facial Expression Recognition
Fan Zhang
Xiaobao Guo
Xiaojiang Peng
Alex C. Kot
27
0
0
14 Jan 2024
Transformer for Object Re-Identification: A Survey
Mang Ye
Shuo Chen
Chenyue Li
Wei-Shi Zheng
David J. Crandall
Bo Du
ViT
98
13
0
13 Jan 2024
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
Edward Sanderson
B. Matuszewski
23
2
0
11 Jan 2024
Motion Guided Token Compression for Efficient Masked Video Modeling
Yukun Feng
Yangming Shi
Fengze Liu
Tan Yan
43
0
0
10 Jan 2024
CrisisViT: A Robust Vision Transformer for Crisis Image Classification
Zijun Long
R. McCreadie
Muhammad Imran
96
9
0
05 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
38
14
0
04 Jan 2024
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment
Ziping Ma
Furong Xu
Jian Liu
Ming Yang
Qingpei Guo
VLM
42
3
0
04 Jan 2024
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang
Jin Gao
Zheling Tan
Lingfeng Zhou
Kexin Ding
Mu Zhou
Shaoting Zhang
Dequan Wang
AI4CE
39
22
0
04 Jan 2024
Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence
Ruizhuo Xu
Linzhi Huang
Mei Wang
Jiani Hu
Weihong Deng
ViT
MedIm
35
1
0
01 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
42
14
0
31 Dec 2023
Analyzing Local Representations of Self-supervised Vision Transformers
Ani Vanyan
Alvard Barseghyan
Hakob Tamazyan
Vahan Huroyan
Hrant Khachatrian
Martin Danelljan
42
3
0
31 Dec 2023
Morphing Tokens Draw Strong Masked Image Models
Taekyung Kim
Byeongho Heo
Dongyoon Han
54
3
0
30 Dec 2023
Visual Point Cloud Forecasting enables Scalable Autonomous Driving
Zetong Yang
Li Chen
Yanan Sun
Hongyang Li
3DPC
27
40
0
29 Dec 2023
Learning Vision from Models Rivals Learning Vision from Data
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
27
45
0
28 Dec 2023
Learning to Embed Time Series Patches Independently
Seunghan Lee
Taeyoung Park
Kibok Lee
SSL
AI4TS
28
27
0
27 Dec 2023
Soft Contrastive Learning for Time Series
Seunghan Lee
Taeyoung Park
Kibok Lee
AI4TS
27
19
0
27 Dec 2023
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
42
5
0
21 Dec 2023
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
35
10
0
19 Dec 2023
How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model
Yixin Zhang
Shen Zhao
Han Gu
Maciej Mazurowski
VLM
43
4
0
17 Dec 2023
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning
Kaiyou Song
Shan Zhang
Tong Wang
VLM
31
2
0
16 Dec 2023
SeiT++: Masked Token Modeling Improves Storage-efficient Training
Min-Seob Lee
Song Park
Byeongho Heo
Dongyoon Han
Hyunjung Shim
MQ
VLM
26
1
0
15 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
41
16
0
14 Dec 2023
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding
Yi Xin
Junlong Du
Qiang Wang
Zhiwen Lin
Ke Yan
VPVLM
92
49
0
14 Dec 2023
Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization
W. Pan
Zhe Xu
Jiangpeng Yan
Zihan Wu
R. Tong
Xiu Li
Jianhua Yao
ISeg
28
1
0
14 Dec 2023
ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance
Ling-Hao Chen
Yuanshuo Zhang
Taohua Huang
Liangcai Su
Zeyi Lin
Xi Xiao
Xiaobo Xia
Tongliang Liu
NoLa
33
9
0
13 Dec 2023
PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images
Tao Zhang
Kun Ding
Jinyong Wen
Yu Xiong
Zeyu Zhang
Shiming Xiang
Chunhong Pan
30
3
0
13 Dec 2023
LMD: Faster Image Reconstruction with Latent Masking Diffusion
Zhiyuan Ma
Zhihuan Yu
Jianjun Li
Bowen Zhou
DiffM
24
8
0
13 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
50
64
0
11 Dec 2023
The Counterattack of CNNs in Self-Supervised Learning: Larger Kernel Size might be All You Need
Tianjin Huang
Tianlong Chen
Zhangyang Wang
Shiwei Liu
32
1
0
09 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
35
0
0
08 Dec 2023
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency
Yikai Wang
Chenjie Cao
Yanwei Fu
Ke Fan
Xiangyang Xue
Yanwei Fu
DiffM
53
2
0
08 Dec 2023
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation
Jiawei Fan
Chao Li
Xiaolong Liu
Meina Song
Anbang Yao
28
5
0
07 Dec 2023
A brief introduction to a framework named Multilevel Guidance-Exploration Network
Guoqing Yang
Zhiming Luo
Jianzhe Gao
Yingxin Lai
Kun Yang
Yifan He
Shaozi Li
3DH
29
0
0
07 Dec 2023
Low-shot Object Learning with Mutual Exclusivity Bias
Anh Thai
Ahmad Humayun
Stefan Stojanov
Zixuan Huang
Bikram Boote
James M. Rehg
40
2
0
06 Dec 2023
Learning Cortical Anomaly through Masked Encoding for Unsupervised Heterogeneity Mapping
Hao-Chun Yang
Ole A. Andreassen
Lars T. Westlye
A. Marquand
Christian F. Beckmann
T. Wolfers
18
1
0
05 Dec 2023
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Arun V. Reddy
William Paul
Corban Rivera
Ketul Shah
Celso M. de Melo
Rama Chellappa
37
4
0
05 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Alan L. Yuille
Cihang Xie
VLM
57
7
0
04 Dec 2023
Multi-task Image Restoration Guided By Robust DINO Features
Xin Lin
Chao Ren
Kelvin C. K. Chan
Lu Qi
Jinshan Pan
Ming-Hsuan Yang
37
2
0
04 Dec 2023
Local Masking Meets Progressive Freezing: Crafting Efficient Vision Transformers for Self-Supervised Learning
Utku Mert Topcuoglu
Erdem Akagündüz
43
1
0
02 Dec 2023
Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling
Shentong Mo
Pedro Morgado
27
13
0
02 Dec 2023
Sequential Modeling Enables Scalable Learning for Large Vision Models
Yutong Bai
Xinyang Geng
K. Mangalam
Amir Bar
Alan Yuille
Trevor Darrell
Jitendra Malik
Alexei A. Efros
MLLM
VLM
22
156
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
42
140
0
01 Dec 2023
Learning Anatomically Consistent Embedding for Chest Radiography
Ziyu Zhou
Haozhe Luo
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
SSL
17
5
0
01 Dec 2023
BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling
Yixuan Luo
Mengye Ren
Sai Qian Zhang
28
0
0
28 Nov 2023
Previous
1
2
3
...
6
7
8
...
15
16
17
Next