2111.06377
Masked Autoencoders Are Scalable Vision Learners
11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
Papers citing
"Masked Autoencoders Are Scalable Vision Learners"
50 / 4,779 papers shown
Self-Supervised Pre-Training for Precipitation Post-Processor
Sojung An
Junha Lee
Jiyeon Jang
Inchae Na
Wooyeon Park
Sujeong You
AI4Cl
76
1
0
31 Oct 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
81
17
0
31 Oct 2023
FOCAL: Contrastive Learning for Multimodal Time-Series Sensing Signals in Factorized Orthogonal Latent Space
Shengzhong Liu
Tomoyoshi Kimura
Dongxin Liu
Ruijie Wang
Jinyang Li
Suhas Diggavi
Mani B. Srivastava
Tarek Abdelzaher
AI4TS
88
27
0
30 Oct 2023
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
Micah Goldblum
Hossein Souri
Renkun Ni
Manli Shu
Viraj Prabhu
...
Adrien Bardes
Judy Hoffman
Ramalingam Chellappa
Andrew Gordon Wilson
Tom Goldstein
VLM
194
68
0
30 Oct 2023
Herd: Using multiple, smaller LLMs to match the performances of proprietary, large LLMs via an intelligent composer
S. N. Hari
Matt Thomson
55
0
0
30 Oct 2023
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Ao Ma
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
88
16
0
30 Oct 2023
Harvest Video Foundation Models via Efficient Post-Pretraining
Yizhuo Li
Kunchang Li
Yinan He
Yi Wang
Yali Wang
Limin Wang
Yu Qiao
Ping Luo
CLIP
VLM
VGen
113
2
0
30 Oct 2023
AViTMP: A Tracking-Specific Transformer for Single-Branch Visual Tracking
Chuanming Tang
Kai Wang
Joost van de Weijer
Jianlin Zhang
Yongmei Huang
116
0
0
30 Oct 2023
Fast Trainable Projection for Robust Fine-Tuning
Junjiao Tian
Yen-Cheng Liu
James Seale Smith
Z. Kira
OOD
101
14
0
29 Oct 2023
BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping
Srikumar Sastry
Subash Khanal
Aayush Dhakal
Di Huang
Nathan Jacobs
79
10
0
29 Oct 2023
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Shuhuai Ren
Sishuo Chen
Shicheng Li
Xu Sun
Lu Hou
ViT
104
34
0
29 Oct 2023
Adversarial Examples Are Not Real Features
Ang Li
Yifei Wang
Yiwen Guo
Yisen Wang
93
13
0
29 Oct 2023
Identifiable Contrastive Learning with Automatic Feature Importance Discovery
Qi Zhang
Yifei Wang
Yisen Wang
83
13
0
29 Oct 2023
Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings
Yi Ren
Samuel Lavoie
Mikhail Galkin
Danica J. Sutherland
Aaron Courville
86
16
0
28 Oct 2023
Pre-training with Random Orthogonal Projection Image Modeling
Maryam Haghighat
Peyman Moghadam
Shaheer Mohamed
Piotr Koniusz
VLM
87
9
0
28 Oct 2023
Triplet Attention Transformer for Spatiotemporal Predictive Learning
Xuesong Nie
Xi Chen
Haoyuan Jin
Zhihang Zhu
Yunfeng Yan
Donglian Qi
ViT
53
11
0
28 Oct 2023
Foundation Models for Generalist Geospatial Artificial Intelligence
Johannes Jakubik
Sujit Roy
C. Phillips
P. Fraccaro
Denys Godwin
...
Hamed Alemohammad
M. Maskey
R. Ganti
Kommy Weldemariam
Rahul Ramachandran
AI4CE
VLM
98
105
0
28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
Yi Wang
Hugo Hernández Hernández
C. Albrecht
Xiao Xiang Zhu
107
33
0
28 Oct 2023
Visual Explanations via Iterated Integrated Attributions
Oren Barkan
Yehonatan Elisha
Yuval Asher
Amit Eshel
Noam Koenigstein
FAtt
XAI
49
18
0
28 Oct 2023
Self-Supervised Multi-Modality Learning for Multi-Label Skin Lesion Classification
Hao Wang
Euijoon Ahn
Lei Bi
Jinman Kim
70
1
0
28 Oct 2023
ReConTab: Regularized Contrastive Representation Learning for Tabular Data
Suiyao Chen
Jing Wu
N. Hovakimyan
Handong Yao
87
36
0
28 Oct 2023
Learning to recognize occluded and small objects with partial inputs
H. Zunair
A. Ben Hamza
85
1
0
27 Oct 2023
Unlocking the Potential of Prompt-Tuning in Bridging Generalized and Personalized Federated Learning
Wenlong Deng
Christos Thrampoulidis
Xiaoxiao Li
134
12
0
27 Oct 2023
FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model for Fault Recognition
Zeren Zhang
Ran Chen
Jinwen Ma
ViT
30
0
0
27 Oct 2023
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
87
13
0
27 Oct 2023
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
Zijie Song
Zhenzhen Hu
Richang Hong
SSL
117
0
0
27 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
189
21
0
26 Oct 2023
Semantic Generative Augmentations for Few-Shot Counting
Perla Doubinsky
Nicolas Audebert
M. Crucianu
Hervé Le Borgne
VLM
DiffM
90
4
0
26 Oct 2023
BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds
Corentin Sautier
Gilles Puy
Alexandre Boulch
Renaud Marlet
Vincent Lepetit
3DPC
75
16
0
26 Oct 2023
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning
Fengyuan Shi
Limin Wang
ViT
77
0
0
26 Oct 2023
netFound: Foundation Model for Network Security
Satyandra Guthula
Navya Battula
Roman Beltiukov
Wenbo Guo
Arpit Gupta
Inder Monga
154
19
0
25 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
131
159
0
25 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
84
10
0
25 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
57
0
0
25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
77
5
0
25 Oct 2023
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation
Chengpeng Li
Zhengyi Yang
Jizhi Zhang
Jiancan Wu
Dingxian Wang
Xiangnan He
Xiang Wang
OffRL
96
1
0
25 Oct 2023
General Point Model with Autoencoding and Autoregressive
Zhe Li
Zhangyang Gao
Cheng Tan
Stan Z. Li
Laurence T. Yang
AI4CE
3DPC
60
4
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
76
3
0
25 Oct 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
258
1
0
25 Oct 2023
Fine tuning Pre trained Models for Robustness Under Noisy Labels
Sumyeong Ahn
Sihyeon Kim
Jongwoo Ko
SeYoung Yun
AAML
NoLa
121
8
0
24 Oct 2023
Compressed representation of brain genetic transcription
James K. Ruffle
Henry Watkins
Robert J. Gray
H. Hyare
Michel Thiebaut de Schotten
P. Nachev
76
0
0
24 Oct 2023
Debiasing, calibrating, and improving Semi-supervised Learning performance via simple Ensemble Projector
Khanh-Binh Nguyen
69
3
0
24 Oct 2023
I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation
Yunyao Mao
Jiajun Deng
Wen-gang Zhou
Zhenbo Lu
Wanli Ouyang
Houqiang Li
VLM
85
1
0
24 Oct 2023
Remote Heart Rate Monitoring in Smart Environments from Videos with Self-supervised Pre-training
Divij Gupta
Ali Etemad
93
2
0
23 Oct 2023
Deep Integrated Explanations
Oren Barkan
Yehonatan Elisha
Jonathan Weill
Yuval Asher
Amit Eshel
Noam Koenigstein
FAtt
109
7
0
23 Oct 2023
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Haoxiang Wang
Pavan Kumar Anasosalu Vasu
Fartash Faghri
Raviteja Vemulapalli
Mehrdad Farajtabar
Sachin Mehta
Mohammad Rastegari
Oncel Tuzel
Hadi Pouransari
VLM
128
73
0
23 Oct 2023
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Lihe Yang
Xiaogang Xu
Bingyi Kang
Yinghuan Shi
Hengshuang Zhao
93
46
0
23 Oct 2023
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Jingyun Yang
Max Sobol Mark
Brandon Vu
Archit Sharma
Jeannette Bohg
Chelsea Finn
OffRL
OnRL
95
26
0
23 Oct 2023
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
Zhiyuan Liu
Yaorui Shi
An Zhang
Enzhi Zhang
Kenji Kawaguchi
Xiang Wang
Tat-Seng Chua
AI4CE
97
40
0
23 Oct 2023
SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling
Benjamin Missaoui
Chongbin Yuan
VLM
59
1
0
23 Oct 2023