ResearchTrend.AI
Masked Autoencoders Are Scalable Vision Learners
arXiv:2111.06377 (versions: v1, v2, v3; v3 is the latest)

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT, TPM

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,777 papers shown
Title
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
Chunyuan Li
Haotian Liu
Liunian Harold Li
Pengchuan Zhang
J. Aneja
...
Ping Jin
Houdong Hu
Zicheng Liu
Yong Jae Lee
Jianfeng Gao
101
152
0
19 Apr 2022
The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Hao Liu
Xinghua Jiang
Xin Li
Antai Guo
Deqiang Jiang
Bo Ren
88
39
0
18 Apr 2022
Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey
Kento Nozawa
Issei Sato
AI4TS
139
5
0
18 Apr 2022
ResT V2: Simpler, Faster and Stronger
Qing-Long Zhang
Yubin Yang
ViT
68
26
0
15 Apr 2022
Masked Siamese Networks for Label-Efficient Learning
Mahmoud Assran
Mathilde Caron
Ishan Misra
Piotr Bojanowski
Florian Bordes
Pascal Vincent
Armand Joulin
Michael G. Rabbat
Nicolas Ballas
SSL
131
325
0
14 Apr 2022
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
129
418
0
14 Apr 2022
Residual Swin Transformer Channel Attention Network for Image Demosaicing
W. Xing
K. Egiazarian
ViT
41
14
0
14 Apr 2022
WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma
Chu Han
Xipeng Pan
Lixu Yan
Huan Lin
Bingbing Li
...
Chengda Lu
Xin Chen
C. Liang
Qingling Zhang
Zaiyi Liu
157
30
0
13 Apr 2022
Self-supervised Vision Transformers for Joint SAR-optical Representation Learning
Yi Wang
C. Albrecht
Xiaoxiang Zhu
ViT
114
52
0
11 Apr 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
78
19
0
11 Apr 2022
Representation Learning by Detecting Incorrect Location Embeddings
Sepehr Sameni
Simon Jenni
Paolo Favaro
ViT
69
5
0
10 Apr 2022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Yuxin Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
91
57
0
06 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen, 3DV
138
30
0
06 Apr 2022
Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
Polina Kirichenko
Pavel Izmailov
A. Wilson
OOD
119
339
0
06 Apr 2022
An Empirical Study of Remote Sensing Pretraining
Di Wang
Jing Zhang
Bo Du
Guisong Xia
Dacheng Tao
EDL
134
198
0
06 Apr 2022
Mixing Signals: Data Augmentation Approach for Deep Learning Based Modulation Recognition
Xin-Shun Xu
Zhuangzhi Chen
Dongwei Xu
Huaji Zhou
Shanqing Yu
Shilian Zheng
Qi Xuan
Xiaoniu Yang
76
13
0
05 Apr 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
Yongqian Li
Weizhi Ma
C. L. Philip Chen
Hao Fei
Yiqun Liu
Shaoping Ma
Yue Yang
88
11
0
05 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
144
279
0
04 Apr 2022
BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning
Zhi Hou
Baosheng Yu
Chaoyue Wang
Yibing Zhan
Dacheng Tao
ViT
112
13
0
04 Apr 2022
Revisiting a kNN-based Image Classification System with High-capacity Storage
K. Nakata
Youyang Ng
Daisuke Miyashita
A. Maki
Yu Lin
J. Deguchi
96
26
0
03 Apr 2022
Improving Vision Transformers by Revisiting High-frequency Components
Jiawang Bai
Liuliang Yuan
Shutao Xia
Shuicheng Yan
Zhifeng Li
Wen Liu
ViT
108
94
0
03 Apr 2022
POS-BERT: Point Cloud One-Stage BERT Pre-Training
Kexue Fu
Peng Gao
Shaolei Liu
Renrui Zhang
Yu Qiao
Manning Wang
3DPC
88
19
0
03 Apr 2022
UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image Segmentation
Ali Hatamizadeh
Ziyue Xu
Dong Yang
Wenqi Li
H. Roth
Daguang Xu
ViT, MedIm
91
29
0
01 Apr 2022
Self-distillation Augmented Masked Autoencoders for Histopathological Image Classification
Yang Luo
Zhineng Chen
Shengtian Zhou
Xieping Gao
77
1
0
31 Mar 2022
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
Alan Baade
Puyuan Peng
David Harwath
78
102
0
30 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
108
819
0
30 Mar 2022
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
Xiaotong Li
Yixiao Ge
Kun Yi
Zixuan Hu
Ying Shan
Ling-yu Duan
92
39
0
29 Mar 2022
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
Xiaomiao Pan
Peike Li
Zongxin Yang
Huiling Zhou
Chang Zhou
Hongxia Yang
Jingren Zhou
Yi Yang
VOS
80
12
0
29 Mar 2022
Large-scale Bilingual Language-Image Contrastive Learning
ByungSoo Ko
Geonmo Gu
VLM
112
14
0
28 Mar 2022
Mugs: A Multi-Granular Self-Supervised Learning Framework
Pan Zhou
Yichen Zhou
Chenyang Si
Weihao Yu
Teck Khim Ng
Shuicheng Yan
VLM
81
60
0
27 Mar 2022
Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
Yunjie Tian
Lingxi Xie
Jiemin Fang
Mengnan Shi
Junran Peng
Xiaopeng Zhang
Jianbin Jiao
Qi Tian
QiXiang Ye
73
20
0
27 Mar 2022
3D-OAE: Occlusion Auto-Encoders for Self-Supervised Learning on Point Clouds
Junsheng Zhou
Xin Wen
Baorui Ma
Yu-Shen Liu
Yue Gao
Yi Fang
Zhizhong Han
3DPC
82
19
0
26 Mar 2022
On the Viability of Monocular Depth Pre-training for Semantic Segmentation
Dong Lao
Fengyu Yang
Daniel Wang
Hyoungseob Park
Samuel Lu
Alex Wong
Stefano Soatto
MDE
81
0
0
26 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL, OnRL
107
123
0
25 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
252
1,220
0
23 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM, VPVLM
208
1,654
0
23 Mar 2022
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Botao Ye
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
ViT
119
485
0
22 Mar 2022
Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder
Yu Tian
Guansong Pang
Yuyuan Liu
Chong Wang
Yuanhong Chen
Fengbei Liu
Rajvinder Singh
Johan Verjans
Mengyu Wang
G. Carneiro
ViT
111
24
0
22 Mar 2022
Root-aligned SMILES: A Tight Representation for Chemical Reaction Prediction
Zipeng Zhong
Jie Song
Zunlei Feng
Tiantao Liu
Lingxiang Jia
Shaolun Yao
Min-Ying Wu
Tingjun Hou
Mingli Song
91
57
0
22 Mar 2022
Representation Uncertainty in Self-Supervised Learning as Variational Inference
Hiroki Nakamura
Masashi Okada
T. Taniguchi
82
19
0
22 Mar 2022
Test-time Adaptation with Slot-Centric Models
Mihir Prabhudesai
Anirudh Goyal
S. Paul
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gaurav Aggarwal
Thomas Kipf
Deepak Pathak
Katerina Fragkiadaki
TTA
89
10
0
21 Mar 2022
Masked Discrimination for Self-Supervised Learning on Point Clouds
Haotian Liu
Mu Cai
Yong Jae Lee
3DPC
117
171
0
21 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Jiang Cheng
Limin Wang
Gangshan Wu
VOT
123
477
0
21 Mar 2022
Upsampling Autoencoder for Self-Supervised Point Cloud Learning
Cheng Zhang
Jian Shi
X. Deng
Zizhao Wu
3DPC
100
8
0
21 Mar 2022
simCrossTrans: A Simple Cross-Modality Transfer Learning for Object Detection with ConvNets or Vision Transformers
Xiaoke Shen
I. Stamos
ViT
36
5
0
20 Mar 2022
Three things everyone should know about Vision Transformers
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Jakob Verbeek
Hervé Jégou
ViT
114
123
0
18 Mar 2022
Emerging Artificial Intelligence Applications in Spatial Transcriptomics Analysis
Yijun Li
Stefan Stanojevic
L. Garmire
48
26
0
18 Mar 2022
GATE: Graph CCA for Temporal SElf-supervised Learning for Label-efficient fMRI Analysis
Liang Peng
Nan Wang
Jie Xu
Xiao-lan Zhu
Xiaoxiao Li
75
36
0
17 Mar 2022
Object discovery and representation networks
Olivier J. Hénaff
Skanda Koppula
Evan Shelhamer
Daniel Zoran
Andrew Jaegle
Andrew Zisserman
João Carreira
Relja Arandjelović
108
89
0
16 Mar 2022
Weak Augmentation Guided Relational Self-Supervised Learning
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Changshui Zhang
Xiaogang Wang
Chang Xu
83
5
0
16 Mar 2022