ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.06377
  4. Cited By
Masked Autoencoders Are Scalable Vision Learners
v1v2v3 (latest)

Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViTTPM
ArXiv (abs)PDFHTML

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,777 papers shown
Title
Metadata-enhanced contrastive learning from retinal optical coherence
  tomography images
Metadata-enhanced contrastive learning from retinal optical coherence tomography images
R. Holland
Oliver Leingang
Hrvoje Bogunović
Sophie Riedl
L. Fritsche
...
U. Schmidt-Erfurth
S. Sivaprasad
A. Lotery
Daniel Rueckert
Martin J. Menten
78
9
0
04 Aug 2022
GPPF: A General Perception Pre-training Framework via Sparsely Activated
  Multi-Task Learning
GPPF: A General Perception Pre-training Framework via Sparsely Activated Multi-Task Learning
Benyuan Sun
Jinqiao Dai
Zihao Liang
Cong Liu
Yi Yang
Bo Bai
MoE
75
4
0
03 Aug 2022
Masked Vision and Language Modeling for Multi-modal Representation
  Learning
Masked Vision and Language Modeling for Multi-modal Representation Learning
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
92
68
0
03 Aug 2022
Learning Prior Feature and Attention Enhanced Image Inpainting
Learning Prior Feature and Attention Enhanced Image Inpainting
Chenjie Cao
Qiaole Dong
Yanwei Fu
DiffM
81
26
0
03 Aug 2022
Augmenting Vision Language Pretraining by Learning Codebook with Visual
  Semantics
Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics
Xiaoyuan Guo
Jiali Duan
C.-C. Jay Kuo
J. Gichoya
Imon Banerjee
VLM
46
1
0
31 Jul 2022
SdAE: Self-distillated Masked Autoencoder
SdAE: Self-distillated Masked Autoencoder
Yabo Chen
Yuchen Liu
Dongsheng Jiang
Xiaopeng Zhang
Wenrui Dai
H. Xiong
Qi Tian
ViT
99
73
0
31 Jul 2022
Out-of-Distribution Detection with Semantic Mismatch under Masking
Out-of-Distribution Detection with Semantic Mismatch under Masking
Yijun Yang
Ruiyuan Gao
Qiang Xu
OODD
82
28
0
31 Jul 2022
Less is More: Consistent Video Depth Estimation with Masked Frames
  Modeling
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling
Yiran Wang
Zhiyu Pan
Xingyi Li
Zhiguo Cao
Ke Xian
Jianming Zhang
66
29
0
31 Jul 2022
Improving Fine-tuning of Self-supervised Models with Contrastive
  Initialization
Improving Fine-tuning of Self-supervised Models with Contrastive Initialization
Haolin Pan
Yong Guo
Qinyi Deng
Hao-Fan Yang
Yiqun Chen
Jian Chen
SSL
71
21
0
30 Jul 2022
Masked Autoencoders As The Unified Learners For Pre-Trained Sentence
  Representation
Masked Autoencoders As The Unified Learners For Pre-Trained Sentence Representation
Alexander H. Liu
Samuel J. Yang
92
6
0
30 Jul 2022
A Survey on Masked Autoencoder for Self-supervised Learning in Vision
  and Beyond
A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond
Chaoning Zhang
Chenshuang Zhang
Junha Song
John Seon Keun Yi
Kang Zhang
In So Kweon
SSL
96
77
0
30 Jul 2022
Global-Local Self-Distillation for Visual Representation Learning
Global-Local Self-Distillation for Visual Representation Learning
Tim Lebailly
Tinne Tuytelaars
SSL
53
6
0
29 Jul 2022
Transfer Learning for Segmentation Problems: Choose the Right Encoder
  and Skip the Decoder
Transfer Learning for Segmentation Problems: Choose the Right Encoder and Skip the Decoder
Jonas Dippel
Matthias Lenga
Thomas Goerttler
Klaus Obermayer
Johannes Höhne
SSL
87
2
0
29 Jul 2022
Pro-tuning: Unified Prompt Tuning for Vision Tasks
Pro-tuning: Unified Prompt Tuning for Vision Tasks
Xing Nie
Bolin Ni
Jianlong Chang
Gaomeng Meng
Chunlei Huo
Zhaoxiang Zhang
Shiming Xiang
Qi Tian
Chunhong Pan
AAMLVPVLMVLM
122
76
0
28 Jul 2022
Progressive Voronoi Diagram Subdivision: Towards A Holistic Geometric
  Framework for Exemplar-free Class-Incremental Learning
Progressive Voronoi Diagram Subdivision: Towards A Holistic Geometric Framework for Exemplar-free Class-Incremental Learning
Chunwei Ma
Zhanghexuan Ji
Ziyun Huang
Yan Shen
Mingchen Gao
Jinhui Xu
99
1
0
28 Jul 2022
Knowing Where and What: Unified Word Block Pretraining for Document Understanding
Song Tao
Zijian Wang
Tiantian Fan
Canjie Luo
Can Huang
SSL
80
2
0
28 Jul 2022
Break and Make: Interactive Structural Understanding Using LEGO Bricks
Break and Make: Interactive Structural Understanding Using LEGO Bricks
Aaron Walsman
Muru Zhang
Klemen Kotar
Karthik Desingh
Ali Farhadi
Dieter Fox
71
10
0
27 Jul 2022
Contrastive Masked Autoencoders are Stronger Vision Learners
Contrastive Masked Autoencoders are Stronger Vision Learners
Zhicheng Huang
Xiaojie Jin
Cheng Lu
Qibin Hou
Mingg-Ming Cheng
Dongmei Fu
Xiaohui Shen
Jiashi Feng
154
154
0
27 Jul 2022
Leveraging GAN Priors for Few-Shot Part Segmentation
Leveraging GAN Priors for Few-Shot Part Segmentation
M. Han
Heliang Zheng
Chaoyue Wang
Yong Luo
Han Hu
Bo Du
89
6
0
27 Jul 2022
Boosting Point-BERT by Multi-choice Tokens
Boosting Point-BERT by Multi-choice Tokens
Kexue Fu
Ming-Dong Yuan
Manning Wang
3DPC
77
7
0
27 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
143
135
0
26 Jul 2022
Seeing Far in the Dark with Patterned Flash
Seeing Far in the Dark with Patterned Flash
Zhanghao Sun
Jian Wang
Yicheng Wu
S. Nayar
76
2
0
25 Jul 2022
Equivariance and Invariance Inductive Bias for Learning from
  Insufficient Data
Equivariance and Invariance Inductive Bias for Learning from Insufficient Data
Tan Wang
Qianru Sun
Sugiri Pranata
J. Karlekar
Hanwang Zhang
SSL
100
21
0
25 Jul 2022
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer
Yingyi Chen
Xiaoke Shen
Yahui Liu
Qinghua Tao
Johan A. K. Suykens
AAMLViT
85
24
0
25 Jul 2022
Dive into Big Model Training
Dive into Big Model Training
Qinghua Liu
Yuxiang Jiang
MoMeAI4CELRM
41
3
0
25 Jul 2022
Affective Behaviour Analysis Using Pretrained Model with Facial Priori
Affective Behaviour Analysis Using Pretrained Model with Facial Priori
Yifan Li
Haomiao Sun
Zhao Liu
Hu Han
CVBMViT
69
11
0
24 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
101
47
0
24 Jul 2022
Self-supervised contrastive learning of echocardiogram videos enables
  label-efficient cardiac disease diagnosis
Self-supervised contrastive learning of echocardiogram videos enables label-efficient cardiac disease diagnosis
G. Holste
Evangelos K. Oikonomou
Bobak J. Mortazavi
Zhangyang Wang
Rohan Khera
58
10
0
23 Jul 2022
High-Resolution Swin Transformer for Automatic Medical Image
  Segmentation
High-Resolution Swin Transformer for Automatic Medical Image Segmentation
Chen Wei
Shenghan Ren
Kaitai Guo
Haihong Hu
Jimin Liang
ViTOODMedIm
59
43
0
23 Jul 2022
EgoEnv: Human-centric environment representations from egocentric video
EgoEnv: Human-centric environment representations from egocentric video
Tushar Nagarajan
Santhosh Kumar Ramakrishnan
Ruta Desai
James M. Hillis
Kristen Grauman
EgoV
115
20
0
22 Jul 2022
Emotion Separation and Recognition from a Facial Expression by
  Generating the Poker Face with Vision Transformers
Emotion Separation and Recognition from a Facial Expression by Generating the Poker Face with Vision Transformers
Jia Li
Jian‐Hui Nie
Dan Guo
Richang Hong
Meng Wang
ViT
84
15
0
22 Jul 2022
Scale dependant layer for self-supervised nuclei encoding
Scale dependant layer for self-supervised nuclei encoding
Peter Naylor
Yao-Hung Hubert Tsai
Marick Laé
Makoto Yamada
SSL
87
0
0
22 Jul 2022
MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations
  of Behavior
MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior
Jennifer J. Sun
Markus Marks
Andrew Ulmer
Dipam Chakraborty
Brian Geuther
...
Joseph Parker
Pietro Perona
Yisong Yue
K. Branson
Ann Kennedy
40
9
0
21 Jul 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
Yaqian Liang
Shanshan Zhao
Baosheng Yu
Jing Zhang
Fazhi He
ViT
93
39
0
20 Jul 2022
Deep Preconditioners and their application to seismic wavefield
  processing
Deep Preconditioners and their application to seismic wavefield processing
M. Ravasi
73
2
0
20 Jul 2022
Unsupervised Industrial Anomaly Detection via Pattern Generative and
  Contrastive Networks
Unsupervised Industrial Anomaly Detection via Pattern Generative and Contrastive Networks
Jianfeng Huang
Chenyang Li
Yimin Lin
Kai Wang
ViT
60
1
0
20 Jul 2022
Invariant Feature Learning for Generalized Long-Tailed Classification
Invariant Feature Learning for Generalized Long-Tailed Classification
Kaihua Tang
Mingyuan Tao
Jiaxin Qi
Zhenguang Liu
Hanwang Zhang
VLM
96
56
0
19 Jul 2022
Multi-Task Learning Framework for Emotion Recognition in-the-wild
Multi-Task Learning Framework for Emotion Recognition in-the-wild
Tenggan Zhang
Chuanhe Liu
Xiaolong Liu
Yuchen Liu
Liyu Meng
Lei Sun
Wenqiang Jiang
Fengyuan Zhang
Jinming Zhao
Qin Jin
CVBM
86
19
0
19 Jul 2022
GAFX: A General Audio Feature eXtractor
GAFX: A General Audio Feature eXtractor
Zhaoyang Bu
Han Zhang
Xiaohu Zhu
54
0
0
19 Jul 2022
Label2Label: A Language Modeling Framework for Multi-Attribute Learning
Label2Label: A Language Modeling Framework for Multi-Attribute Learning
Wanhua Li
Zhexuan Cao
Jianjiang Feng
Jie Zhou
Jiwen Lu
VLM
99
27
0
18 Jul 2022
Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial
  Patches
Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial Patches
Yuanzheng Ci
Chen Lin
Lei Bai
Wanli Ouyang
SSL
74
26
0
17 Jul 2022
Stroke-Based Autoencoders: Self-Supervised Learners for Efficient
  Zero-Shot Chinese Character Recognition
Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition
Zong Chen
Wen-Chi Yang
Xin Li
89
8
0
17 Jul 2022
E-NeRV: Expedite Neural Video Representation with Disentangled
  Spatial-Temporal Context
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context
Zizhang Li
Mengmeng Wang
Huaijin Pi
Kechun Xu
Jianbiao Mei
Yong Liu
93
75
0
17 Jul 2022
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral
  Satellite Imagery
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery
Yezhen Cong
Samarth Khanna
Chenlin Meng
Patrick Liu
Erik Rozi
Yutong He
Marshall Burke
David B. Lobell
Stefano Ermon
ViT
102
276
0
17 Jul 2022
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video
  Anomaly Detection
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection
Antonio Bărbălău
Radu Tudor Ionescu
Mariana-Iuliana Georgescu
J. Dueholm
B. Ramachandra
Kamal Nasrollahi
Fahad Shahbaz Khan
T. Moeslund
M. Shah
ViT
105
72
0
16 Jul 2022
Multi-Modal Unsupervised Pre-Training for Surgical Operating Room
  Workflow Analysis
Multi-Modal Unsupervised Pre-Training for Surgical Operating Room Workflow Analysis
Muhammad Abdullah Jamal
Omid Mohareri
20
7
0
16 Jul 2022
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas
Zizheng Huang
Haoxing Chen
Ziqi Wen
Chao Zhang
Huaxiong Li
Bojuan Wang
Chunlin Chen
62
10
0
16 Jul 2022
Masked Spatial-Spectral Autoencoders Are Excellent Hyperspectral
  Defenders
Masked Spatial-Spectral Autoencoders Are Excellent Hyperspectral Defenders
Jiahao Qi
Z. Gong
Xingyue Liu
Kangcheng Bin
Chen Chen
Yongqiang Li
Wei Xue
Yu Zhang
P. Zhong
AAML
81
6
0
16 Jul 2022
HOME: High-Order Mixed-Moment-based Embedding for Representation
  Learning
HOME: High-Order Mixed-Moment-based Embedding for Representation Learning
Chuang Niu
Ge Wang
SSL
71
4
0
15 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
88
25
0
15 Jul 2022
Previous
123...878889...949596
Next