ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.06377
  4. Cited By
Masked Autoencoders Are Scalable Vision Learners

Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViT
    TPM
ArXivPDFHTML

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,611 papers shown
Title
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
54
655
0
16 Dec 2021
Rethinking Nearest Neighbors for Visual Classification
Rethinking Nearest Neighbors for Visual Classification
Menglin Jia
Bor-Chun Chen
Zuxuan Wu
Claire Cardie
Serge J. Belongie
Ser-Nam Lim
SSL
30
10
0
15 Dec 2021
Self-Supervised Modality-Aware Multiple Granularity Pre-Training for
  RGB-Infrared Person Re-Identification
Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification
Lin Wan
Qianyan Jing
Zongyuan Sun
Chuan Zhang
Zhihang Li
Yehansen Chen
SSL
12
5
0
12 Dec 2021
PE-former: Pose Estimation Transformer
PE-former: Pose Estimation Transformer
Paschalis Panteleris
Antonis Argyros
ViT
19
12
0
09 Dec 2021
Semi-Supervised Medical Image Segmentation via Cross Teaching between
  CNN and Transformer
Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Xiangde Luo
Minhao Hu
Tao Song
Guotai Wang
Shaoting Zhang
ViT
MedIm
22
202
0
09 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen
  Viewpoints
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
24
17
0
07 Dec 2021
E$^2$(GO)MOTION: Motion Augmented Event Stream for Egocentric Action
  Recognition
E2^22(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
Chiara Plizzari
M. Planamente
Gabriele Goletto
Marco Cannici
Emanuele Gusso
Matteo Matteucci
Barbara Caputo
EgoV
15
56
0
07 Dec 2021
Label-Efficient Semantic Segmentation with Diffusion Models
Label-Efficient Semantic Segmentation with Diffusion Models
Dmitry Baranchuk
Ivan Rubachev
A. Voynov
Valentin Khrulkov
Artem Babenko
DiffM
VLM
195
513
0
06 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence
  Model Tackles All SMAC Tasks
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
24
38
0
06 Dec 2021
A Survey of Deep Learning for Low-Shot Object Detection
A Survey of Deep Learning for Low-Shot Object Detection
Qihan Huang
Haofei Zhang
Mengqi Xue
Jie Song
Mingli Song
ObjD
33
18
0
06 Dec 2021
BEVT: BERT Pretraining of Video Transformers
BEVT: BERT Pretraining of Video Transformers
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Yu-Gang Jiang
Luowei Zhou
Lu Yuan
ViT
25
203
0
02 Dec 2021
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
VLM
CLIP
61
551
0
02 Dec 2021
PTCT: Patches with 3D-Temporal Convolutional Transformer Network for
  Precipitation Nowcasting
PTCT: Patches with 3D-Temporal Convolutional Transformer Network for Precipitation Nowcasting
Ziao Yang
Xiangru Yang
Qifeng Lin
ViT
AI4TS
13
4
0
02 Dec 2021
SwinTrack: A Simple and Strong Baseline for Transformer Tracking
SwinTrack: A Simple and Strong Baseline for Transformer Tracking
Liting Lin
Heng Fan
Zhipeng Zhang
Yong-mei Xu
Haibin Ling
ViT
23
301
0
02 Dec 2021
MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning
MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning
Sara Atito
Muhammad Awais
Ammarah Farooq
Zhenhua Feng
J. Kittler
15
17
0
30 Nov 2021
EdiBERT, a generative model for image editing
EdiBERT, a generative model for image editing
Thibaut Issenhuth
Ugo Tanielian
Jérémie Mary
David Picard
DiffM
24
12
0
30 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point
  Modeling
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
20
652
0
29 Nov 2021
Natural Scene Text Editing Based on AI
Natural Scene Text Editing Based on AI
Yujie Zhang
15
0
0
26 Nov 2021
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
37
238
0
24 Nov 2021
ViCE: Improving Dense Representation Learning by Superpixelization and
  Contrasting Cluster Assignment
ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment
Robin Karlsson
Tomoki Hayashi
Keisuke Fujii
Alexander Carballo
Kento Ohtani
K. Takeda
SSL
34
4
0
24 Nov 2021
RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
Yufei Xu
Qiming Zhang
Jing Zhang
Dacheng Tao
SSL
19
18
0
24 Nov 2021
Learning Representation for Clustering via Prototype Scattering and
  Positive Sampling
Learning Representation for Clustering via Prototype Scattering and Positive Sampling
Zhizhong Huang
Jie Chen
Junping Zhang
Hongming Shan
16
87
0
23 Nov 2021
RIO: Rotation-equivariance supervised learning of robust inertial
  odometry
RIO: Rotation-equivariance supervised learning of robust inertial odometry
Caifa Zhou
Xiya Cao
Dandan Zeng
Yongliang Wang
OOD
SSL
17
21
0
23 Nov 2021
Benchmarking Detection Transfer Learning with Vision Transformers
Benchmarking Detection Transfer Learning with Vision Transformers
Yanghao Li
Saining Xie
Xinlei Chen
Piotr Dollar
Kaiming He
Ross B. Girshick
12
164
0
22 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,633
0
15 Nov 2021
A Survey of Visual Transformers
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
69
330
0
11 Nov 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Yinghui Li
Li Tao
Dun Liang
Haitao Zheng
77
96
0
07 Nov 2021
Towards the Generalization of Contrastive Self-Supervised Learning
Towards the Generalization of Contrastive Self-Supervised Learning
Weiran Huang
Mingyang Yi
Xuyang Zhao
Zihao Jiang
SSL
21
105
0
01 Nov 2021
GenURL: A General Framework for Unsupervised Representation Learning
GenURL: A General Framework for Unsupervised Representation Learning
Siyuan Li
Zicheng Liu
Z. Zang
Di Wu
Zhiyuan Chen
Stan Z. Li
OOD
3DGS
OffRL
26
9
0
27 Oct 2021
Towards Language-guided Visual Recognition via Dynamic Convolutions
Towards Language-guided Visual Recognition via Dynamic Convolutions
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yongjian Wu
Yue Gao
Rongrong Ji
ObjD
25
19
0
17 Oct 2021
Self-Supervised Learning by Estimating Twin Class Distributions
Self-Supervised Learning by Estimating Twin Class Distributions
Feng Wang
Tao Kong
Rufeng Zhang
Huaping Liu
Hang Li
SSL
43
16
0
14 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
31
163
0
11 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are
  enough:Large Scale Audio Understanding without Transformers/ Convolutions/
  BERTs/ Mixers/ Attention/ RNNs or ....
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
24
2
0
07 Oct 2021
SORNet: Spatial Object-Centric Representations for Sequential
  Manipulation
SORNet: Spatial Object-Centric Representations for Sequential Manipulation
Wentao Yuan
Chris Paxton
Karthik Desingh
D. Fox
3DPC
139
72
0
08 Sep 2021
nnFormer: Interleaved Transformer for Volumetric Segmentation
nnFormer: Interleaved Transformer for Volumetric Segmentation
Hong-Yu Zhou
J. Guo
Yinghao Zhang
Lequan Yu
Liansheng Wang
Yizhou Yu
ViT
MedIm
24
306
0
07 Sep 2021
Towards Out-Of-Distribution Generalization: A Survey
Towards Out-Of-Distribution Generalization: A Survey
Jiashuo Liu
Zheyan Shen
Yue He
Xingxuan Zhang
Renzhe Xu
Han Yu
Peng Cui
CML
OOD
29
515
0
31 Aug 2021
When Do Contrastive Learning Signals Help Spatio-Temporal Graph
  Forecasting?
When Do Contrastive Learning Signals Help Spatio-Temporal Graph Forecasting?
Xu Liu
Yuxuan Liang
Chao Huang
Yu Zheng
Bryan Hooi
Roger Zimmermann
AI4TS
13
60
0
26 Aug 2021
How Self-Supervised Learning Can be Used for Fine-Grained Head Pose
  Estimation?
How Self-Supervised Learning Can be Used for Fine-Grained Head Pose Estimation?
Mahdi Pourmirzaei
Farzaneh Esmaili
G. Montazer
Sasan Karamizadeh
Seyedehsamaneh Shojaeilangari
19
0
0
10 Aug 2021
A Low Rank Promoting Prior for Unsupervised Contrastive Learning
A Low Rank Promoting Prior for Unsupervised Contrastive Learning
Yu Wang
Jingyang Lin
Qi Cai
Yingwei Pan
Ting Yao
Hongyang Chao
Tao Mei
SSL
17
16
0
05 Aug 2021
Few Shots Are All You Need: A Progressive Few Shot Learning Approach for
  Low Resource Handwritten Text Recognition
Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition
Mohamed Ali Souibgui
A. Forns
Yousri Kessentini
Beta Megyesi
17
20
0
21 Jul 2021
Continual Contrastive Learning for Image Classification
Continual Contrastive Learning for Image Classification
Zhiwei Lin
Yongtao Wang
Hongxiang Lin
SSL
CLL
17
13
0
05 Jul 2021
VOLO: Vision Outlooker for Visual Recognition
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
41
313
0
24 Jun 2021
Bootstrap Representation Learning for Segmentation on Medical Volumes
  and Sequences
Bootstrap Representation Learning for Segmentation on Medical Volumes and Sequences
Zejian Chen
Wei Zhuo
Tianfu Wang
Wufeng Xue
Dong Ni
19
5
0
23 Jun 2021
Neural Optimization Kernel: Towards Robust Deep Learning
Neural Optimization Kernel: Towards Robust Deep Learning
Yueming Lyu
Ivor Tsang
14
1
0
11 Jun 2021
Rethinking Architecture Design for Tackling Data Heterogeneity in
  Federated Learning
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning
Liangqiong Qu
Yuyin Zhou
Paul Pu Liang
Yingda Xia
Feifei Wang
Ehsan Adeli
L. Fei-Fei
D. Rubin
FedML
AI4CE
19
173
0
10 Jun 2021
Semantic-Aware Contrastive Learning for Multi-object Medical Image
  Segmentation
Semantic-Aware Contrastive Learning for Multi-object Medical Image Segmentation
Ho Hin Lee
Yucheng Tang
Qi Yang
Xin Yu
Shunxing Bao
L. Cai
Lucas W. Remedios
Bennett A. Landman
Yuankai Huo
21
8
0
03 Jun 2021
Exploring the Diversity and Invariance in Yourself for Visual
  Pre-Training Task
Exploring the Diversity and Invariance in Yourself for Visual Pre-Training Task
Longhui Wei
Lingxi Xie
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
19
3
0
01 Jun 2021
Backdoor Attacks on Self-Supervised Learning
Backdoor Attacks on Self-Supervised Learning
Aniruddha Saha
Ajinkya Tejankar
Soroush Abbasi Koohpayegani
Hamed Pirsiavash
SSL
AAML
22
100
0
21 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
303
5,773
0
29 Apr 2021
SiT: Self-supervised vIsion Transformer
SiT: Self-supervised vIsion Transformer
Sara Atito Ali Ahmed
Muhammad Awais
J. Kittler
ViT
31
139
0
08 Apr 2021
Previous
123...919293
Next