ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.06377
  4. Cited By
Masked Autoencoders Are Scalable Vision Learners
v1v2v3 (latest)

Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViTTPM
ArXiv (abs)PDFHTML

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,778 papers shown
Title
Anatomical Invariance Modeling and Semantic Alignment for
  Self-supervised Learning in 3D Medical Image Analysis
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis
Yankai Jiang
Ming Sun
Heng Guo
Xiaoyu Bai
K. Yan
Le Lu
Minfeng Xu
MedIm
130
22
0
11 Feb 2023
Leveraging Inpainting for Single-Image Shadow Removal
Leveraging Inpainting for Single-Image Shadow Removal
Xiaoguang Li
Qing Guo
R. Abdelfattah
Di Lin
Wei Feng
Ivor Tsang
Song Wang
112
26
0
10 Feb 2023
Generalized Video Anomaly Event Detection: Systematic Taxonomy and
  Comparison of Deep Models
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models
Yang Liu
Dingkang Yang
Yan Wang
Jing Liu
Jun Liu
Azzedine Boukerche
Peng Sun
Liang Song
151
97
0
10 Feb 2023
BEST: BERT Pre-Training for Sign Language Recognition with Coupling
  Tokenization
BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization
Weichao Zhao
Hezhen Hu
Wen-gang Zhou
Jiaxin Shi
Houqiang Li
SLR
74
33
0
10 Feb 2023
Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation
  with Conditional Alignment and Reweighting
Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting
Viraj Prabhu
David Acuna
Andy Liao
Rafid Mahmood
M. Law
Judy Hoffman
Sanja Fidler
James Lucas
87
12
0
09 Feb 2023
Towards Geospatial Foundation Models via Continual Pretraining
Towards Geospatial Foundation Models via Continual Pretraining
Matías Mendieta
Boran Han
Xingjian Shi
Yi Zhu
Chen Chen
VLMAI4CE
145
73
0
09 Feb 2023
An Investigation into Pre-Training Object-Centric Representations for
  Reinforcement Learning
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
Jaesik Yoon
Yi-Fu Wu
Heechul Bae
Sungjin Ahn
OCL
111
44
0
09 Feb 2023
DeepVATS: Deep Visual Analytics for Time Series
DeepVATS: Deep Visual Analytics for Time Series
V. Rodríguez-Fernández
David Montalvo
F. Piccialli
Grzegorz J. Nalepa
David Camacho
AI4TS
55
7
0
08 Feb 2023
Evaluating Self-Supervised Learning via Risk Decomposition
Evaluating Self-Supervised Learning via Risk Decomposition
Yann Dubois
Tatsunori Hashimoto
Percy Liang
77
9
0
06 Feb 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
AIM: Adapting Image Models for Efficient Video Action Recognition
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
Chong Chen
Mu Li
ViT
148
157
0
06 Feb 2023
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery
João Cartucho
Alistair Weld
Samyakh Tukra
Haozheng Xu
Hiroki Matsuzaki
...
B. Silva
Estevão Lima
João L. Vilaça
Sandro Queiros
Stamatia Giannarou
96
11
0
06 Feb 2023
RLSbench: Domain Adaptation Under Relaxed Label Shift
RLSbench: Domain Adaptation Under Relaxed Label Shift
Saurabh Garg
Nick Erickson
James Sharpnack
Alexander J. Smola
Sivaraman Balakrishnan
Zachary Chase Lipton
VLM
108
33
0
06 Feb 2023
Rethinking Out-of-distribution (OOD) Detection: Masked Image Modeling is
  All You Need
Rethinking Out-of-distribution (OOD) Detection: Masked Image Modeling is All You Need
Jingyao Li
Pengguang Chen
Shaozuo Yu
Zexin He
Shu Liu
Jiaya Jia
OODD
102
46
0
06 Feb 2023
Single Cells Are Spatial Tokens: Transformers for Spatial Transcriptomic
  Data Imputation
Single Cells Are Spatial Tokens: Transformers for Spatial Transcriptomic Data Imputation
Haifang Wen
Wenzhuo Tang
Wei Jin
Jiayuan Ding
Renming Liu
Xinnan Dai
Feng Shi
Lulu Shang
Jiliang Tang
Yuying Xie
66
10
0
06 Feb 2023
Multi-View Masked World Models for Visual Robotic Manipulation
Multi-View Masked World Models for Visual Robotic Manipulation
Younggyo Seo
Junsup Kim
Stephen James
Kimin Lee
Jinwoo Shin
Pieter Abbeel
VGen
107
60
0
05 Feb 2023
Revisiting Discriminative vs. Generative Classifiers: Theory and
  Implications
Revisiting Discriminative vs. Generative Classifiers: Theory and Implications
Chenyu Zheng
Guoqiang Wu
Fan Bao
Yue Cao
Chongxuan Li
Jun Zhu
BDL
88
30
0
05 Feb 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided
  by Generative Pretraining
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
154
131
0
05 Feb 2023
MOMA:Distill from Self-Supervised Teachers
MOMA:Distill from Self-Supervised Teachers
Yuan Yao
Nandakishor Desai
M. Palaniswami
103
2
0
04 Feb 2023
Representation Deficiency in Masked Language Modeling
Representation Deficiency in Masked Language Modeling
Yu Meng
Jitin Krishnan
Sinong Wang
Qifan Wang
Yuning Mao
Han Fang
Marjan Ghazvininejad
Jiawei Han
Luke Zettlemoyer
149
7
0
04 Feb 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedImViT
98
155
0
03 Feb 2023
AIROGS: Artificial Intelligence for RObust Glaucoma Screening Challenge
AIROGS: Artificial Intelligence for RObust Glaucoma Screening Challenge
Coen de Vente
Koen A. Vermeer
Nicolas Jaccard
He Wang
Hongyi Sun
...
Abdul Qayyum
Imran Razzak
Bram van Ginneken
H. Lemij
Clara I. Sánchez
129
57
0
03 Feb 2023
Rethinking Semi-Supervised Medical Image Segmentation: A
  Variance-Reduction Perspective
Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective
Chenyu You
Weicheng Dai
Yifei Min
Fenglin Liu
David Clifton
S. Kevin Zhou
Lawrence H. Staib
James S Duncan
104
71
0
03 Feb 2023
Blockwise Self-Supervised Learning at Scale
Blockwise Self-Supervised Learning at Scale
Shoaib Ahmed Siddiqui
David M. Krueger
Yann LeCun
Stéphane Deny
SSL
90
16
0
03 Feb 2023
Self-Supervised Relation Alignment for Scene Graph Generation
Self-Supervised Relation Alignment for Scene Graph Generation
Bicheng Xu
Renjie Liao
Leonid Sigal
69
0
0
02 Feb 2023
Energy-Inspired Self-Supervised Pretraining for Vision Models
Energy-Inspired Self-Supervised Pretraining for Vision Models
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
61
8
0
02 Feb 2023
A Survey on Efficient Training of Transformers
A Survey on Efficient Training of Transformers
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
130
49
0
02 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial
  Defense
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAMLVLM
110
4
0
02 Feb 2023
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
Jiaxiang Dong
Haixu Wu
Haoran Zhang
Li Zhang
Jianmin Wang
Mingsheng Long
AI4TS
142
94
0
02 Feb 2023
ADAPT: Action-aware Driving Caption Transformer
ADAPT: Action-aware Driving Caption Transformer
Bu Jin
Xinyi Liu
Yupeng Zheng
Pengfei Li
Hao Zhao
Tong Zhang
Yuhang Zheng
Guyue Zhou
Jingjing Liu
136
74
0
01 Feb 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image
  and Video
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Haiyang Xu
Qinghao Ye
Mingshi Yan
Yaya Shi
Jiabo Ye
...
Guohai Xu
Ji Zhang
Songfang Huang
Feiran Huang
Jingren Zhou
MLLMVLMMoE
116
171
0
01 Feb 2023
Towards Label-Efficient Incremental Learning: A Survey
Towards Label-Efficient Incremental Learning: A Survey
Mert Kilickaya
Joost van de Weijer
Yuki M. Asano
CLL
94
4
0
01 Feb 2023
What Makes Good Examples for Visual In-Context Learning?
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLMVPVLMVLMLRM
106
117
0
31 Jan 2023
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View
  Semantic Consistency
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency
Pengzhen Ren
Changlin Li
Hang Xu
Yi Zhu
Guangrun Wang
Jian-zhuo Liu
Xiaojun Chang
Xiaodan Liang
106
45
0
31 Jan 2023
Continuous Spatiotemporal Transformers
Continuous Spatiotemporal Transformers
Antonio H. O. Fonseca
E. Zappala
J. O. Caro
David van Dijk
85
8
0
31 Jan 2023
Advancing Radiograph Representation Learning with Masked Record Modeling
Advancing Radiograph Representation Learning with Masked Record Modeling
Hong-Yu Zhou
Chenyu Lian
Lian-cheng Wang
Yizhou Yu
MedIm
112
59
0
30 Jan 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion
  Models
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
238
344
0
30 Jan 2023
Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive
  Smoothing
Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive Smoothing
Yatong Bai
Brendon G. Anderson
Aerin Kim
Somayeh Sojoudi
AAML
129
19
0
29 Jan 2023
Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study
  on Image Restoration
Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study on Image Restoration
Peng Qiao
Sidun Liu
Tao Sun
Ke Yang
Y. Dou
ViT
81
1
0
29 Jan 2023
Neural Relation Graph: A Unified Framework for Identifying Label Noise
  and Outlier Data
Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data
Jang-Hyun Kim
Sangdoo Yun
Hyun Oh Song
85
19
0
29 Jan 2023
A Closer Look at Few-shot Classification Again
A Closer Look at Few-shot Classification Again
Xu Luo
Hao Wu
Ji Zhang
Lianli Gao
Jing Xu
Jingkuan Song
94
52
0
28 Jan 2023
Deciphering the Projection Head: Representation Evaluation
  Self-supervised Learning
Deciphering the Projection Head: Representation Evaluation Self-supervised Learning
Jiajun Ma
Tianyang Hu
Wei Cao
81
8
0
28 Jan 2023
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
Liya Wang
A. Tien
141
9
0
28 Jan 2023
Cross-Architectural Positive Pairs improve the effectiveness of
  Self-Supervised Learning
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
P. Singh
Jacopo Cirrone
SSL
119
0
0
27 Jan 2023
Understanding Self-Supervised Pretraining with Part-Aware Representation
  Learning
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning
Jie Zhu
Jiyang Qi
Mingyu Ding
Xiaokang Chen
Ping Luo
Xinggang Wang
Wenyu Liu
Leye Wang
Jingdong Wang
SSL
106
8
0
27 Jan 2023
Deep Industrial Image Anomaly Detection: A Survey
Deep Industrial Image Anomaly Detection: A Survey
Jiaqi Liu
Guoyang Xie
Jingbao Wang
Shangwen Li
Chengjie Wang
Feng Zheng
Yaochu Jin
136
195
0
27 Jan 2023
Cut and Learn for Unsupervised Object Detection and Instance
  Segmentation
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
131
173
0
26 Jan 2023
Discovering and Mitigating Visual Biases through Keyword Explanation
Discovering and Mitigating Visual Biases through Keyword Explanation
Younghyun Kim
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jaeho Lee
Jinwoo Shin
157
34
0
26 Jan 2023
Compact Transformer Tracker with Correlative Masked Modeling
Compact Transformer Tracker with Correlative Masked Modeling
Zikai Song
Run Luo
Junqing Yu
Yi-Ping Phoebe Chen
Wei Yang
ViT
68
61
0
26 Jan 2023
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation
  Invariant Transformation
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation
Ningxin Zheng
Huiqiang Jiang
Quan Zhang
Zhenhua Han
Yuqing Yang
...
Fan Yang
Chengruidong Zhang
Lili Qiu
Mao Yang
Lidong Zhou
102
29
0
26 Jan 2023
A Method For Eliminating Contour Errors In Self-Encoder Reconstructed
  Images
A Method For Eliminating Contour Errors In Self-Encoder Reconstructed Images
Yonggang Li
Hao Zhang
125
0
0
25 Jan 2023
Previous
123...767778...949596
Next