ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.06377
  4. Cited By
Masked Autoencoders Are Scalable Vision Learners

Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViT
    TPM
ArXivPDFHTML

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,611 papers shown
Title
Automatic segmentation of meniscus based on MAE self-supervision and
  point-line weak supervision paradigm
Automatic segmentation of meniscus based on MAE self-supervision and point-line weak supervision paradigm
Yuhan Xie
Kexin Jiang
Zhiyong Zhang
Shaolong Chen
Xiaodong Zhang
Changzhen Qiu
16
1
0
07 May 2022
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision
  Transformers
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers
Junting Pan
Adrian Bulat
Fuwen Tan
Xiatian Zhu
L. Dudziak
Hongsheng Li
Georgios Tzimiropoulos
Brais Martínez
ViT
23
180
0
06 May 2022
MINI: Mining Implicit Novel Instances for Few-Shot Object Detection
MINI: Mining Implicit Novel Instances for Few-Shot Object Detection
Yuhang Cao
Jiaqi Wang
Yiqi Lin
Dahua Lin
ObjD
14
5
0
06 May 2022
BlobGAN: Spatially Disentangled Scene Representations
BlobGAN: Spatially Disentangled Scene Representations
Dave Epstein
Taesung Park
Richard Y. Zhang
Eli Shechtman
Alexei A. Efros
GAN
SSL
OCL
27
42
0
05 May 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLM
CLIP
OffRL
57
1,255
0
04 May 2022
Better plain ViT baselines for ImageNet-1k
Better plain ViT baselines for ImageNet-1k
Lucas Beyer
Xiaohua Zhai
Alexander Kolesnikov
ViT
VLM
22
111
0
03 May 2022
Data Determines Distributional Robustness in Contrastive Language Image
  Pre-training (CLIP)
Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)
Alex Fang
Gabriel Ilharco
Mitchell Wortsman
Yu Wan
Vaishaal Shankar
Achal Dave
Ludwig Schmidt
VLM
OOD
20
138
0
03 May 2022
Engineering flexible machine learning systems by traversing
  functionally-invariant paths
Engineering flexible machine learning systems by traversing functionally-invariant paths
G. Raghavan
Bahey Tharwat
S. N. Hari
Dhruvil Satani
Matt Thomson
OOD
AI4CE
11
4
0
30 Apr 2022
StorSeismic: A new paradigm in deep learning for seismic processing
StorSeismic: A new paradigm in deep learning for seismic processing
R. Harsuko
T. Alkhalifah
16
37
0
30 Apr 2022
Unsupervised Contrastive Learning based Transformer for Lung Nodule
  Detection
Unsupervised Contrastive Learning based Transformer for Lung Nodule Detection
Chuang Niu
Ge Wang
ViT
MedIm
14
36
0
30 Apr 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model
  Pretraining
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
Yuting Gao
Jinfeng Liu
Zihan Xu
Jinchao Zhang
Ke Li
Rongrong Ji
Chunhua Shen
VLM
CLIP
25
100
0
29 Apr 2022
CogView2: Faster and Better Text-to-Image Generation via Hierarchical
  Transformers
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
Ming Ding
Wendi Zheng
Wenyi Hong
Jie Tang
VLM
18
321
0
28 Apr 2022
AE-NeRF: Auto-Encoding Neural Radiance Fields for 3D-Aware Object
  Manipulation
AE-NeRF: Auto-Encoding Neural Radiance Fields for 3D-Aware Object Manipulation
Mira Kim
Jaehoon Ko
Kyusun Cho
J. Choi
Daewon Choi
Seung Wook Kim
20
4
0
28 Apr 2022
Towards Flexible Inference in Sequential Decision Problems via
  Bidirectional Transformers
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Micah Carroll
Jessy Lin
Orr Paradise
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
21
10
0
28 Apr 2022
Self-Supervised Learning of Object Parts for Semantic Segmentation
Self-Supervised Learning of Object Parts for Semantic Segmentation
A. Ziegler
Yuki M. Asano
SSL
OCL
21
101
0
27 Apr 2022
Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Dading Chong
Helin Wang
Peilin Zhou
Qingcheng Zeng
31
65
0
27 Apr 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
22
512
0
26 Apr 2022
Understanding The Robustness in Vision Transformers
Understanding The Robustness in Vision Transformers
Daquan Zhou
Zhiding Yu
Enze Xie
Chaowei Xiao
Anima Anandkumar
Jiashi Feng
J. Álvarez
ViT
14
185
0
26 Apr 2022
MILES: Visual BERT Pre-training with Injected Language Semantics for
  Video-text Retrieval
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Yuying Ge
Yixiao Ge
Xihui Liu
Alex Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
VLM
9
43
0
26 Apr 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning
  General-purpose Audio Representation
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
24
65
0
26 Apr 2022
Deeper Insights into the Robustness of ViTs towards Common Corruptions
Deeper Insights into the Robustness of ViTs towards Common Corruptions
Rui Tian
Zuxuan Wu
Qi Dai
Han Hu
Yu-Gang Jiang
ViT
AAML
16
4
0
26 Apr 2022
Masked Image Modeling Advances 3D Medical Image Analysis
Masked Image Modeling Advances 3D Medical Image Analysis
Zekai Chen
Devansh Agarwal
Kshitij Aggarwal
Wiem Safta
Samit Hirawat
V. Sethuraman
Mariann Micsinai Balan
Kevin Brown
14
69
0
25 Apr 2022
A Survey on Unsupervised Anomaly Detection Algorithms for Industrial
  Images
A Survey on Unsupervised Anomaly Detection Algorithms for Industrial Images
Yajie Cui
Zhaoxiang Liu
Shiguo Lian
OOD
DRL
30
42
0
24 Apr 2022
A Mask-Based Adversarial Defense Scheme
A Mask-Based Adversarial Defense Scheme
Weizhen Xu
Chenyi Zhang
Fangzhen Zhao
Liangda Fang
AAML
20
3
0
21 Apr 2022
Progressive Training of A Two-Stage Framework for Video Restoration
Progressive Training of A Two-Stage Framework for Video Restoration
Mei Zheng
Qunliang Xing
Minglang Qiao
Mai Xu
Lai Jiang
Huaida Liu
Ying Chen
30
9
0
21 Apr 2022
A Masked Image Reconstruction Network for Document-level Relation
  Extraction
A Masked Image Reconstruction Network for Document-level Relation Extraction
L. Zhang
Yidong Cheng
14
2
0
21 Apr 2022
Neuro-BERT: Rethinking Masked Autoencoding for Self-supervised
  Neurological Pretraining
Neuro-BERT: Rethinking Masked Autoencoding for Self-supervised Neurological Pretraining
Di Wu
Siyuan Li
Jie Yang
Mohamad Sawan
SSL
28
14
0
20 Apr 2022
Disentangling Spatial-Temporal Functional Brain Networks via
  Twin-Transformers
Disentangling Spatial-Temporal Functional Brain Networks via Twin-Transformers
Xiao-Wen Yu
Lu Zhang
Lin Zhao
Yanjun Lyu
Tianming Liu
Dajiang Zhu
23
10
0
20 Apr 2022
Diverse Imagenet Models Transfer Better
Diverse Imagenet Models Transfer Better
Niv Nayman
A. Golbert
Asaf Noy
Tan Ping
Lihi Zelnik-Manor
22
0
0
19 Apr 2022
Missingness Bias in Model Debugging
Missingness Bias in Model Debugging
Saachi Jain
Hadi Salman
E. Wong
Pengchuan Zhang
Vibhav Vineet
Sai H. Vemprala
A. Madry
22
37
0
19 Apr 2022
SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic
  Segmentation
SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic Segmentation
Binhui Xie
Shuang Li
Mingjiang Li
Chi Harold Liu
Gao Huang
Guoren Wang
19
146
0
19 Apr 2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented
  Visual Models
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
Chunyuan Li
Haotian Liu
Liunian Harold Li
Pengchuan Zhang
J. Aneja
...
Ping Jin
Houdong Hu
Zicheng Liu
Yong Jae Lee
Jianfeng Gao
24
144
0
19 Apr 2022
The Devil is in the Frequency: Geminated Gestalt Autoencoder for
  Self-Supervised Visual Pre-Training
The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Hao Liu
Xinghua Jiang
Xin Li
Antai Guo
Deqiang Jiang
Bo Ren
24
36
0
18 Apr 2022
Empirical Evaluation and Theoretical Analysis for Representation
  Learning: A Survey
Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey
Kento Nozawa
Issei Sato
AI4TS
14
4
0
18 Apr 2022
ResT V2: Simpler, Faster and Stronger
ResT V2: Simpler, Faster and Stronger
Qing-Long Zhang
Yubin Yang
ViT
25
25
0
15 Apr 2022
Masked Siamese Networks for Label-Efficient Learning
Masked Siamese Networks for Label-Efficient Learning
Mahmoud Assran
Mathilde Caron
Ishan Misra
Piotr Bojanowski
Florian Bordes
Pascal Vincent
Armand Joulin
Michael G. Rabbat
Nicolas Ballas
SSL
11
311
0
14 Apr 2022
DeiT III: Revenge of the ViT
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
37
388
0
14 Apr 2022
Residual Swin Transformer Channel Attention Network for Image
  Demosaicing
Residual Swin Transformer Channel Attention Network for Image Demosaicing
W. Xing
K. Egiazarian
ViT
19
14
0
14 Apr 2022
WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic
  Segmentation for Lung Adenocarcinoma
WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma
Chu Han
Xipeng Pan
Lixu Yan
Huan Lin
Bingbing Li
...
Chengda Lu
Xin Chen
C. Liang
Qingling Zhang
Zaiyi Liu
25
26
0
13 Apr 2022
Self-supervised Vision Transformers for Joint SAR-optical Representation
  Learning
Self-supervised Vision Transformers for Joint SAR-optical Representation Learning
Yi Wang
C. Albrecht
Xiaoxiang Zhu
ViT
19
49
0
11 Apr 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning
  from Pixels
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
11
19
0
11 Apr 2022
Representation Learning by Detecting Incorrect Location Embeddings
Representation Learning by Detecting Incorrect Location Embeddings
Sepehr Sameni
Simon Jenni
Paolo Favaro
ViT
21
4
0
10 Apr 2022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for
  Object Detection
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Yuxin Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
16
55
0
06 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen
3DV
13
29
0
06 Apr 2022
Last Layer Re-Training is Sufficient for Robustness to Spurious
  Correlations
Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
Polina Kirichenko
Pavel Izmailov
A. Wilson
OOD
29
314
0
06 Apr 2022
An Empirical Study of Remote Sensing Pretraining
An Empirical Study of Remote Sensing Pretraining
Di Wang
Jing Zhang
Bo Du
Guisong Xia
Dacheng Tao
EDL
23
190
0
06 Apr 2022
Mixing Signals: Data Augmentation Approach for Deep Learning Based
  Modulation Recognition
Mixing Signals: Data Augmentation Approach for Deep Learning Based Modulation Recognition
Xin-Shun Xu
Zhuangzhi Chen
Dongwei Xu
Huaji Zhou
Shanqing Yu
Shilian Zheng
Qi Xuan
Xiaoniu Yang
20
13
0
05 Apr 2022
A Survey on Dropout Methods and Experimental Verification in
  Recommendation
A Survey on Dropout Methods and Experimental Verification in Recommendation
Y. Li
Weizhi Ma
C. L. Philip Chen
M. Zhang
Yiqun Liu
Shaoping Ma
Yue Yang
25
9
0
05 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
22
265
0
04 Apr 2022
BatchFormerV2: Exploring Sample Relationships for Dense Representation
  Learning
BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning
Zhi Hou
Baosheng Yu
Chaoyue Wang
Yibing Zhan
Dacheng Tao
ViT
13
11
0
04 Apr 2022
Previous
123...888990919293
Next