ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.06377
  4. Cited By
Masked Autoencoders Are Scalable Vision Learners
v1v2v3 (latest)

Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViTTPM
ArXiv (abs)PDFHTML

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,779 papers shown
Title
ELVIS: Empowering Locality of Vision Language Pre-training with
  Intra-modal Similarity
ELVIS: Empowering Locality of Vision Language Pre-training with Intra-modal Similarity
Sumin Seo
Jaewoong Shin
Jaewoo Kang
Tae Soo Kim
Thijs Kooi
47
1
0
11 Apr 2023
Towards Efficient Fine-tuning of Pre-trained Code Models: An
  Experimental Study and Beyond
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
Ensheng Shi
Yanlin Wang
Hongyu Zhang
Lun Du
Shi Han
Dongmei Zhang
Hongbin Sun
89
45
0
11 Apr 2023
A Billion-scale Foundation Model for Remote Sensing Images
A Billion-scale Foundation Model for Remote Sensing Images
Keumgang Cha
Junghoon Seo
Taekyung Lee
121
71
0
11 Apr 2023
Mask-Based Modeling for Neural Radiance Fields
Mask-Based Modeling for Neural Radiance Fields
Ganlin Yang
Guoqiang Wei
Zhizheng Zhang
Yan Lu
Dong Liu
AI4CE
47
1
0
11 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRMVLM
106
26
0
10 Apr 2023
SAM vs BET: A Comparative Study for Brain Extraction and Segmentation of
  Magnetic Resonance Images using Deep Learning
SAM vs BET: A Comparative Study for Brain Extraction and Segmentation of Magnetic Resonance Images using Deep Learning
Sovesh Mohapatra
Advait Gosai
G. Schlaug
38
34
0
10 Apr 2023
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner
Zhenyu Hou
Yufei He
Yukuo Cen
Xiao Liu
Yuxiao Dong
Evgeny Kharlamov
Jie Tang
SSL
68
118
0
10 Apr 2023
On Robustness in Multimodal Learning
On Robustness in Multimodal Learning
Brandon McKinzie
Joseph Cheng
Vaishaal Shankar
Yinfei Yang
Jonathon Shlens
Alexander Toshev
61
2
0
10 Apr 2023
Token Boosting for Robust Self-Supervised Visual Transformer
  Pre-training
Token Boosting for Robust Self-Supervised Visual Transformer Pre-training
Tianjiao Li
Lin Geng Foo
Ping Hu
Xindi Shang
Hossein Rahmani
Zehuan Yuan
Jing Liu
119
7
0
09 Apr 2023
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch
Shengbang Tong
Yubei Chen
Yi Ma
Yann LeCun
72
26
0
08 Apr 2023
InstructBio: A Large-scale Semi-supervised Learning Paradigm for
  Biochemical Problems
InstructBio: A Large-scale Semi-supervised Learning Paradigm for Biochemical Problems
Fang Wu
Huiling Qin
Siyuan Li
Stan Z. Li
Xianyuan Zhan
Jinbo Xu
80
5
0
08 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
Language-aware Multiple Datasets Detection Pretraining for DETRs
Language-aware Multiple Datasets Detection Pretraining for DETRs
Jing Hao
Song Chen
Xiaodi Wang
Shumin Han
ObjD
82
3
0
07 Apr 2023
Rethinking Evaluation Protocols of Visual Representations Learned via
  Self-supervised Learning
Rethinking Evaluation Protocols of Visual Representations Learned via Self-supervised Learning
Jaehoon Lee
Doyoung Yoon
Byeongmoon Ji
Kyungyul Kim
Sangheum Hwang
SSL
77
3
0
07 Apr 2023
Localized Region Contrast for Enhancing Self-Supervised Learning in
  Medical Image Segmentation
Localized Region Contrast for Enhancing Self-Supervised Learning in Medical Image Segmentation
Xiangyi Yan
Junayed Naushad
Chenyu You
Hao Tang
Shanlin Sun
Kun Han
Haoyu Ma
James Duncan
Xiaohui Xie
SSL
73
4
0
06 Apr 2023
Self-Supervised Video Similarity Learning
Self-Supervised Video Similarity Learning
Giorgos Kordopatis-Zilos
Giorgos Tolias
Christos Tzelepis
I. Kompatsiaris
Ioannis Patras
Symeon Papadopoulos
SSL
65
8
0
06 Apr 2023
Diffusion Models as Masked Autoencoders
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffMSyDa
100
53
0
06 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed
  Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Mingyu Ding
Songlin Yang
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
157
15
0
06 Apr 2023
Micron-BERT: BERT-based Facial Micro-Expression Recognition
Micron-BERT: BERT-based Facial Micro-Expression Recognition
Xuan-Bac Nguyen
C. Duong
Xin Li
Susan Gauch
Han-Seok Seo
Khoa Luu
87
59
0
06 Apr 2023
Synthesizing Anyone, Anywhere, in Any Pose
Synthesizing Anyone, Anywhere, in Any Pose
Håkon Hukkelås
Frank Lindseth
94
4
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot
  Keypoint Detection
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
82
11
0
06 Apr 2023
Inductive biases in deep learning models for weather prediction
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
113
6
0
06 Apr 2023
PointCAT: Cross-Attention Transformer for point cloud
PointCAT: Cross-Attention Transformer for point cloud
Xincheng Yang
Mingze Jin
Weiji He
Qian Chen
3DPCViT
77
3
0
06 Apr 2023
InterFormer: Real-time Interactive Image Segmentation
InterFormer: Real-time Interactive Image Segmentation
YouFu Huang
Hao Yang
Ke Sun
Shengchuan Zhang
Liujuan Cao
Guannan Jiang
Rongrong Ji
93
23
0
06 Apr 2023
Segment Anything
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLMVLM
497
7,485
0
05 Apr 2023
ENTL: Embodied Navigation Trajectory Learner
ENTL: Embodied Navigation Trajectory Learner
Klemen Kotar
Aaron Walsman
Roozbeh Mottaghi
113
7
0
05 Apr 2023
Towards Efficient Task-Driven Model Reprogramming with Foundation Models
Towards Efficient Task-Driven Model Reprogramming with Foundation Models
Shoukai Xu
Jiangchao Yao
Ran Luo
Shuhai Zhang
Zihao Lian
Mingkui Tan
Bo Han
Yaowei Wang
95
6
0
05 Apr 2023
Industrial Anomaly Detection with Domain Shift: A Real-world Dataset and
  Masked Multi-scale Reconstruction
Industrial Anomaly Detection with Domain Shift: A Real-world Dataset and Masked Multi-scale Reconstruction
Zilong Zhang
Zhibin Zhao
Xingwu Zhang
Chuang Sun
Xuefeng Chen
123
59
0
05 Apr 2023
Exploration of Lightweight Single Image Denoising with Transformers and
  Truly Fair Training
Exploration of Lightweight Single Image Denoising with Transformers and Truly Fair Training
Haram Choi
Cheolwoong Na
Jinseop S. Kim
Jihoon Yang
ViT
53
3
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete
  Survey on ChatGPT in AIGC Era
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MAAI4MHLRMELM
106
138
0
04 Apr 2023
RARE: Robust Masked Graph Autoencoder
RARE: Robust Masked Graph Autoencoder
Wenxuan Tu
Qing Liao
Sihang Zhou
Xin Peng
Chuan Ma
Yanfeng Guo
Xinwang Liu
Zhiping Cai
124
16
0
04 Apr 2023
Improved Visual Fine-tuning with Natural Language Supervision
Improved Visual Fine-tuning with Natural Language Supervision
Junyan Wang
Yuanhong Xu
Juhua Hu
Ming Yan
Jitao Sang
Qi Qian
64
8
0
04 Apr 2023
Defending Against Patch-based Backdoor Attacks on Self-Supervised
  Learning
Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning
Ajinkya Tejankar
Maziar Sanjabi
Qifan Wang
Sinong Wang
Hamed Firooz
Hamed Pirsiavash
L Tan
AAML
86
21
0
04 Apr 2023
Exploring Vision-Language Models for Imbalanced Learning
Exploring Vision-Language Models for Imbalanced Learning
Yidong Wang
Zhuohao Yu
Jindong Wang
Qiang Heng
Haoxing Chen
Wei Ye
Rui Xie
Xingxu Xie
Shi-Bo Zhang
VLM
105
33
0
04 Apr 2023
U-Netmer: U-Net meets Transformer for medical image segmentation
U-Netmer: U-Net meets Transformer for medical image segmentation
Sheng He
Rina Bao
P. E. Grant
Yangming Ou
ViTMedIm
101
13
0
03 Apr 2023
Specialty-Oriented Generalist Medical AI for Chest CT Screening
Specialty-Oriented Generalist Medical AI for Chest CT Screening
Chuang Niu
Qing Lyu
Christopher D. Carothers
P. Kaviani
Josh Tan
Pingkun Yan
Mannudeep K. Kalra
C. Whitlow
Ge Wang
66
6
0
03 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
111
34
0
03 Apr 2023
Associating Spatially-Consistent Grouping with Text-supervised Semantic
  Segmentation
Associating Spatially-Consistent Grouping with Text-supervised Semantic Segmentation
Yabo Zhang
Zihao Wang
Jun Hao Liew
Jingjia Huang
Manyu Zhu
Jiashi Feng
W. Zuo
VLM
52
4
0
03 Apr 2023
Real-time 6K Image Rescaling with Rate-distortion Optimization
Real-time 6K Image Rescaling with Rate-distortion Optimization
Chenyang Qi
Xin Yang
Ka Leong Cheng
Ying-Cong Chen
Qifeng Chen
79
10
0
03 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via
  Historical Object Prediction
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
120
39
0
03 Apr 2023
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot
  Action Recognition
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
84
45
0
03 Apr 2023
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Yuheng Lu
Chenfeng Xu
Xi Wei
Xiaodong Xie
Masayoshi Tomizuka
Kurt Keutzer
Shanghang Zhang
3DPC
118
57
0
03 Apr 2023
Disentangled Pre-training for Image Matting
Disentangled Pre-training for Image Matting
Yan-Da Li
Zilong Huang
Gang Yu
Ling-Hao Chen
Yunchao Wei
Jianbo Jiao
93
0
0
03 Apr 2023
Chain-of-Thought Predictive Control
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
145
20
0
03 Apr 2023
Multi-Modal Representation Learning with Text-Driven Soft Masks
Multi-Modal Representation Learning with Text-Driven Soft Masks
Jaeyoo Park
Bohyung Han
SSL
51
4
0
03 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
167
552
0
03 Apr 2023
NeuroDAVIS: A neural network model for data visualization
NeuroDAVIS: A neural network model for data visualization
Chayan Maitra
D. Seal
R. K. De
DiffM
55
4
0
01 Apr 2023
HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised
  Learning of Actions
HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions
Anshul B. Shah
A. Roy
Ketul Shah
Shlok Kumar Mishra
David W. Jacobs
A. Cherian
Ramalingam Chellappa
SSL
62
29
0
01 Apr 2023
JacobiNeRF: NeRF Shaping with Mutual Information Gradients
JacobiNeRF: NeRF Shaping with Mutual Information Gradients
Xiaomeng Xu
Yanchao Yang
Kaichun Mo
Boxiao Pan
L. Yi
Leonidas Guibas
86
10
0
01 Apr 2023
Mask Hierarchical Features For Self-Supervised Learning
Mask Hierarchical Features For Self-Supervised Learning
Fenggang Liu
Yangguang Li
Feng Liang
Jilan Xu
Bin Huang
Jing Shao
30
0
0
01 Apr 2023
Previous
123...707172...949596
Next