Masked Autoencoders Are Scalable Vision Learners
arXiv:2111.06377 (v3, latest)

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT, TPM

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,779 papers shown
Affordance Grounding from Demonstration Video to Target Image
Joya Chen
Difei Gao
Kevin Qinghong Lin
Mike Zheng Shou
70
27
0
26 Mar 2023
Selective Structured State-Spaces for Long-Form Video Understanding
Jue Wang
Wenjie Zhu
Pichao Wang
Xiang Yu
Linda Liu
Mohamed Omar
Raffay Hamid
94
101
0
25 Mar 2023
Spatio-Temporal Graph Neural Networks for Predictive Learning in Urban Computing: A Survey
G. Jin
Yuxuan Liang
Yuchen Fang
Zezhi Shao
Jincai Huang
Junbo Zhang
Yu Zheng
AI4TS, AI4CE
143
211
0
25 Mar 2023
Federated Learning without Full Labels: A Survey
Yilun Jin
Yang Liu
Kai Chen
Qian Yang
FedML
85
26
0
25 Mar 2023
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression
Denis Kuznedelev
Soroush Tabesh
Kimia Noorbakhsh
Elias Frantar
Sara Beery
Eldar Kurtic
Dan Alistarh
MQ, VLM
75
2
0
25 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao
Pan Zhou
Ming-Ming Cheng
Shuicheng Yan
DiffM
229
171
0
25 Mar 2023
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection
Hwanjun Song
Jihwan Bang
VLM, ObjD
80
15
0
25 Mar 2023
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm
Yichen Xie
Han Lu
Junchi Yan
Xiaokang Yang
Masayoshi Tomizuka
Wei Zhan
102
34
0
25 Mar 2023
ViPFormer: Efficient Vision-and-Pointcloud Transformer for Unsupervised Pointcloud Understanding
Hongyu Sun
Yongcai Wang
Xudong Cai
Xuewei Bai
Deying Li
ViT, 3DPC
98
8
0
25 Mar 2023
Supervised Masked Knowledge Distillation for Few-Shot Transformers
Hanxi Lin
G. Han
Jiawei Ma
Shiyuan Huang
Xudong Lin
Shih-Fu Chang
92
36
0
25 Mar 2023
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu
Xin Wen
Xihui Liu
Hengshuang Zhao
3DPC
173
45
0
24 Mar 2023
MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Yizhuo Lu
Changde Du
Dianpeng Wang
Huiguang He
DiffM
199
45
0
24 Mar 2023
Image Deblurring by Exploring In-depth Properties of Transformer
Pengwei Liang
Junjun Jiang
Xianming Liu
Jiayi Ma
ViT
78
20
0
24 Mar 2023
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands
Zhiyang Guo
Wen-gang Zhou
Min Wang
Li Li
Houqiang Li
3DH
128
16
0
24 Mar 2023
Temperature Schedules for Self-Supervised Contrastive Methods on Long-Tail Data
Anna Kukleva
Moritz Bohle
Bernt Schiele
Hilde Kuehne
Christian Rupprecht
94
45
0
23 Mar 2023
Neural Preset for Color Style Transfer
Zhanghan Ke
Yuhao Liu
Lei Zhu
Nanxuan Zhao
Rynson W. H. Lau
156
35
0
23 Mar 2023
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Runsen Xu
Tai Wang
Wenwei Zhang
Runjian Chen
Jinkun Cao
Jiangmiao Pang
Dahua Lin
3DPC
93
30
0
23 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
182
71
0
23 Mar 2023
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation
Kehan Li
Yian Zhao
Zhennan Wang
Ze-Long Cheng
Peng Jin
Xiang Ji
Li-ming Yuan
Chang-rui Liu
Jie Chen
84
9
0
23 Mar 2023
Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Hantao Yao
Rui Zhang
Changsheng Xu
VLM, VPVLM
206
227
0
23 Mar 2023
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning
Thomas Stegmüller
Tim Lebailly
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
98
17
0
23 Mar 2023
Masked Image Training for Generalizable Deep Image Denoising
Haoyu Chen
Jinjin Gu
Yihao Liu
Salma Abdel Magid
Chao Dong
Qiong Wang
Hanspeter Pfister
Lei Zhu
79
68
0
23 Mar 2023
PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds
Yun-Hai Liu
Xu Yan
Zhilei Chen
Zhiqi Li
Zeyong Wei
Mingqiang Wei
3DPC
90
2
0
23 Mar 2023
Top-Down Visual Attention from Analysis by Synthesis
Baifeng Shi
Trevor Darrell
Xin Eric Wang
88
32
0
23 Mar 2023
Test-time Detection and Repair of Adversarial Samples via Masked Autoencoder
Yun-Yun Tsai
Ju-Chin Chao
Albert Wen
Zhaoyuan Yang
Chengzhi Mao
Tapan Shah
Junfeng Yang
AAML
68
1
0
22 Mar 2023
Correlational Image Modeling for Self-Supervised Visual Pre-Training
Wei Li
Jiahao Xie
Chen Change Loy
SSL
96
12
0
22 Mar 2023
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViT, AI4TS
73
12
0
22 Mar 2023
EasyDGL: Encode, Train and Interpret for Continuous-time Dynamic Graph Learning
Chao Chen
Haoyu Geng
Nianzu Yang
Xiaokang Yang
Junchi Yan
104
8
0
22 Mar 2023
MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation
Vitaliy Kinakh
M. Drozdova
Svyatoslav Voloshynovskiy
83
2
0
21 Mar 2023
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant
Maika Edberg
Nicolas Dufour
Vicky Kalogeiton
MedIm, ViT
63
1
0
21 Mar 2023
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
J. Hernandez
Ruben Villegas
Vicente Ordonez
SSL
75
2
0
21 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
188
170
0
21 Mar 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
89
49
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH, LM&MA
114
141
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM, ViT, CLIP
130
289
0
20 Mar 2023
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
Jihao Liu
Tai Wang
Boxiao Liu
Qihang Zhang
Yu Liu
Hongsheng Li
69
16
0
20 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
84
3
0
20 Mar 2023
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization
Ziquan Liu
Yi Tian Xu
Xiangyang Ji
Antoni B. Chan
AAML
58
18
0
20 Mar 2023
Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning
Sungnyun Kim
Sangmin Bae
Se-Young Yun
135
11
0
20 Mar 2023
Understanding the Role of the Projector in Knowledge Distillation
Roy Miles
K. Mikolajczyk
115
25
0
20 Mar 2023
FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder
Nan Yang
Xuanyu Chen
Charles Z. Liu
Dong Yuan
Wei Bao
Li-zhen Cui
69
3
0
20 Mar 2023
Multi-modal Facial Affective Analysis based on Masked Autoencoder
Wei Zhang
Bowen Ma
Feng Qiu
Yu-qiong Ding
CVBM
99
29
0
20 Mar 2023
Diffusion-based Document Layout Generation
Liu He
Yijuan Lu
John Corring
D. Florêncio
Cha Zhang
DiffM
63
22
0
19 Mar 2023
Trainable Projected Gradient Method for Robust Fine-tuning
Junjiao Tian
Xiaoliang Dai
Chih-Yao Ma
Zecheng He
Yen-Cheng Liu
Z. Kira
115
41
0
19 Mar 2023
Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection
Zihan Wang
Siyang Song
Cheng Luo
Yuzhi Zhou
Shiling Wu
Weicheng Xie
Linlin Shen
CVBM
60
13
0
19 Mar 2023
Exploring Expression-related Self-supervised Learning for Affective Behaviour Analysis
Fanglei Xue
Yifan Sun
Yi Yang
90
4
0
18 Mar 2023
Machine learning with data assimilation and uncertainty quantification for dynamical systems: a review
Sibo Cheng
César Quilodrán-Casas
Said Ouala
A. Farchi
Che Liu
...
Weiping Ding
Yike Guo
A. Carrassi
Marc Bocquet
Rossella Arcucci
AI4CE
81
138
0
18 Mar 2023
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation
Zhaohu Xing
Lei Zhu
Lequan Yu
Zhiheng Xing
Liang Wan
68
9
0
18 Mar 2023
Data-Centric Learning from Unlabeled Graphs with Diffusion Model
Gang Liu
Eric Inae
Tong Zhao
Jiaxin Xu
Te Luo
Meng Jiang
DiffM
75
26
0
17 Mar 2023
A Unified Continual Learning Framework with General Parameter-Efficient Tuning
Qiankun Gao
Chen Zhao
Yifan Sun
Teng Xi
Gang Zhang
Guohao Li
Shuai Liu
CLL
129
98
0
17 Mar 2023