Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT, TPM

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,778 papers shown
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE
Yiying Yang
Fukun Yin
Jiayuan Fan
Xin Chen
Wanzhang Li
Gang Yu
VGen
94
1
0
10 Aug 2024
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification
Bin Hu
Xinggang Wang
Wenyu Liu
ViT
100
4
0
10 Aug 2024
Enhancing Representation Learning of EEG Data with Masked Autoencoders
Yifei Zhou
Sitong Liu
83
0
0
09 Aug 2024
Semi-Supervised One-Shot Imitation Learning
Philipp Wu
Kourosh Hakhamaneshi
Yuqing Du
Igor Mordatch
Aravind Rajeswaran
Pieter Abbeel
SSL
108
1
0
09 Aug 2024
UNIC: Universal Classification Models via Multi-teacher Distillation
Mert Bulent Sariyildiz
Philippe Weinzaepfel
Thomas Lucas
Diane Larlus
Yannis Kalantidis
133
7
0
09 Aug 2024
Masked adversarial neural network for cell type deconvolution in spatial transcriptomics
Lin Huang
Xiaofei Liu
Shunfang Wang
Wenwen Min
28
0
0
09 Aug 2024
Instruction Tuning-free Visual Token Complement for Multimodal LLMs
Dongsheng Wang
Jiequan Cui
Miaoge Li
Wang Lin
Bo Chen
Hanwang Zhang
MLLM
50
4
0
09 Aug 2024
Generative AI on SpectrumNet: An Open Benchmark of Multiband 3D Radio Maps
Shuhang Zhang
Shuai Jiang
Wanjie Lin
Zheng Fang
Kangjun Liu
Hongliang Zhang
Ke Chen
MedIm
75
4
0
09 Aug 2024
Synchronous Multi-modal Semantic Communication System with Packet-level Coding
Yun Tian
Jingkai Ying
Zhijin Qin
Ye Jin
Xiaoming Tao
72
6
0
08 Aug 2024
AggSS: An Aggregated Self-Supervised Approach for Class-Incremental Learning
Jayateja Kalla
Soma Biswas
SSL
78
0
0
08 Aug 2024
Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction
Yuchen Wang
Ziyi Guo
Haixia Bi
Danfeng Hong
Chen Xu
SSL
117
3
0
08 Aug 2024
MU-MAE: Multimodal Masked Autoencoders-Based One-Shot Learning
Rex Liu
Xin Liu
100
2
0
08 Aug 2024
Masked EEG Modeling for Driving Intention Prediction
Jinzhao Zhou
Justin Sia
Yiqun Duan
Yu-Cheng Chang
Yu-Kai Wang
Chin-Teng Lin
39
3
0
08 Aug 2024
PowerPM: Foundation Model for Power Systems
Shihao Tu
Yupeng Zhang
Jing Zhang
Yang Yang
AI4TS
55
7
0
07 Aug 2024
How Well Can Vision Language Models See Image Details?
Chenhui Gou
Abdulwahab Felemban
Faizan Farooq Khan
Deyao Zhu
Jianfei Cai
Hamid Rezatofighi
Mohamed Elhoseiny
VLM, MLLM
100
5
0
07 Aug 2024
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context Relation Modeling
Seok Hwan Lee
Taein Son
Soo Won Seo
Jisong Kim
Jun Won Choi
96
0
0
07 Aug 2024
DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba
Chengran Yuan
Zhanqi Zhang
Jiawei Sun
Shuo Sun
Zefan Huang
...
Dongen Li
Yuhang Han
Anthony Wong
K. P. Tee
Marcelo H. Ang Jr
Mamba
113
16
0
07 Aug 2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation
Xin Liu
Chao Hao
Zitong Yu
Huanjing Yue
Jingyu Yang
66
1
0
05 Aug 2024
Past Movements-Guided Motion Representation Learning for Human Motion Prediction
Junyu Shi
Baoxuan Wang
3DH
65
0
0
04 Aug 2024
Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image
Xinlin Ren
Chenjie Cao
Yanwei Fu
Xiangyang Xue
140
2
0
04 Aug 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
78
0
0
04 Aug 2024
Unsupervised Representation Learning by Balanced Self Attention Matching
Daniel Shalam
Simon Korman
SSL
111
0
0
04 Aug 2024
Masked Angle-Aware Autoencoder for Remote Sensing Images
Zhihao Li
B. Hou
Siteng Ma
Zitong Wu
Xianpeng Guo
Bo Ren
Licheng Jiao
132
13
0
04 Aug 2024
Image Clustering Algorithm Based on Self-Supervised Pretrained Models and Latent Feature Distribution Optimization
Qiuyu Zhu
Liheng Hu
Sijin Wang
SSL, VLM
44
1
0
04 Aug 2024
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Weijie Zheng
Xingjun Ma
Hanxun Huang
Zuxuan Wu
Yu-Gang Jiang
AAML
102
0
0
03 Aug 2024
MedUHIP: Towards Human-In-the-Loop Medical Segmentation
Jiayuan Zhu
A. Billard
52
0
0
03 Aug 2024
Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
66
5
0
02 Aug 2024
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
Donwon Park
Leixian Shen
Se Young Chun
94
2
0
02 Aug 2024
POA: Pre-training Once for Models of All Sizes
Yingying Zhang
Xin Guo
Jiangwei Lao
Lei Yu
Lixiang Ru
Jian Wang
Guo Ye
Huimei He
Jingdong Chen
Ming Yang
169
1
0
02 Aug 2024
Text-Guided Video Masked Autoencoder
D. Fan
Jue Wang
Shuai Liao
Zhikang Zhang
Vimal Bhat
Xinyu Li
VGen
57
3
0
01 Aug 2024
Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology
Eric Zimmermann
Eugene Vorontsov
Julian Viret
Adam Casson
Michal Zelechowski
...
Razik Yousfi
Thomas J. Fuchs
Nicolò Fusi
Siqi Liu
Kristen Severson
MedIm
108
40
0
01 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM, MLLM
172
948
0
01 Aug 2024
Scaling Backwards: Minimal Synthetic Pre-training?
Ryo Nakamura
Ryu Tadokoro
Ryosuke Yamada
Tim Puhlfürß
Iro Laina
Christian Rupprecht
Walid Maalej
Rio Yokota
Hirokatsu Kataoka
DD
90
4
0
01 Aug 2024
AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation
Asbjorn Munk
Jakob Ambsdorf
S. Llambias
Mads Nielsen
83
4
0
01 Aug 2024
A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li
Jun Gao
Chen Chen
Yue Zhang
Jielei Zhang
DiffM
82
5
0
01 Aug 2024
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
Richard Ren
Steven Basart
Adam Khoja
Alice Gatti
Long Phan
...
Alexander Pan
Gabriel Mukobi
Ryan H. Kim
Stephen Fitz
Dan Hendrycks
ELM
77
25
0
31 Jul 2024
EZSR: Event-based Zero-Shot Recognition
Yan Yang
Sehwan Kim
Dongxu Li
Y. Sun
65
0
0
31 Jul 2024
Big Cooperative Learning
Yulai Cong
AI4CE
70
0
0
31 Jul 2024
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
86
8
0
31 Jul 2024
Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation
M. Gauy
Natalia Hitomi Koza
Ricardo Mikio Morita
Gabriel Rocha Stanzione
Arnaldo Cândido Júnior
L. Berti
A. S. Levin
E. Sabino
F. Svartman
Marcelo Finger
54
0
0
30 Jul 2024
S3PET: Semi-supervised Standard-dose PET Image Reconstruction via Dose-aware Token Swap
J. Cui
Pinxian Zeng
Yuanyuan Xu
Xi Wu
Jiliu Zhou
Yan Wang
65
1
0
30 Jul 2024
What makes for good morphology representations for spatial omics?
Eduard Chelebian
C. Avenel
C. Wählby
51
0
0
30 Jul 2024
Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image
Yi Gu
Yoshito Otake
Keisuke Uemura
Masaki Takao
Mazen Soufi
Seiji Okada
Nobuhiko Sugano
Hugues Talbot
Yoshinobu Sato
58
0
0
30 Jul 2024
Dense Self-Supervised Learning for Medical Image Segmentation
Maxime Seince
Loic Le Folgoc
Luiz Augusto Facury de Souza
Elsa Angelini
SSL
75
0
0
29 Jul 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
110
28
0
29 Jul 2024
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang
Karl Schmeckpeper
Brandon B. May
M. Minniti
Tarik Kelestemur
David Watkins
Laura Herlant
VLM
101
24
0
29 Jul 2024
Classification, Regression and Segmentation directly from k-Space in Cardiac MRI
Ruochen Li
Jiazhen Pan
Youxiang Zhu
Juncheng Ni
Daniel Rueckert
71
2
0
29 Jul 2024
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets
Muhammad Abdullah Jamal
Omid Mohareri
92
2
0
29 Jul 2024
Forecast-PEFT: Parameter-Efficient Fine-Tuning for Pre-trained Motion Forecasting Models
Jifeng Wang
Kaouther Messaoud
Yuejiang Liu
Juergen Gall
Alexandre Alahi
69
1
0
28 Jul 2024
Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis
Honglin Li
Yusuan Sun
Chenglu Zhu
Yunlong Zhang
Shichuan Zhang
...
Pingyi Chen
Jingxiong Li
Sunyi Zheng
Can Cui
Lin Yang
85
3
0
28 Jul 2024