Masked Autoencoders Are Scalable Vision Learners

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViT
    TPM

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,611 papers shown
Twofold Debiasing Enhances Fine-Grained Learning with Coarse Labels
Xin-yang Zhao
Jian Jin
Yang-yang Li
Yazhou Yao
42
0
0
27 Feb 2025
Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds
Mohamed Abdelsamad
Michael Ulrich
Claudius Gläser
Abhinav Valada
3DPC
42
0
0
27 Feb 2025
FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction
Siyu Jiao
Gengwei Zhang
Yinlong Qian
Jiancheng Huang
Yao Zhao
Humphrey Shi
Lin Ma
Y. X. Wei
Zequn Jie
VLM
39
1
0
27 Feb 2025
Shared Stochastic Gaussian Process Latent Variable Models: A Multi-modal Generative Model for Quasar Spectra
Vidhi Lalchand
Anna-Christina Eilers
59
0
0
27 Feb 2025
GONet: A Generalizable Deep Learning Model for Glaucoma Detection
Or Abramovich
Hadas Pizem
Jonathan Fhima
Eran Berkowitz
Ben Gofrit
...
Meital Baskin
Jan Van Eijgen
Ingeborg Stalmans
E. Blumenthal
Joachim A. Behar
59
1
0
26 Feb 2025
Generalist World Model Pre-Training for Efficient Reinforcement Learning
Yi Zhao
Aidan Scannell
Yuxin Hou
Tianyu Cui
Le Chen
Dieter Buchler
Arno Solin
Juho Kannala
J. Pajarinen
OffRL
OnRL
75
1
0
26 Feb 2025
Dictionary-based Framework for Interpretable and Consistent Object Parsing
Tiezheng Zhang
Qihang Yu
Alan Yuille
Ju He
74
1
0
26 Feb 2025
MCLRL: A Multi-Domain Contrastive Learning with Reinforcement Learning Framework for Few-Shot Modulation Recognition
Dongwei Xu
Yutao Zhu
Yao Lu
Youpeng Feng
Yun Lin
Qi Xuan
76
0
0
26 Feb 2025
Mixtraining: A Better Trade-Off Between Compute and Performance
Zexin Li
Jiancheng Zhang
Yufei Li
Yinglun Zhu
Cong Liu
46
0
0
26 Feb 2025
Multispectral to Hyperspectral using Pretrained Foundational model
Ruben Gonzalez
C. Albrecht
Nassim Ait Ali Braham
Devyani Lambhate
Joao Lucas de Sousa Almeida
P. Fraccaro
Benedikt Blumenstiel
Thomas Brunschwiler
Ranjini Bangalore
61
0
0
26 Feb 2025
Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation
Guang Lin
D. Nguyen
Zerui Tao
Konstantinos Slavakis
Toshihisa Tanaka
Qibin Zhao
AAML
59
0
0
25 Feb 2025
Escaping The Big Data Paradigm in Self-Supervised Representation Learning
Carlos Vélez García
Miguel Cazorla
Jorge Pomares
49
0
0
25 Feb 2025
DenoMAE2.0: Improving Denoising Masked Autoencoders by Classifying Local Patches
Atik Faysal
Mohammad Rostami
Taha Boushine
Reihaneh Gh. Roshan
Huaxia Wang
Nikhil Muralidhar
39
1
0
25 Feb 2025
Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models
Zhaoyi Liu
Huan Zhang
AAML
72
0
0
25 Feb 2025
Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives
Dilermando Queiroz
Anderson Carlos
André Anjos
Lilian Berton
43
0
0
24 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
58
43
0
24 Feb 2025
A Survey of fMRI to Image Reconstruction
Weiyu Guo
Guoying Sun
JianXiang He
Tong Shao
Shaoguang Wang
Ziyang Chen
Meisheng Hong
Ying Sun
Hui Xiong
40
1
0
24 Feb 2025
TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation
Daoyu Wang
Mingyue Cheng
Z. Liu
Q. Liu
Enhong Chen
AI4TS
DiffM
45
1
0
24 Feb 2025
Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation
Wenxuan Wang
K. Wu
Yujian Betterest Li
Dan Wang
X. Zhang
J. Liu
AI4TS
63
0
0
24 Feb 2025
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
Yuli Wu
Fucheng Liu
Rüveyda Yilmaz
Henning Konermann
Peter Walter
Johannes Stegmaier
EGVM
MedIm
48
1
0
24 Feb 2025
Simpler Fast Vision Transformers with a Jumbo CLS Token
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
67
0
0
24 Feb 2025
MACPruning: Dynamic Operation Pruning to Mitigate Side-Channel DNN Model Extraction
Ruyi Ding
Cheng Gongye
Davis Ranney
A. A. Ding
Yunsi Fei
AAML
63
0
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
65
8
0
24 Feb 2025
MolSpectra: Pre-training 3D Molecular Representation with Multi-modal Energy Spectra
Liang Wang
Shaozhen Liu
Yu Rong
Deli Zhao
Qiang Liu
Shu Wu
Liang Wang
MedIm
63
2
0
22 Feb 2025
Exploring Patient Data Requirements in Training Effective AI Models for MRI-based Breast Cancer Classification
Solha Kang
W. D. Neve
Francois Rameau
Utku Ozbulak
OOD
45
0
0
22 Feb 2025
Understanding the Emergence of Multimodal Representation Alignment
Megan Tjandrasuwita
Chanakya Ekbote
Liu Ziyin
Paul Pu Liang
50
1
0
22 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
73
4
0
21 Feb 2025
Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning
Yongqi Dong
Xingmin Lu
Ruohan Li
Wei Song
B. Arem
Haneen Farah
ViT
105
1
0
21 Feb 2025
Controllable Unlearning for Image-to-Image Generative Models via $\varepsilon$-Constrained Optimization
Xiaohua Feng
Chao-Jun Chen
Yuyuan Li
L. Zhang
Longfei Li
Jun Zhou
Xiaolin Zheng
MU
68
0
0
20 Feb 2025
Contrastive Localized Language-Image Pre-Training
Hong-You Chen
Zhengfeng Lai
H. Zhang
X. Wang
Marcin Eichner
Keen You
Meng Cao
Bowen Zhang
Y. Yang
Zhe Gan
CLIP
VLM
68
7
0
20 Feb 2025
Myna: Masking-Based Contrastive Learning of Musical Representations
Ori Yonay
Tracy Hammond
Tianbao Yang
AAML
53
0
0
20 Feb 2025
Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
Yizhuo Lu
Changde Du
Chong Wang
Xuanliu Zhu
Liuyun Jiang
Xujin Li
Huiguang He
VGen
114
4
0
20 Feb 2025
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Zekun Qi
Wenyao Zhang
Yufei Ding
Runpei Dong
Xinqiang Yu
...
Xin Jin
Kaisheng Ma
Zhizheng Zhang
He Wang
Li Yi
LM&Ro
131
3
0
18 Feb 2025
Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid Self-Supervised Learning Framework
Cheol-Hui Lee
Hakseung Kim
Byung C. Yoon
Dong-Joo Kim
41
0
0
18 Feb 2025
L4P: Low-Level 4D Vision Perception Unified
Abhishek Badki
Hang Su
Bowen Wen
Orazio Gallo
VLM
78
1
0
18 Feb 2025
Performance of Zero-Shot Time Series Foundation Models on Cloud Data
William Toner
Thomas L. Lee
Artjom Joosen
Rajkarn Singh
Martin Asenov
AI4TS
50
0
0
18 Feb 2025
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment
Haoyuan Wu
Haisheng Zheng
Yuan Pu
Bei Yu
53
1
0
18 Feb 2025
Lightweight Online Adaption for Time Series Foundation Model Forecasts
Thomas L. Lee
William Toner
Rajkarn Singh
Artjom Joosen
Martin Asenov
AI4TS
36
0
0
18 Feb 2025
Masking the Gaps: An Imputation-Free Approach to Time Series Modeling with Missing Data
Abhilash Neog
Arka Daw
Sepideh Fatemi Khorasgani
Anuj Karpatne
AI4TS
36
0
0
18 Feb 2025
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding
Weikang Qiu
Zheng Huang
Haoyu Hu
Aosong Feng
Yujun Yan
Rex Ying
45
0
0
18 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
75
0
0
18 Feb 2025
Hyperspherical Energy Transformer with Recurrent Depth
Yunzhe Hu
Difan Zou
Dong Xu
41
0
0
17 Feb 2025
Artificial Kuramoto Oscillatory Neurons
Takeru Miyato
Sindy Lowe
Andreas Geiger
Max Welling
AI4CE
69
6
0
17 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
59
3
0
17 Feb 2025
Frequency-Aware Masked Autoencoders for Human Activity Recognition using Accelerometers
Niels R. Lorenzen
P. Jennum
Emmanuel Mignot
A. Brink-Kjaer
31
0
0
17 Feb 2025
CR-CTC: Consistency regularization on CTC for improved speech recognition
Zengwei Yao
Wei Kang
Xiaoyu Yang
Fangjun Kuang
Liyong Guo
Han Zhu
Zengrui Jin
Zhaoqing Li
Long Lin
Daniel Povey
51
0
0
17 Feb 2025
Intensity-Spatial Dual Masked Autoencoder for Multi-Scale Feature Learning in Chest CT Segmentation
Yuexing Ding
Jun Wang
H. Lyu
86
0
0
17 Feb 2025
Differentially Private Prototypes for Imbalanced Transfer Learning
Dariush Wahdany
Matthew Jagielski
Adam Dziedzic
Franziska Boenisch
85
0
0
17 Feb 2025
Simplifying DINO via Coding Rate Regularization
Ziyang Wu
Jingyuan Zhang
Druv Pai
X. Wang
Chandan Singh
Jianwei Yang
Jianfeng Gao
Yi-An Ma
156
1
0
17 Feb 2025
Vision-Enhanced Time Series Forecasting via Latent Diffusion Models
Weilin Ruan
Siru Zhong
Haomin Wen
Yuxuan Liang
AI4TS
67
1
0
16 Feb 2025