ResearchTrend.AI
Masked Autoencoders Are Scalable Vision Learners
Versions: v1, v2, v3 (latest)

11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
    ViT, TPM
arXiv: 2111.06377

Papers citing "Masked Autoencoders Are Scalable Vision Learners"

50 / 4,777 papers shown
Title
3D Medical Imaging Segmentation on Non-Contrast CT
Canxuan Gang
Yuhan Peng
104
0
0
11 Mar 2025
SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models
Hesen Chen
Junyan Wang
Zhiyu Tan
Hao Li
101
1
0
11 Mar 2025
Scale-Aware Pre-Training for Human-Centric Visual Perception: Enabling Lightweight and Generalizable Models
Xuanhan Wang
Huimin Deng
Lianli Gao
Jingkuan Song
VLM
74
0
0
11 Mar 2025
"Principal Components" Enable A New Language of Images
Xin Wen
Bingchen Zhao
Ismail Elezi
Jiankang Deng
Xiaojuan Qi
114
1
0
11 Mar 2025
Seal Your Backdoor with Variational Defense
Ivan Sabolić
Matej Grcić
Sinisa Segvic
AAML
460
0
0
11 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
Hong Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe Lin
Marios Savvides
160
2
0
11 Mar 2025
Pre-trained Models Succeed in Medical Imaging with Representation Similarity Degradation
Wenqiang Zu
Shenghao Xie
Hao Chen
Lei Ma
MedIm
143
0
0
11 Mar 2025
Universal Incremental Learning: Mitigating Confusion from Inter- and Intra-task Distribution Randomness
Sheng Luo
Yi Zhou
Tao Zhou
CLL
169
0
0
10 Mar 2025
MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction
H. Q. Vo
Pengyu Yuan
Zheng Yin
Kelvin K. Wong
Chika F. Ezeana
S. Ly
Stephen T. C. Wong
H. Nguyen
57
0
0
10 Mar 2025
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
Ziliang Miao
Runjian Chen
Yixi Cai
Buwei He
Wenquan Zhao
Wenqi Shao
Bo Zhang
Fu Zhang
3DPC
98
0
0
10 Mar 2025
Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models
Yuchen Mao
Hongwei Bran Li
Yinyi Lai
G. Papanastasiou
Peng Qi
Yunjie Yang
Chengjia Wang
VLM
111
1
0
10 Mar 2025
Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation
Pengchen Liang
Haishan Huang
Bin Pu
Jianguo Chen
Xiang Hua
Jing Zhang
Weibo Ma
Z. Chen
Yiwei Li
Qing Chang
78
0
0
10 Mar 2025
On the Generalization of Representation Uncertainty in Earth Observation
Spyros Kondylatos
Nikolaos Ioannis Bountos
Dimitrios Michail
Xiao Xiang Zhu
Gustau Camps-Valls
Ioannis Papoutsis
108
1
0
10 Mar 2025
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation
Ding Zhong
Xu Zheng
Chenfei Liao
Yuanhuiyi Lyu
Jialei Chen
Shengyang Wu
Linfeng Zhang
Xuming Hu
VLM
118
10
0
10 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
222
0
0
10 Mar 2025
Denoising Hamiltonian Network for Physical Reasoning
Congyue Deng
Brandon Yushan Feng
Cecilia Garraffo
Alan Garbarz
Robin Walters
William T. Freeman
Leonidas Guibas
Kaiming He
AI4CE
95
0
0
10 Mar 2025
V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation
Guiwei Zhang
Tianyu Zhang
Mohan Zhou
Yalong Bai
Biye Li
145
0
0
10 Mar 2025
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression
Thibaut Loiseau
Guillaume Bourmaud
Vincent Lepetit
132
1
0
10 Mar 2025
Keeping Representation Similarity in Finetuning for Medical Image Analysis
Wenqiang Zu
Shenghao Xie
Hao Chen
Yiming Liang
Lei Ma
MedIm, OOD
139
0
0
10 Mar 2025
ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning
S. Banerjee
Vinay Kumar Verma
SSL
103
0
0
10 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
111
0
0
10 Mar 2025
Effective and Efficient Masked Image Generation Models
Zebin You
Jingyang Ou
Xiaolu Zhang
Jun Hu
Jun Zhou
Chongxuan Li
DiffM, VLM
113
3
0
10 Mar 2025
MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages
Hao Xu
Tengfei Xue
Dongnan Liu
Yuqian Chen
Fan Zhang
C. Westin
Ron Kikinis
L. O’Donnell
Weidong Cai
78
0
0
09 Mar 2025
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Xavier Thomas
Deepti Ghadiyaram
DiffM
196
0
0
09 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Jason K. Eshraghian
Haicheng Qu
Jing Zhang
Malu Zhang
Yiran Yang
103
1
0
09 Mar 2025
M³amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification
Mingxiang Cao
Weiying Xie
Xin Zhang
Jiaqing Zhang
Kai Jiang
Jie Lei
Yunsong Li
Mamba
150
0
0
09 Mar 2025
CLICv2: Image Complexity Representation via Content Invariance Contrastive Learning
Shipeng Liu
Liang Zhao
Dengfeng Chen
SSL
198
0
0
09 Mar 2025
Segment Anything, Even Occluded
Wei-En Tai
Yu-Lin Shih
Cheng Sun
Y. Wang
Hwann-Tzong Chen
VLM
100
1
0
08 Mar 2025
Exploring Interpretability for Visual Prompt Tuning with Hierarchical Concepts
Yubin Wang
Xinyang Jiang
De Cheng
Xiangqian Zhao
Zilong Wang
Dongsheng Li
Cairong Zhao
VLM
165
0
0
08 Mar 2025
Pathology-Guided AI System for Accurate Segmentation and Diagnosis of Cervical Spondylosis
Qi Zhang
Xiuyuan Chen
Ziyi He
Lianming Wu
Kun Wang
Jianqi Sun
Hongxing Shen
163
0
0
08 Mar 2025
Dynamically evolving segment anything model with continuous learning for medical image segmentation
Zhaori Liu
Mengyang Li
Hu Han
Enli Zhang
Shiguang Shan
Zhiming Zhao
VLM
84
0
0
08 Mar 2025
WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining
Haoran Wang
Lian Huai
Wenbin Li
Lei Qi
Xingqun Jiang
Yinghuan Shi
MedIm
133
3
0
06 Mar 2025
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery
Yiheng Zhu
Mingyang Li
Junlong Liu
Kun Fu
Jian Wu
Yue Liu
Mingze Yin
Jieping Ye
Jian Wu
Zehua Wang
145
0
0
06 Mar 2025
A Survey of Foundation Models for Environmental Science
Runlong Yu
Shengyu Chen
Yiqun Xie
X. Jia
AI4CE
141
1
0
05 Mar 2025
Self is the Best Learner: CT-free Ultra-Low-Dose PET Organ Segmentation via Collaborating Denoising and Segmentation Learning
Zanting Ye
Xiaolong Niu
Xu Han
Xuanbin Wu
Wantong Lu
Yijun Lu
Hao Sun
Yanchao Huang
Hubing Wu
Lijun Lu
86
0
0
05 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
443
0
0
05 Mar 2025
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca
Xianmei Lei
Jonathan Becktor
J. Edlund
Anna Sabel
Patrick Spieler
Curtis Padgett
Alexandre Alahi
Deegan Atha
151
0
0
05 Mar 2025
Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging
Y. Oh
Robert Seifert
Yihan Cao
Christoph Clement
Justin Ferdinandus
...
Xuzhao Li
P. Heidari
Axel Rominger
Kuangyu Shi
Quanzheng Li
ViT, MedIm
90
0
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
190
0
0
04 Mar 2025
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
Zhun Mou
Bin Xia
Zhengchao Huang
Wenming Yang
Jiaya Jia
VGen, ELM, LRM
107
1
0
04 Mar 2025
Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training
Paul Janson
Vaibhav Singh
Paria Mehrbod
Adam Ibrahim
Irina Rish
Eugene Belilovsky
Benjamin Thérien
CLL
130
1
0
04 Mar 2025
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
Xinying Hong
Siyu Li
Kang Zeng
Hao-miao Shi
Bomin Peng
Kailun Yang
Zehan Li
118
0
0
04 Mar 2025
Undertrained Image Reconstruction for Realistic Degradation in Blind Image Super-Resolution
Ru Ito
Supatta Viriyavisuthisakul
K. Kawamoto
Hiroshi Kera
107
0
0
04 Mar 2025
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting
Ali Caglayan
Nevrez Imamoglu
T. Kouyama
152
0
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
178
0
0
03 Mar 2025
A General Purpose Spectral Foundational Model for Both Proximal and Remote Sensing Spectral Imaging
William Michael Laprade
Jesper Cairo Westergaard
Svend Christensen
Mads Nielsen
Anders Bjorholm Dahl
108
0
0
03 Mar 2025
Primus: Enforcing Attention Usage for 3D Medical Image Segmentation
Tassilo Wald
Saikat Roy
Fabian Isensee
Constantin Ulrich
Sebastian Ziegler
D. Trofimova
Raphael Stock
Michael Baumgartner
Gregor Köhler
Klaus H. Maier-Hein
ViT, MedIm
79
1
0
03 Mar 2025
Lossy Neural Compression for Geospatial Analytics: A Review
Carlos Gomes
Isabelle Wittmann
Damien Robert
Johannes Jakubik
Tim Reichelt
...
Romeo Kienzler
Rania Briq
Sabrina Benassou
Michele Lazzarini
C. Albrecht
145
2
0
03 Mar 2025
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting
Suzhen Wang
Weijie Chen
Wei Zhang
Minda Zhao
Lincheng Li
Rongsheng Zhang
Zhibo Hu
Xin Yu
107
1
0
03 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
160
0
0
03 Mar 2025