2111.06377
Cited By
Masked Autoencoders Are Scalable Vision Learners
11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
Papers citing "Masked Autoencoders Are Scalable Vision Learners"
50 / 4,611 papers shown
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
71
0
0
10 Mar 2025
Effective and Efficient Masked Image Generation Models
Zebin You
Jingyang Ou
Xiaolu Zhang
Jun Hu
Jun Zhou
Chongxuan Li
DiffM
VLM
54
1
0
10 Mar 2025
V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation
Guiwei Zhang
Tianyu Zhang
Mohan Zhou
Yalong Bai
Biye Li
59
0
0
10 Mar 2025
MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction
H. Q. Vo
Pengyu Yuan
Zheng Yin
Kelvin K. Wong
Chika F. Ezeana
S. Ly
Stephen T. C. Wong
H. Nguyen
39
0
0
10 Mar 2025
Keeping Representation Similarity in Finetuning for Medical Image Analysis
Wenqiang Zu
Shenghao Xie
Hao Chen
Yiming Liang
Lei Ma
MedIm
OOD
43
0
0
10 Mar 2025
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation
Ding Zhong
Xu Zheng
Chenfei Liao
Yuanhuiyi Lyu
Jialei Chen
Shengyang Wu
Linfeng Zhang
Xuming Hu
VLM
53
4
0
10 Mar 2025
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Xavier Thomas
Deepti Ghadiyaram
DiffM
87
0
0
09 Mar 2025
M³amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification
Mingxiang Cao
Weiying Xie
Xin Zhang
Jiaqing Zhang
Kai Jiang
Jie Lei
Yunsong Li
Mamba
44
0
0
09 Mar 2025
MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages
Hao Xu
Tengfei Xue
Dongnan Liu
Yuqian Chen
Fan Zhang
C. Westin
Ron Kikinis
L. O’Donnell
Weidong Cai
41
0
0
09 Mar 2025
CLICv2: Image Complexity Representation via Content Invariance Contrastive Learning
Shipeng Liu
Liang Zhao
Dengfeng Chen
SSL
96
0
0
09 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Jason Eshraghian
Haicheng Qu
J. Zhang
Malu Zhang
Y. Yang
42
0
0
09 Mar 2025
Pathology-Guided AI System for Accurate Segmentation and Diagnosis of Cervical Spondylosis
Qi Zhang
Xiuyuan Chen
Ziyi He
Lianming Wu
Kun Wang
Jianqi Sun
Hongxing Shen
51
0
0
08 Mar 2025
Segment Anything, Even Occluded
Wei-En Tai
Yu-Lin Shih
Cheng Sun
Y. Wang
Hwann-Tzong Chen
VLM
62
0
0
08 Mar 2025
Dynamically evolving segment anything model with continuous learning for medical image segmentation
Zhaori Liu
Mengyang Li
Hu Han
Enli Zhang
Shiguang Shan
Zhiming Zhao
VLM
57
0
0
08 Mar 2025
Exploring Interpretability for Visual Prompt Tuning with Hierarchical Concepts
Yubin Wang
Xinyang Jiang
De Cheng
Xiangqian Zhao
Zilong Wang
Dongsheng Li
Cairong Zhao
VLM
67
0
0
08 Mar 2025
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
Xiangxiang Chu
Renda Li
Yong Wang
60
0
0
08 Mar 2025
WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining
Haoran Wang
Lian Huai
Wenbin Li
Lei Qi
Xingqun Jiang
Yinghuan Shi
MedIm
61
2
0
06 Mar 2025
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery
Yiheng Zhu
Mingyang Li
Junlong Liu
Kun Fu
J. Wu
Q. Li
Mingze Yin
Jieping Ye
Jian Wu
Z. Wang
60
0
0
06 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
151
0
0
05 Mar 2025
Self is the Best Learner: CT-free Ultra-Low-Dose PET Organ Segmentation via Collaborating Denoising and Segmentation Learning
Zanting Ye
Xiaolong Niu
Xuanbin Wu
Wantong Lu
Lijun Lu
46
0
0
05 Mar 2025
A Survey of Foundation Models for Environmental Science
Runlong Yu
Shengyu Chen
Yiqun Xie
X. Jia
AI4CE
59
1
0
05 Mar 2025
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca
Xianmei Lei
Jonathan Becktor
J. Edlund
Anna Sabel
Patrick Spieler
Curtis Padgett
Alexandre Alahi
Deegan Atha
50
0
0
05 Mar 2025
Undertrained Image Reconstruction for Realistic Degradation in Blind Image Super-Resolution
Ru Ito
Supatta Viriyavisuthisakul
K. Kawamoto
Hiroshi Kera
71
0
0
04 Mar 2025
Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training
Paul Janson
Vaibhav Singh
Paria Mehrbod
Adam Ibrahim
Irina Rish
Eugene Belilovsky
Benjamin Thérien
CLL
73
0
0
04 Mar 2025
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
Xinying Hong
Siyu Li
Kang Zeng
Hao-miao Shi
Bomin Peng
Kailun Yang
Z. Li
57
0
0
04 Mar 2025
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
Zhun Mou
Bin Xia
Zhengchao Huang
Wenming Yang
Jiaya Jia
VGen
ELM
LRM
63
0
0
04 Mar 2025
Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging
Y. Oh
Robert Seifert
Yihan Cao
Christoph Clement
Justin Ferdinandus
...
X. Li
P. Heidari
Axel Rominger
Kuangyu Shi
Quanzheng Li
ViT
MedIm
36
0
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
71
0
0
04 Mar 2025
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting
Ali Caglayan
Nevrez Imamoglu
T. Kouyama
65
0
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
60
0
0
03 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
58
0
0
03 Mar 2025
A General Purpose Spectral Foundational Model for Both Proximal and Remote Sensing Spectral Imaging
William Michael Laprade
Jesper Cairo Westergaard
Svend Christensen
Mads Nielsen
Anders Bjorholm Dahl
63
0
0
03 Mar 2025
Lossy Neural Compression for Geospatial Analytics: A Review
Carlos Gomes
Isabelle Wittmann
Damien Robert
Johannes Jakubik
Tim Reichelt
...
Romeo Kienzler
Rania Briq
Sabrina Benassou
Michele Lazzarini
C. Albrecht
88
2
0
03 Mar 2025
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting
Suzhen Wang
Weijie Chen
Wei Zhang
Minda Zhao
Lincheng Li
Rongsheng Zhang
Z. Hu
Xin Yu
63
1
0
03 Mar 2025
Primus: Enforcing Attention Usage for 3D Medical Image Segmentation
Tassilo Wald
Saikat Roy
Fabian Isensee
Constantin Ulrich
Sebastian Ziegler
D. Trofimova
Raphael Stock
Michael Baumgartner
Gregor Köhler
Klaus H. Maier-Hein
ViT
MedIm
42
1
0
03 Mar 2025
Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models
Jeffrey Gu
Serena Yeung-Levy
AI4CE
29
0
0
02 Mar 2025
Confounder-Aware Medical Data Selection for Fine-Tuning Pretrained Vision Models
Anyang Ji
Qingbo Kang
Wei Xu
Changfan Wang
Kang Li
Qicheng Lao
26
0
0
02 Mar 2025
Random Walks in Self-supervised Learning for Triangular Meshes
Gal Yefet
A. Tal
SSL
55
0
0
02 Mar 2025
Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation
Wenzhao Xiang
Chang Liu
Hongyang Yu
Xilin Chen
29
0
0
02 Mar 2025
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
Tianyi Wang
Jianan Fan
Dingxin Zhang
Dongnan Liu
Yong-quan Xia
Heng Huang
Weidong Cai
34
0
0
01 Mar 2025
Split Adaptation for Pre-trained Vision Transformers
Lixu Wang
Bingqi Shang
Y. Li
Payal Mohapatra
Wei Dong
Xiao-Xu Wang
Qi Zhu
ViT
43
0
0
01 Mar 2025
Soften the Mask: Adaptive Temporal Soft Mask for Efficient Dynamic Facial Expression Recognition
Mengzhu Li
Quanxing Zha
Hongjun Wu
CVBM
53
0
0
28 Feb 2025
Unsupervised Parameter Efficient Source-free Post-pretraining
Abhishek Jha
Tinne Tuytelaars
Yuki M. Asano
OOD
43
0
0
28 Feb 2025
SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition
Shanshan Wan
Yingmei Wei
Lai Kang
Tianrui Shen
Haixuan Wang
Yee-Hong Yang
41
0
0
28 Feb 2025
CuPID: Leveraging Masked Single-Lead ECG Modelling for Enhancing the Representations
A. Atienza
G. Manimaran
J. Bardram
S. Puthusserypady
37
0
0
28 Feb 2025
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
46
1
0
28 Feb 2025
ALVI Interface: Towards Full Hand Motion Decoding for Amputees Using sEMG
A. Kovalev
Anna Makarova
Petr Chizhov
Matvey Antonov
Gleb Duplin
...
Viacheslav Gostevskii
Vladimir Bessonov
Andrey Tsurkan
Mikhail Korobok
Aleksejs Timčenko
36
0
0
28 Feb 2025
Anatomically-guided masked autoencoder pre-training for aneurysm detection
Alberto Mario Ceballos-Arroyo
Jisoo Kim
C. Lin
Lei Qin
Geoffrey S. Young
Huaizu Jiang
ViT
MedIm
33
0
0
28 Feb 2025
TimesBERT: A BERT-Style Foundation Model for Time Series Understanding
Haoran Zhang
Yong Liu
Yunzhong Qiu
Haixuan Liu
Zhongyi Pei
Jianmin Wang
Mingsheng Long
AI4TS
40
0
0
28 Feb 2025
Parallel-Learning of Invariant and Tempo-variant Attributes of Single-Lead Cardiac Signals: PLITA
A. Atienza
J. Bardram
S. Puthusserypady
33
0
0
28 Feb 2025