2111.06377
Masked Autoencoders Are Scalable Vision Learners
11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
Papers citing
"Masked Autoencoders Are Scalable Vision Learners"
50 / 4,779 papers shown
Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning
Zhiming Qian
VLM
SSL
59
0
0
11 Oct 2023
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
96
18
0
11 Oct 2023
IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training
Che Liu
Sibo Cheng
Miaojing Shi
Anand Shah
Wenjia Bai
Rossella Arcucci
94
27
0
11 Oct 2023
ProFSA: Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment
Bowen Gao
Yinjun Jia
Yuanle Mo
Yuyan Ni
Wei-Ying Ma
Zhiming Ma
Yanyan Lan
102
9
0
11 Oct 2023
Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images
Gabriele Campanella
Ricky Kwan
Eugene Fluder
Jennifer Zeng
A. Stock
...
Adam J. Schoenfeld
Chad M. Vanderbilt
P. Kovatch
Carlos Cordon-Cardo
Thomas J. Fuchs
MedIm
128
27
0
10 Oct 2023
Pre-Trained Masked Image Model for Mobile Robot Navigation
V. Sharma
Anukriti Singh
Pratap Tokekar
88
2
0
10 Oct 2023
Self-supervised Object-Centric Learning for Videos
Görkay Aydemir
Weidi Xie
Fatma Guney
OCL
VOS
SSL
87
29
0
10 Oct 2023
Uni3D: Exploring Unified 3D Representation at Scale
Junsheng Zhou
Jinsheng Wang
Baorui Ma
Yu-Shen Liu
Tiejun Huang
Xinlong Wang
121
98
0
10 Oct 2023
Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Xiaochen Ma
Jizhe Zhou
Xiong Xu
Zhuohang Jiang
Chi-Man Pun
67
0
0
10 Oct 2023
Watt For What: Rethinking Deep Learning's Energy-Performance Relationship
Shreyank N. Gowda
Xinyue Hao
Gen Li
Laura Sevilla-Lara
Shashank Narayana Gowda
HAI
93
12
0
10 Oct 2023
Self-Supervised Dataset Distillation for Transfer Learning
Dong Bok Lee
Seanie Lee
Joonho Ko
Kenji Kawaguchi
Juho Lee
Sung Ju Hwang
DD
93
3
0
10 Oct 2023
Antenna Response Consistency Driven Self-supervised Learning for WIFI-based Human Activity Recognition
Ke Xu
Jiangtao Wang
Erik Cambria
Dingchang Zheng
37
0
0
10 Oct 2023
Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox
Yubiao Yue
Zhenzhang Li
MedIm
53
0
0
10 Oct 2023
Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering
Xiulong Liu
Zhikang Dong
Peng Zhang
76
24
0
10 Oct 2023
Efficient Adaptation of Large Vision Transformer via Adapter Re-Composing
Wei Dong
Dawei Yan
Zhijun Lin
Peng Wang
80
24
0
10 Oct 2023
Layout Sequence Prediction From Noisy Mobile Modality
Haichao Zhang
Yi Tian Xu
Hongsheng Lu
Takayuki Shimizu
Yun Fu
54
1
0
09 Oct 2023
Large-Scale OD Matrix Estimation with A Deep Learning Method
Zheli Xiong
Defu Lian
Enhong Chen
Gang Chen
Xiaomin Cheng
43
0
0
09 Oct 2023
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
111
5
0
09 Oct 2023
Hierarchical Side-Tuning for Vision Transformers
Weifeng Lin
Ziheng Wu
Wentao Yang
Mingxin Huang
Jun Huang
Lianwen Jin
121
8
0
09 Oct 2023
In-Context Convergence of Transformers
Yu Huang
Yuan Cheng
Yingbin Liang
MLT
113
73
0
08 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
93
2
0
08 Oct 2023
Transferable Availability Poisoning Attacks
Yiyong Liu
Michael Backes
Xiao Zhang
AAML
78
3
0
08 Oct 2023
Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation
Dominik Hollidt
Clinton Jia Wang
Polina Golland
Marc Pollefeys
114
0
0
08 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Li Liu
Ming-Ming Cheng
SSL
71
2
0
08 Oct 2023
FairTune: Optimizing Parameter Efficient Fine Tuning for Fairness in Medical Image Analysis
Raman Dutt
Ondrej Bohdal
Sotirios A. Tsaftaris
Timothy M. Hospedales
129
14
0
08 Oct 2023
1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report: A Concise Pipeline for Egocentric Hand Pose Reconstruction
Zhishan Zhou
Zhi Lv
Shihao Zhou
Minqiang Zou
Tong Wu
Mochen Yu
Yao Tang
Jiajun Liang
77
4
0
07 Oct 2023
ConvNeXtv2 Fusion with Mask R-CNN for Automatic Region Based Coronary Artery Stenosis Detection for Disease Diagnosis
Sandesh Pokhrel
Sanjay Bhandari
Eduard Vazquez
Yash Raj Shrestha
Binod Bhattarai
MedIm
42
3
0
07 Oct 2023
Generalized Robust Test-Time Adaptation in Continuous Dynamic Scenarios
Shuangliang Li
Longhui Yuan
Binhui Xie
Tao Yang
TTA
82
2
0
07 Oct 2023
Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis
Siqi Du
Shengjun Tang
Weixi Wang
Xiaoming Li
Renzhong Guo
113
9
0
07 Oct 2023
AG-CRC: Anatomy-Guided Colorectal Cancer Segmentation in CT with Imperfect Anatomical Knowledge
Rongzhao Zhang
Zhian Bai
Ruoying Yu
Wenrao Pang
Lingyun Wang
Lifeng Zhu
Xiaofan Zhang
Huan Zhang
Weiguo Hu
53
1
0
07 Oct 2023
Metadata-Conditioned Generative Models to Synthesize Anatomically-Plausible 3D Brain MRIs
Wei Peng
Tomas Bosschieter
J. Ouyang
Robert Paul
Ehsan Adeli
Qingyu Zhao
K. Pohl
MedIm
91
9
0
07 Oct 2023
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Muhammad Osama Khan
Junbang Liang
Chun-Kai Wang
Shan Yang
Yu Lou
MDE
90
4
0
06 Oct 2023
FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning
Peiran Xu
Zeyu Wang
Jieru Mei
Liangqiong Qu
Alan Yuille
Cihang Xie
Yuyin Zhou
FedML
62
1
0
06 Oct 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
Yinda Chen
Wei-Ping Huang
Shenglong Zhou
Qi Chen
Zhiwei Xiong
73
26
0
06 Oct 2023
TiC: Exploring Vision Transformer in Convolution
Song Zhang
Qingzhong Wang
Jiang Bian
Haoyi Xiong
ViT
53
1
0
06 Oct 2023
Excision And Recovery: Visual Defect Obfuscation Based Self-Supervised Anomaly Detection Strategy
Yeonghyeon Park
Sungho Kang
Myung Jin Kim
Yeonho Lee
Hyeong Seok Kim
Juneho Yi
AAML
91
2
0
06 Oct 2023
Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation
Md Kaykobad Reza
Ashley Prater-Bennette
M. Salman Asif
81
8
0
06 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
79
4
0
06 Oct 2023
URLOST: Unsupervised Representation Learning without Stationarity or Topology
Zeyu Yun
Juexiao Zhang
Bruno A. Olshausen
Yann LeCun
231
1
0
06 Oct 2023
Diffusion Models as Masked Audio-Video Learners
Elvis Nunez
Yanzi Jin
Mohammad Rastegari
Sachin Mehta
Maxwell Horton
56
2
0
05 Oct 2023
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency
Tianhong Li
Sangnie Bhardwaj
Yonglong Tian
Han Zhang
Jarred Barber
Dina Katabi
Guillaume Lajoie
Huiwen Chang
Dilip Krishnan
VLM
102
5
0
05 Oct 2023
OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks
Ofir Bar Tal
Adi Haviv
Amit H. Bermano
AAML
79
0
0
05 Oct 2023
3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation
Chen Zhao
Tong Zhang
Mathieu Salzmann
3DH
74
9
0
05 Oct 2023
Exploring DINO: Emergent Properties and Limitations for Synthetic Aperture Radar Imagery
Joseph A. Gallego-Mejia
Anna Jungbluth
Laura Martínez-Ferrer
Matt Allen
Francisco Dorr
F. Kalaitzis
Raúl Ramos-Pollán
45
3
0
05 Oct 2023
EAG-RS: A Novel Explainability-guided ROI-Selection Framework for ASD Diagnosis via Inter-regional Relation Learning
Wonsik Jung
Eunjin Jeon
Eunsong Kang
Heung-Il Suk
42
8
0
05 Oct 2023
StegGuard: Fingerprinting Self-supervised Pre-trained Encoders via Secrets Embeder and Extractor
Xingdong Ren
Tianxing Zhang
Hanzhou Wu
Xinpeng Zhang
Yinggui Wang
Guangling Sun
LLMSV
89
0
0
05 Oct 2023
AI-based automated active learning for discovery of hidden dynamic processes: A use case in light microscopy
Nils Friederich
Angelo Jovin Yamachui Sitcheu
Oliver Neumann
Süheyla Eroglu-Kayikçi
Roshan Prizak
Lennart Hilbert
Ralf Mikut
63
2
0
05 Oct 2023
Reinforcement Learning-based Mixture of Vision Transformers for Video Violence Recognition
Hamid Reza Mohammadi
Ehsan Nazerfard
Tahereh Firoozi
ViT
74
2
0
04 Oct 2023
Human-oriented Representation Learning for Robotic Manipulation
Mingxiao Huo
Mingyu Ding
Chenfeng Xu
Thomas Tian
Xinghao Zhu
Yao Mu
Lingfeng Sun
Masayoshi Tomizuka
Wei Zhan
SSL
106
12
0
04 Oct 2023
Multiple Physics Pretraining for Physical Surrogate Models
Michael McCabe
Bruno Régaldo-Saint Blancard
Liam Parker
Ruben Ohana
M. Cranmer
...
Francois Lanusse
Mariel Pettee
Tiberiu Teşileanu
Kyunghyun Cho
Shirley Ho
PINN
AI4CE
110
56
0
04 Oct 2023