Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.06377
Cited By
v1
v2
v3 (latest)
Masked Autoencoders Are Scalable Vision Learners
11 November 2021
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked Autoencoders Are Scalable Vision Learners"
50 / 4,779 papers shown
Title
Motion-Guided Masking for Spatiotemporal Representation Learning
D. Fan
Jue Wang
Shuai Liao
Yi Zhu
Vimal Bhat
H. Santos-Villalobos
M. Rohith
Xinyu Li
VGen
83
22
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
72
5
0
24 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
81
0
0
24 Aug 2023
SieveNet: Selecting Point-Based Features for Mesh Networks
Shengchao Yuan
Yishun Dou
Rui Shi
Bingbing Ni
Zhong Zheng
3DPC
57
0
0
24 Aug 2023
Masked Autoencoders are Efficient Class Incremental Learners
Jiang-Tian Zhai
Xialei Liu
Andrew D. Bagdanov
Ke Li
Mingg-Ming Cheng
CLL
75
15
0
24 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
83
2
0
23 Aug 2023
Self-Supervised Learning for Endoscopic Video Analysis
Roy Hirsch
Mathilde Caron
Regev Cohen
Amir Livne
Ron Shapiro
Tomer Golany
Roman Goldenberg
Daniel Freedman
Ehud Rivlin
SSL
76
20
0
23 Aug 2023
Language Reward Modulation for Pretraining Reinforcement Learning
Ademi Adeniji
Amber Xie
Carmelo Sferrazza
Younggyo Seo
Stephen James
Pieter Abbeel
103
29
0
23 Aug 2023
SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation
Qing Xu
Wenwei Kuang
Zeyu Zhang
Xueyao Bao
Haoran Chen
Wenting Duan
VLM
60
14
0
23 Aug 2023
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
Christoph Reich
Tim Prangemeier
Heinz Koeppl
65
0
0
23 Aug 2023
DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration
Nana Zhou
Jiaxin Chen
Di Huang
72
4
0
23 Aug 2023
Local Distortion Aware Efficient Transformer Adaptation for Image Quality Assessment
Kangmin Xu
Liang Liao
Jing Xiao
Chaofeng Chen
Haoning Wu
Qiong Yan
Weisi Lin
ViT
57
5
0
23 Aug 2023
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Junyi Chen
Longteng Guo
Jianxiang Sun
Shuai Shao
Zehuan Yuan
Liang Lin
Dongyu Zhang
MLLM
VLM
MoE
77
10
0
23 Aug 2023
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
VOS
81
21
0
22 Aug 2023
Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models
S. N. Hari
Matt Thomson
65
13
0
22 Aug 2023
TrackFlow: Multi-Object Tracking with Normalizing Flows
Gianluca Mancusi
Aniello Panariello
Angelo Porrello
Matteo Fabbri
Simone Calderara
Rita Cucchiara
VOT
67
14
0
22 Aug 2023
Addressing the Accuracy-Cost Tradeoff in Material Property Prediction: A Teacher-Student Strategy
Dong Zhu
Zhikuang Xin
Siming Zheng
Yangang Wang
Xiaoyu Yang
60
0
0
22 Aug 2023
A Survey on Self-Supervised Representation Learning
Tobias Uelwer
Jan Robine
Stefan Sylvius Wagner
Marc Höftmann
Eric Upschulte
S. Konietzny
Maike Behrendt
Stefan Harmeling
SSL
AI4TS
OOD
140
12
0
22 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
104
4
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP
VLM
93
3
0
22 Aug 2023
CiteTracker: Correlating Image and Text for Visual Tracking
Xin Li
Yuqing Huang
Zhenyu He
Yaowei Wang
Huchuan Lu
Ming-Hsuan Yang
101
31
0
22 Aug 2023
HMD-NeMo: Online 3D Avatar Motion Generation From Sparse Observations
S. Aliakbarian
F. Saleh
David Collier
Pashmina Cameron
Darren Cosker
3DH
74
13
0
22 Aug 2023
SegRNN: Segment Recurrent Neural Network for Long-Term Time Series Forecasting
Shengsheng Lin
Weiwei Lin
Wentai Wu
Feiyu Zhao
Ruichao Mo
Haotong Zhang
AI4TS
80
62
0
22 Aug 2023
ConcatPlexer: Additional Dim1 Batching for Faster ViTs
D. Han
Seunghyeon Seo
D. Jeon
Jiho Jang
Chaerin Kong
Nojun Kwak
ViT
MoE
63
0
0
22 Aug 2023
SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection
Dalong Zheng
Zebin Wu
Jia-Wei Liu
Zhihui Wei
ViT
86
0
0
22 Aug 2023
Audio-Visual Class-Incremental Learning
Weiguo Pian
Shentong Mo
Yunhui Guo
Yapeng Tian
CLL
VLM
87
30
0
21 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
81
36
0
21 Aug 2023
Patch Is Not All You Need
Chang-bo Li
Jie Zhang
Yang Wei
Zhilong Ji
Jinfeng Bai
Shiguang Shan
ViT
69
2
0
21 Aug 2023
Spatial Transform Decoupling for Oriented Object Detection
Hongtian Yu
Yunjie Tian
QiXiang Ye
Yunfan Liu
83
29
0
21 Aug 2023
Towards Accelerated Model Training via Bayesian Data Selection
Zhijie Deng
Peng Cui
Jun Zhu
89
5
0
21 Aug 2023
Dataset Quantization
Daquan Zhou
Kaixin Wang
Jianyang Gu
Xiang Peng
Dongze Lian
Yifan Zhang
Yang You
Jiashi Feng
DD
94
41
0
21 Aug 2023
Information Theory-Guided Heuristic Progressive Multi-View Coding
Jiangmeng Li
Hang Gao
Jingyao Wang
Changwen Zheng
91
2
0
21 Aug 2023
Frequency Compensated Diffusion Model for Real-scene Dehazing
Jing Wang
Songtao Wu
Kuanhong Xu
Zhiqiang Yuan
DiffM
85
32
0
21 Aug 2023
X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
Bo Dai
Linge Wang
Baoxiong Jia
Zeyu Zhang
Song-Chun Zhu
Fangqiu Yi
Yixin Zhu
70
3
0
21 Aug 2023
Diffusion Model as Representation Learner
Xingyi Yang
Xinchao Wang
DiffM
86
60
0
21 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
113
9
0
20 Aug 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
70
1
0
20 Aug 2023
A Review on Objective-Driven Artificial Intelligence
Apoorv Singh
46
3
0
20 Aug 2023
Efficient Representation Learning for Healthcare with Cross-Architectural Self-Supervision
P. Singh
Jacopo Cirrone
OOD
SSL
89
2
0
19 Aug 2023
UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
Meiqi Sun
Zhonghan Zhao
Wenhao Chai
Hanjun Luo
Shidong Cao
Yanting Zhang
Lei Li
Gaoang Wang
59
8
0
19 Aug 2023
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
105
16
0
19 Aug 2023
Recap: Detecting Deepfake Video with Unpredictable Tampered Traces via Recovering Faces and Mapping Recovered Faces
Juan Hu
Xin Liao
Difei Gao
Satoshi Tsutsui
Qian Wang
Zheng Qin
Mike Zheng Shou
CVBM
AAML
58
1
0
19 Aug 2023
Scalable Video Object Segmentation with Simplified Framework
Qiangqiang Wu
Tianyu Yang
WU Wei
Antoni B. Chan
VOS
80
26
0
19 Aug 2023
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders
Jie Cheng
Xiaodong Mei
Ming-Yuan Liu
98
60
0
19 Aug 2023
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control
Zi-Yuan Hu
Yanyang Li
Michael R. Lyu
Liwei Wang
VLM
90
16
0
18 Aug 2023
A Lightweight Transformer for Faster and Robust EBSD Data Collection
Harry Dong
S. Donegan
M. Shah
Yuejie Chi
72
2
0
18 Aug 2023
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction
Yucheng Shi
Yushun Dong
Qiaoyu Tan
Jundong Li
Ninghao Liu
140
27
0
18 Aug 2023
Language-guided Human Motion Synthesis with Atomic Actions
Yuanhao Zhai
Mingzhen Huang
Tianyu Luan
Lu Dong
Ifeoma Nwogu
Siwei Lyu
David Doermann
Junsong Yuan
79
13
0
18 Aug 2023
Diverse Cotraining Makes Strong Semi-Supervised Segmentor
Yijiang Li
Xinjiang Wang
Lihe Yang
Xue Jiang
Wayne Zhang
Ying Gao
89
19
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Y. Guo
Qiong Liu
Hao Wen
Xiaoping Zhou
3DPC
72
16
0
18 Aug 2023
Previous
1
2
3
...
58
59
60
...
94
95
96
Next