Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.03580
Cited By
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
9 January 2023
Keyu Tian
Yi-Xin Jiang
Qishuai Diao
Chen Lin
Liwei Wang
Zehuan Yuan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
21 / 71 papers shown
Title
Adversarial Attacks on Foundational Vision Models
Nathan Inkawhich
Gwendolyn McDonald
R. Luley
VLM
35
12
0
28 Aug 2023
Self-Supervised Pre-Training with Contrastive and Masked Autoencoder Methods for Dealing with Small Datasets in Deep Learning for Medical Imaging
Daniel Wolf
Tristan Payer
C. Lisson
C. Lisson
Meinrad Beer
Michael Götz
Timo Ropinski
35
15
0
12 Aug 2023
Exploring Transformers for Open-world Instance Segmentation
Jiannan Wu
Yi-Xin Jiang
B. Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
ViT
30
5
0
08 Aug 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
22
27
0
14 Jul 2023
Augmentation-aware Self-supervised Learning with Conditioned Projector
Marcin Przewike'zlikowski
Mateusz Pyla
Bartosz Zieliñski
Bartlomiej Twardowski
Jacek Tabor
Marek Śmieja
SSL
37
2
0
31 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
26
36
0
23 May 2023
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Wenxuan Wang
Jing Liu
Xingjian He
Yisi Zhang
Cheng Chen
Jiachen Shen
Yan Zhang
Jiangyun Li
22
11
0
19 May 2023
An Inverse Scaling Law for CLIP Training
Xianhang Li
Zeyu Wang
Cihang Xie
VLM
CLIP
48
55
0
11 May 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
42
132
0
19 Apr 2023
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding
Dilxat Muhtar
Xue-liang Zhang
P. Xiao
Zhenshi Li
Feng-Xue Gu
SSL
40
50
0
19 Apr 2023
Multi-Level Contrastive Learning for Dense Prediction Task
Qiushan Guo
Yizhou Yu
Yi-Xin Jiang
Jiannan Wu
Zehuan Yuan
Ping Luo
SSL
32
2
0
04 Apr 2023
SSVMR: Saliency-based Self-training for Video-Music Retrieval
Xuxin Cheng
Zhihong Zhu
Hongxiang Li
Yaowei Li
Yuexian Zou
24
29
0
18 Feb 2023
Self-Supervised Visual Representation Learning via Residual Momentum
T. Pham
Axi Niu
Zhang Kang
Sultan Rizky Hikmawan Madjid
Jiajing Hong
Daehyeok Kim
Joshua Tian Jin Tee
Chang D. Yoo
SSL
46
6
0
17 Nov 2022
A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding
Zhihong Zhu
Weiyuan Xu
Xuxin Cheng
Tengtao Song
Yuexian Zou
27
22
0
08 Nov 2022
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Siyuan Li
Di Wu
Fang Wu
Lei Shang
Stan.Z.Li
32
48
0
27 May 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
ResNet strikes back: An improved training procedure in timm
Ross Wightman
Hugo Touvron
Hervé Jégou
AI4TS
212
487
0
01 Oct 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
267
3,371
0
09 Mar 2020
Spatial Transformer Networks
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
146
7,337
0
05 Jun 2015
Previous
1
2