2204.07141
Cited By
Masked Siamese Networks for Label-Efficient Learning
14 April 2022
Mahmoud Assran
Mathilde Caron
Ishan Misra
Piotr Bojanowski
Florian Bordes
Pascal Vincent
Armand Joulin
Michael G. Rabbat
Nicolas Ballas
SSL
Papers citing
"Masked Siamese Networks for Label-Efficient Learning"
50 / 226 papers shown
Title
Simple Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization
Seongjae Kang
Dong Bok Lee
Hyungjoon Jang
Sung Ju Hwang
VLM
52
0
0
12 May 2025
seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models
Hafez Ghaemi
Eilif Muller
Shahab Bakhtiari
49
0
0
06 May 2025
Towards Generalizability to Tone and Content Variations in the Transcription of Amplifier Rendered Electric Guitar Audio
Yu-Hua Chen
Yuan-Chiao Cheng
Yen-Tung Yeh
Jui-Te Wu
J. Jang
Yi-Hsuan Yang
36
0
0
10 Apr 2025
REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval
Shabnam Choudhury
Yash Salunkhe
Sarthak Mehrotra
Biplab Banerjee
34
0
0
04 Apr 2025
Towards Generalizing Temporal Action Segmentation to Unseen Views
Emad Bahrami
Olga Zatsarynna
Gianpiero Francesca
Juergen Gall
EgoV
41
0
0
03 Apr 2025
Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings
Chengan Che
Chao Wang
Tom Vercauteren
Sophia Tsoka
Luis C. García-Peraza-Herrera
MedIm
46
0
0
25 Mar 2025
ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning
Chau Pham
Juan C. Caicedo
Bryan A. Plummer
44
0
0
25 Mar 2025
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection
Gensheng Pei
Tao Chen
Yujia Wang
Xinhao Cai
Xiangbo Shu
Tianfei Zhou
Yazhou Yao
VLM
53
1
0
21 Mar 2025
Robustness Tokens: Towards Adversarial Robustness of Transformers
Brian Pulfer
Yury Belousov
S. Voloshynovskiy
AAML
45
0
0
13 Mar 2025
Multi-Modal Foundation Models for Computational Pathology: A Survey
Dong Li
Guihong Wan
Xintao Wu
Xinyu Wu
Xiaohui Chen
Yi He
Christine G. Lian
Peter K. Sorger
Yevgeniy R. Semenov
Chen Zhao
MedIm
46
0
0
12 Mar 2025
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
Hariprasath Govindarajan
Maciej K. Wozniak
Marvin Klingner
Camille Maurice
B. R. Kiran
S. Yogamani
53
0
0
12 Mar 2025
Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer
Yury Belousov
Vitaliy Kinakh
Teddy Furon
S. Voloshynovskiy
AAML
68
0
0
05 Mar 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
65
8
0
24 Feb 2025
From Pixels to Components: Eigenvector Masking for Visual Representation Learning
Alice Bizeul
Thomas M. Sutter
Alain Ryser
Bernhard Schölkopf
Julius von Kügelgen
Julia E. Vogt
86
1
0
10 Feb 2025
Detecting Content Rating Violations in Android Applications: A Vision-Language Approach
Dishanika Denipitiyage
B. Silva
Suranga Seneviratne
A. Seneviratne
Sanjay Chawla
38
0
0
07 Feb 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
124
0
0
21 Jan 2025
Solving the Catastrophic Forgetting Problem in Generalized Category Discovery
Xinzi Cao
Xiawu Zheng
G. Wang
Weijiang Yu
Yunhang Shen
Ke Li
Yutong Lu
Yonghong Tian
CLL
37
4
0
09 Jan 2025
Efficient Object-centric Representation Learning with Pre-trained Geometric Prior
Phúc H. Lê Khắc
Graham Healy
A. Smeaton
OCL
79
0
0
16 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewięźlikowski
Randall Balestriero
Wojciech Jasiński
Marek Śmieja
Bartosz Zieliński
69
0
0
04 Dec 2024
Effective Fine-Tuning of Vision-Language Models for Accurate Galaxy Morphology Analysis
Ruoqi Wang
Haitao Wang
Qiong Luo
71
0
0
29 Nov 2024
RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training
Raktim Gautam Goswami
P. Krishnamurthy
Yann LeCun
Farshad Khorrami
90
1
0
26 Nov 2024
Design-o-meter: Towards Evaluating and Refining Graphic Designs
Sahil Goyal
Abhinav Mahajan
Swasti Mishra
Prateksha Udhayanan
Tripti Shukla
K. J. Joseph
Balaji Vasan Srinivasan
75
1
0
22 Nov 2024
BioNCERE: Non-Contrastive Enhancement For Relation Extraction In Biomedical Texts
Farshad Noravesh
34
0
0
31 Oct 2024
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving
Maciej K. Wozniak
Hariprasath Govindarajan
Marvin Klingner
Camille Maurice
B Ravi Kiran
S. Yogamani
3DPC
47
1
0
30 Oct 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
Zhiyuan Min
Yawei Luo
Jianwen Sun
Yi Yang
3DGS
36
0
0
30 Oct 2024
A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Zhong Ji
S. M. I. Simon X. Yang
Jingren Liu
Yanwei Pang
Jungong Han
28
0
0
29 Oct 2024
On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods
Hariprasath Govindarajan
Per Sidén
Jacob Roll
Fredrik Lindsten
24
2
0
17 Oct 2024
Forte: Finding Outliers with Representation Typicality Estimation
Debargha Ganguly
Warren Morningstar
A. Yu
Vipin Chaudhary
OODD
39
0
0
02 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
52
2
0
02 Oct 2024
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
Shu Yang
Zhiyuan Cai
Luyang Luo
Ning Ma
Shuchang Xu
Hao Chen
20
0
0
30 Sep 2024
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Haonan Lin
Wenbin An
Jiahao Wang
Yan Chen
Feng Tian
Mengmeng Wang
Guang Dai
Qianying Wang
Jingdong Wang
34
2
0
29 Sep 2024
Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images
Donghwan Kim
Tae-Kyun Kim
DiffM
3DH
33
1
0
27 Sep 2024
Domain-Invariant Representation Learning of Bird Sounds
Ilyass Moummad
Romain Serizel
Emmanouil Benetos
Nicolas Farrugia
SSL
35
2
0
13 Sep 2024
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers
Asifullah Khan
A. Sohail
M. Fiaz
Mehdi Hassan
Tariq Habib Afridi
...
Muhammad Zaigham Zaheer
Kamran Ali
Tangina Sultana
Ziaurrehman Tanoli
Naeem Akhter
45
3
0
30 Aug 2024
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models
Dibaloke Chanda
Milan Aryal
Nasim Yahya Soltani
Masoud Ganji
AI4CE
VLM
34
7
0
23 Aug 2024
Zero-Shot Object-Centric Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Anirudh Goyal
Mike Mozer
Yoshua Bengio
Georg Martius
Maximilian Seitzer
VLM
OCL
37
4
0
17 Aug 2024
SpectralEarth: Training Hyperspectral Foundation Models at Scale
Nassim Ait Ali Braham
C. Albrecht
Julien Mairal
J. Chanussot
Yi Wang
X. Zhu
36
12
0
15 Aug 2024
Masked Image Modeling: A Survey
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
N. Sebe
64
6
0
13 Aug 2024
Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning
Xinrong Hu
Dewen Zeng
Yawen Wu
Xueyang Li
Yiyu Shi
ViT
MedIm
39
0
0
12 Aug 2024
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification
Bin Hu
Xinggang Wang
Wenyu Liu
ViT
33
3
0
10 Aug 2024
HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts
Hongjun Wang
S. Vaze
Kai Han
72
4
0
08 Aug 2024
POA: Pre-training Once for Models of All Sizes
Yingying Zhang
Xin Guo
Jiangwei Lao
Lei Yu
Lixiang Ru
Jian Wang
Guo Ye
Huimei He
Jingdong Chen
Ming Yang
60
1
0
02 Aug 2024
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su
Shihao Ji
24
0
0
24 Jul 2024
A Multi-view Mask Contrastive Learning Graph Convolutional Neural Network for Age Estimation
Yiping Zhang
Yuntao Shou
Tao Meng
Wei Ai
Keqin Li
CVBM
43
10
0
23 Jul 2024
Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning
Chen Shen
Chunfeng Lian
Wanqing Zhang
Fan Wang
Jianhua Zhang
...
Hongshu Mu
Hao Wu
Xinggong Liang
Jianhua Ma
Zhenyuan Wang
36
0
0
20 Jul 2024
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas
Georgios Tzimiropoulos
Ioannis Patras
SSL
27
0
0
15 Jul 2024
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences
Soroush Mehraban
Mohammad Javad Rajabi
Babak Taati
3DPC
29
0
0
15 Jul 2024
Towards zero-shot amplifier modeling: One-to-many amplifier modeling via tone embedding control
Yu-Hua Chen
Yen-Tung Yeh
Yuan-Chiao Cheng
Jui-Te Wu
Yu-Hsiang Ho
J. Jang
Yi-Hsuan Yang
35
5
0
15 Jul 2024
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
20
1
0
05 Jul 2024
PathAlign: A vision-language model for whole slide images in histopathology
Faruk Ahmed
Andrew Sellergren
Lin Yang
Shawn Xu
Boris Babenko
...
S. Shetty
Daniel Golden
Yun-hui Liu
David F. Steiner
Ellery Wulczyn
LM&MA
VLM
36
14
0
27 Jun 2024