Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation
Fangxun Shu
Yue Liao
Le Zhuo
Chenning Xu
Guanghao Zhang
...
Bolin Li
Zhelun Yu
Si Liu
Hongsheng Li
Hao Jiang
VLM
MoE
68
18
0
28 Aug 2024
Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail
Bianca Lamm
Janis Keuper
92
2
0
28 Aug 2024
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Jinglun Li
Xinyu Zhou
Pinxue Guo
Yixuan Sun
Yiwen Huang
Weifeng Ge
Wenqiang Zhang
93
2
0
28 Aug 2024
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration
Xu Zhang
Jiaqi Ma
Guoli Wang
Qian Zhang
Huan Zhang
Lefei Zhang
VLM
183
10
0
28 Aug 2024
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
82
0
0
27 Aug 2024
A Preliminary Exploration Towards General Image Restoration
Xiangtao Kong
Jinjin Gu
Yihao Liu
Wenlong Zhang
Xiangyu Chen
Yu Qiao
Chao Dong
DiffM
90
3
0
27 Aug 2024
Applying ViT in Generalized Few-shot Semantic Segmentation
Liyuan Geng
Jinhong Xia
Yuanhe Guo
55
1
0
27 Aug 2024
NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework
Shuangchen Zhao
Changde Du
Hui Li
Huiguang He
72
0
0
27 Aug 2024
Revisiting Surgical Instrument Segmentation Without Human Intervention: A Graph Partitioning View
Mingyu Sheng
Jianan Fan
Dongnan Liu
Ron Kikinis
Weidong Cai
72
2
0
27 Aug 2024
Pre-training Everywhere: Parameter-Efficient Fine-Tuning for Medical Image Analysis via Target Parameter Pre-training
Xingliang Lei
Yiwen Ye
Zhisong Wang
Ziyang Chen
Minglei Shu
Weidong (Tom) Cai
Yanning Zhang
Yong-quan Xia
106
1
0
27 Aug 2024
The Benefits of Balance: From Information Projections to Variance Reduction
Lang Liu
Ronak R. Mehta
Soumik Pal
Zaïd Harchaoui
81
0
0
27 Aug 2024
SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery
Sarah Rastegar
Mohammadreza Salehi
Yuki M. Asano
Hazel Doughty
Cees G. M. Snoek
94
5
0
26 Aug 2024
Affine steerers for structured keypoint description
Georg Bökman
Johan Edstedt
Michael Felsberg
Fredrik Kahl
LLMSV
74
2
0
26 Aug 2024
An Embedding is Worth a Thousand Noisy Labels
Francesco Di Salvo
Sebastian Doerrich
Ines Rieger
Christian Ledig
NoLa
153
0
0
26 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
106
0
0
24 Aug 2024
Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
104
0
0
24 Aug 2024
Can Visual Foundation Models Achieve Long-term Point Tracking?
Görkay Aydemir
Weidi Xie
Fatma Guney
80
8
0
24 Aug 2024
Online Continuous Generalized Category Discovery
Keon-Hee Park
Hakyung Lee
Kyungwoo Song
Gyeong-Moon Park
CLL
BDL
93
0
0
24 Aug 2024
Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing
Yangyang Xu
Wenqi Shao
Yong Du
Haiming Zhu
Yang Zhou
Ping Luo
Shengfeng He
DiffM
78
2
0
23 Aug 2024
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
73
0
0
23 Aug 2024
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models
Dibaloke Chanda
Milan Aryal
Nasim Yahya Soltani
Masoud Ganji
AI4CE
VLM
133
7
0
23 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
142
12
0
23 Aug 2024
Find the Assembly Mistakes: Error Segmentation for Industrial Applications
Dan Lehman
Tim J. Schoonbeek
Shao-Hsuan Hung
Jacek Kustra
Peter H. N. de With
Fons van der Sommen
UQCV
63
0
0
23 Aug 2024
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey
Qika Lin
Yifan Zhu
Xin Mei
Ling Huang
Jingying Ma
Kai He
Zhen Peng
Min Zhang
Mengling Feng
109
23
0
23 Aug 2024
Symmetric masking strategy enhances the performance of Masked Image Modeling
Khanh-Binh Nguyen
Chae Jung Park
130
0
0
23 Aug 2024
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
143
81
0
22 Aug 2024
Cross-Domain Foundation Model Adaptation: Pioneering Computer Vision Models for Geophysical Data Analysis
Zhixiang Guo
Xinming Wu
Luming Liang
Hanlin Sheng
Nuo Chen
Zhengfa Bi
AI4CE
102
4
0
22 Aug 2024
Class-balanced Open-set Semi-supervised Object Detection for Medical Images
Zhanyun Lu
Renshu Gu
Huimin Cheng
Siyu Pang
Mingyu Xu
...
Yaqi Wang
Yuichiro Kinoshita
Juan Ye
Gangyong Jia
Qing Wu
86
0
0
22 Aug 2024
Vision-Based Detection of Uncooperative Targets and Components on Small Satellites
Hannah Grauer
E. Lupu
Connor T. Lee
Soon-Jo Chung
Darren Rowen
Benjamen P. Bycroft
Phaedrus Leeds
John Brader
69
1
0
22 Aug 2024
Supervised Representation Learning towards Generalizable Assembly State Recognition
Tim J. Schoonbeek
Goutham Balachandran
H. Onvlee
Tim Houben
Shao-Hsuan Hung
Jacek Kustra
Peter H. N. de With
Fons van der Sommen
81
1
0
21 Aug 2024
EMCNet : Graph-Nets for Electron Micrographs Classification
Sakhinana Sagar Srinivas
Rajat Kumar Sarkar
Venkataramana Runkana
96
0
0
21 Aug 2024
Continual Gesture Learning without Data via Synthetic Feature Sampling
Zhenyu Lu
Hao Tang
SLR
68
0
0
21 Aug 2024
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Xiuwei Xu
Huangxing Chen
Linqing Zhao
Ziwei Wang
Jie Zhou
Jiwen Lu
119
16
0
21 Aug 2024
Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors
Fahad Shamshad
Muzammal Naseer
Karthik Nandakumar
AAML
PICV
85
1
0
20 Aug 2024
SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection
Huafeng Chen
Pengxu Wei
Guangqian Guo
Shan Gao
115
12
0
20 Aug 2024
Training Matting Models without Alpha Labels
Wenze Liu
Zixuan Ye
Hao Lu
Z. Cao
Xiangyu Yue
90
1
0
20 Aug 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
Alex N. Wang
Christopher Hoang
Yuwen Xiong
Yann LeCun
Mengye Ren
251
0
0
20 Aug 2024
Learning Precise Affordances from Egocentric Videos for Robotic Manipulation
Gen Li
Nikolaos Tsagkas
Jifei Song
Ruaridh Mon-Williams
S. Vijayakumar
Kun Shao
Laura Sevilla-Lara
80
9
0
19 Aug 2024
Exploiting Fine-Grained Prototype Distribution for Boosting Unsupervised Class Incremental Learning
Jiaming Liu
Hongyuan Liu
Zhili Qin
Wei Han
Yulu Fan
Qinli Yang
Junming Shao
CLL
113
1
0
19 Aug 2024
Mutually-Aware Feature Learning for Few-Shot Object Counting
Yerim Jeon
Subeen Lee
Jihwan Kim
Jae-Pil Heo
96
1
0
19 Aug 2024
Zero-Shot Object-Centric Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Anirudh Goyal
Mike Mozer
Yoshua Bengio
Georg Martius
Maximilian Seitzer
VLM
OCL
90
8
0
17 Aug 2024
Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
Dario Cioni
Christos Tzelepis
Lorenzo Seidenari
Ioannis Patras
93
2
0
17 Aug 2024
Historical Printed Ornaments: Dataset and Tasks
Sayan Kumar Chaki
Z. S. Baltaci
Elliot Vincent
Remi Emonet
Fabienne Vial-Bonacci
Christelle Bahier-Porte
Mathieu Aubry
Thierry Fournel
73
0
0
16 Aug 2024
Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models
Lin Zhao
Xiao Chen
Eric Z. Chen
Yikang Liu
Terrence Chen
Shanhui Sun
VLM
109
6
0
16 Aug 2024
SpectralEarth: Training Hyperspectral Foundation Models at Scale
Nassim Ait Ali Braham
C. Albrecht
Julien Mairal
J. Chanussot
Yi Wang
X. Zhu
82
15
0
15 Aug 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
89
2
0
15 Aug 2024
Towards flexible perception with visual memory
Robert Geirhos
P. Jaini
Austin Stone
Sourabh Medapati
Xi Yi
G. Toderici
Abhijit Ogale
Jonathon Shlens
83
1
0
15 Aug 2024
Unsupervised Part Discovery via Dual Representation Alignment
Jiahao Xia
Wenjian Huang
Min Xu
Jianguo Zhang
Haimin Zhang
Ziyu Sheng
Dong Xu
92
0
0
15 Aug 2024
Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging
S. Woerner
Christian F. Baumgartner
VLM
MedIm
55
0
0
15 Aug 2024
Continuous Perception Benchmark
Zeyu Wang
Zhenzhen Weng
Serena Yeung-Levy
VLM
66
0
0
15 Aug 2024
Previous
1
2
3
...
22
23
24
...
82
83
84
Next