ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image
  Classification
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Benjamin Feuer
Jiawei Xu
Niv Cohen
Patrick Yubeaton
Govind Mittal
Chinmay Hegde
60
3
0
07 Oct 2024
Next state prediction gives rise to entangled, yet compositional
  representations of objects
Next state prediction gives rise to entangled, yet compositional representations of objects
Tankred Saanum
Luca M. Schulze Buschoff
Peter Dayan
Eric Schulz
OCLCoGeOOD
65
1
0
07 Oct 2024
Improving Image Clustering with Artifacts Attenuation via Inference-Time
  Attention Engineering
Improving Image Clustering with Artifacts Attenuation via Inference-Time Attention Engineering
Kazumoto Nakamura
Yuji Nozawa
Yu-Chieh Lin
K. Nakata
Youyang Ng
ViT
69
2
0
07 Oct 2024
Intriguing Properties of Large Language and Vision Models
Intriguing Properties of Large Language and Vision Models
Young-Jun Lee
ByungSoo Ko
Han-Gyu Kim
Yechan Hwang
Ho-Jin Choi
LRMVLM
123
0
0
07 Oct 2024
ACDC: Autoregressive Coherent Multimodal Generation using Diffusion
  Correction
ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction
Hyungjin Chung
Dohun Lee
Jong Chul Ye
VGenDiffM
68
2
0
07 Oct 2024
Low-Rank Continual Personalization of Diffusion Models
Low-Rank Continual Personalization of Diffusion Models
Łukasz Staniszewski
Katarzyna Zaleska
Kamil Deja
DiffM
101
0
0
07 Oct 2024
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
Andrew F. Luo
Jacob Yeung
Rushikesh Zawar
Shaurya Dewan
Margaret M. Henderson
Leila Wehbe
Michael J. Tarr
105
5
0
07 Oct 2024
Organizing Unstructured Image Collections using Natural Language
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
141
5
0
07 Oct 2024
Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava
Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava
Mehdi Azarafza
Fatima Idrees
Ali Ehteshami Bejnordi
Charles Steinmetz
Stefan Henkler
A. Rettberg
75
0
0
07 Oct 2024
Learning De-Biased Representations for Remote-Sensing Imagery
Learning De-Biased Representations for Remote-Sensing Imagery
Zichen Tian
Zhaozheng Chen
Qianru Sun
62
0
0
06 Oct 2024
Self-Supervised Anomaly Detection in the Wild: Favor Joint Embeddings
  Methods
Self-Supervised Anomaly Detection in the Wild: Favor Joint Embeddings Methods
Daniel Otero
Rafael Mateus
Randall Balestriero
51
0
0
05 Oct 2024
SyllableLM: Learning Coarse Semantic Units for Speech Language Models
SyllableLM: Learning Coarse Semantic Units for Speech Language Models
Alan Baade
Puyuan Peng
David Harwath
126
8
0
05 Oct 2024
Not All Diffusion Model Activations Have Been Evaluated as
  Discriminative Features
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features
Benyuan Meng
Qianqian Xu
Zitai Wang
Xiaochun Cao
Qingming Huang
82
7
0
04 Oct 2024
Dessie: Disentanglement for Articulated 3D Horse Shape and Pose
  Estimation from Images
Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images
Ci Li
Yi Yang
Zehang Weng
Elin Hernlund
Silvia Zuffi
Hedvig Kjellström
96
3
0
04 Oct 2024
Optimization Proxies using Limited Labeled Data and Training Time -- A Semi-Supervised Bayesian Neural Network Approach
Optimization Proxies using Limited Labeled Data and Training Time -- A Semi-Supervised Bayesian Neural Network Approach
Parikshit Pareek
Abhijith Jayakumar
K. Sundar
Deepjyoti Deka
Sidhant Misra
110
0
0
04 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through
  Language Descriptions
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
99
7
0
03 Oct 2024
Neutral residues: revisiting adapters for model extension
Neutral residues: revisiting adapters for model extension
Franck Signe Talla
Hervé Jégou
Edouard Grave
67
1
0
03 Oct 2024
HiddenGuard: Fine-Grained Safe Generation with Specialized
  Representation Router
HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Lingrui Mei
Shenghua Liu
Yiwei Wang
Baolong Bi
Ruibin Yuan
Xueqi Cheng
113
5
0
03 Oct 2024
Predictive Attractor Models
Predictive Attractor Models
R. Mounir
Sudeep Sarkar
48
0
0
03 Oct 2024
Hard Negative Sample Mining for Whole Slide Image Classification
Hard Negative Sample Mining for Whole Slide Image Classification
Wentao Huang
Xiaoling Hu
Shahira Abousamra
Prateek Prasanna
Chao Chen
VLM
80
6
0
03 Oct 2024
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization
Gustav Wagner Zakarias
Lars Kai Hansen
Zheng-Hua Tan
81
0
0
03 Oct 2024
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized
  Image Generation
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He
Haodong Li
Yongzhe Hu
Guibao Shen
Yingjie Cai
Weichao Qiu
Ying-Cong Chen
DiffM
97
4
0
02 Oct 2024
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for
  Remote Sensing Images
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
Kaiyu Li
Ruixun Liu
Xiangyong Cao
Deyu Meng
Zhi Wang
Deyu Meng
Zhi Wang
79
3
0
02 Oct 2024
Towards a vision foundation model for comprehensive assessment of
  Cardiac MRI
Towards a vision foundation model for comprehensive assessment of Cardiac MRI
Athira J. Jacob
Indraneel Borgohain
T. Chitiboi
Puneet Sharma
Dorin Comaniciu
Daniel Rueckert
MedIm
60
5
0
02 Oct 2024
AgriCLIP: Adapting CLIP for Agriculture and Livestock via
  Domain-Specialized Cross-Model Alignment
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment
Umair Nawaz
Muhammad Awais
Hanan Gani
Muzammal Naseer
Fahad Khan
Salman Khan
Rao Muhammad Anwer
VLMCLIP
86
3
0
02 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
172
3
0
02 Oct 2024
Tracking objects that change in appearance with phase synchrony
Tracking objects that change in appearance with phase synchrony
Sabine Muzellec
Drew Linsley
A. Ashok
E. Mingolla
Girik Malik
Rufin VanRullen
Thomas Serre
80
2
0
02 Oct 2024
Multi-Scale Fusion for Object Representation
Multi-Scale Fusion for Object Representation
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCLVOS
115
1
0
02 Oct 2024
Local-to-Global Self-Supervised Representation Learning for Diabetic
  Retinopathy Grading
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading
Mostafa Hajighasemloua
Samad Sheikhaei
Hamid Soltanian-Zadeha
65
0
0
01 Oct 2024
Arges: Spatio-Temporal Transformer for Ulcerative Colitis Severity
  Assessment in Endoscopy Videos
Arges: Spatio-Temporal Transformer for Ulcerative Colitis Severity Assessment in Endoscopy Videos
Krishna Chaitanya
Pablo F. Damasceno
Shreyas Fadnavis
Pooya Mobadersany
Chaitanya Parmar
...
Lindsey Surace
Louis R. Ghanem
Oana Gabriela Cula
Tommaso Mansi
K. Standish
59
0
0
01 Oct 2024
Domain Aware Multi-Task Pretraining of 3D Swin Transformer for
  T1-weighted Brain MRI
Domain Aware Multi-Task Pretraining of 3D Swin Transformer for T1-weighted Brain MRI
Jonghun Kim
Mansu Kim
Hyunjin Park
MedImViT
54
0
0
01 Oct 2024
Unleashing the Potentials of Likelihood Composition for Multi-modal
  Language Models
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models
Shitian Zhao
Renrui Zhang
Xu Luo
Yan Wang
Shanghang Zhang
Peng Gao
91
0
0
01 Oct 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Saurav Jha
Shiqi Yang
Masato Ishii
Mengjie Zhao
Christian Simon
Muhammad Jehanzeb Mirza
Dong Gong
Lina Yao
Shusuke Takahashi
Yuki Mitsufuji
DiffM
147
3
0
01 Oct 2024
Flex3D: Feed-Forward 3D Generation with Flexible Reconstruction Model and Input View Curation
Flex3D: Feed-Forward 3D Generation with Flexible Reconstruction Model and Input View Curation
Junlin Han
Jianyuan Wang
Andrea Vedaldi
Philip Torr
Filippos Kokkinos
124
4
0
01 Oct 2024
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Da-Wei Zhou
Zi-Wen Cai
Han-Jia Ye
Lijun Zhang
De-Chuan Zhan
CLLAI4CE
194
2
0
01 Oct 2024
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
Jeff Tan
Donglai Xiang
Shubham Tulsiani
Deva Ramanan
Gengshan Yang
3DH
73
4
0
30 Sep 2024
Task-Oriented Pre-Training for Drivable Area Detection
Task-Oriented Pre-Training for Drivable Area Detection
Fulong Ma
Guoyang Zhao
Weiqing Qi
Ming Liu
Jun Ma
VLM
64
1
0
30 Sep 2024
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning
  for Surgical Phase Recognition
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
Shu Yang
Zhiyuan Cai
Luyang Luo
Ning Ma
Shuchang Xu
Hao Chen
67
1
0
30 Sep 2024
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin
Chaehyun Kim
Sunghwan Hong
Seokju Cho
Anurag Arnab
Paul Hongsuck Seo
Seungryong Kim
VLM
82
1
0
30 Sep 2024
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
122
9
0
30 Sep 2024
Annotation-Free Curb Detection Leveraging Altitude Difference Image
Annotation-Free Curb Detection Leveraging Altitude Difference Image
Fulong Ma
Peng Hou
Yuxuan Liu
Yang Liu
Ming Liu
Jun Ma
53
0
0
30 Sep 2024
Flipped Classroom: Aligning Teacher Attention with Student in
  Generalized Category Discovery
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Haonan Lin
Wenbin An
Jiahao Wang
Yan Chen
Feng Tian
Mengmeng Wang
Guang Dai
Qianying Wang
Jingdong Wang
110
2
0
29 Sep 2024
Self-supervised Auxiliary Learning for Texture and Model-based Hybrid
  Robust and Fair Featuring in Face Analysis
Self-supervised Auxiliary Learning for Texture and Model-based Hybrid Robust and Fair Featuring in Face Analysis
Shukesh Reddy
Nishit Poddar
Srijan Das
Abhijit Das
CVBM
74
0
0
29 Sep 2024
STTM: A New Approach Based Spatial-Temporal Transformer And Memory
  Network For Real-time Pressure Signal In On-demand Food Delivery
STTM: A New Approach Based Spatial-Temporal Transformer And Memory Network For Real-time Pressure Signal In On-demand Food Delivery
Jiang Wang
Haibin Wei
Xiaowei Xu
Jiacheng Shi
Jian Nie
Longzhi Du
Taixu Jiang
AI4TS
417
0
0
29 Sep 2024
Localizing Memorization in SSL Vision Encoders
Localizing Memorization in SSL Vision Encoders
Wenhao Wang
Adam Dziedzic
Michael Backes
Franziska Boenisch
67
2
0
27 Sep 2024
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
Dylan Li
Gyungin Shin
78
3
0
27 Sep 2024
Explainable Artifacts for Synthetic Western Blot Source Attribution
Explainable Artifacts for Synthetic Western Blot Source Attribution
J. P. Cardenuto
S. Mandelli
Daniel Moreira
Paolo Bestagini
Edward J. Delp
Anderson de Rezende Rocha
66
0
0
27 Sep 2024
UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for
  Universal Scene Emotion Perception
UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception
Chuang Chen
Xingwu Sun
Zhi Liu
91
1
0
27 Sep 2024
Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated
  Object Segmentation
Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation
Mahtab Dahaghin
Myrna Castillo
Kourosh Riahidehkordi
M. Toso
Alessio Del Bue
3DGS
81
2
0
27 Sep 2024
Cross-video Identity Correlating for Person Re-identification
  Pre-training
Cross-video Identity Correlating for Person Re-identification Pre-training
Jialong Zuo
Ying Nie
Hanyu Zhou
Huaxin Zhang
Haoyu Wang
Tianyu Guo
Nong Sang
Changxin Gao
90
5
0
27 Sep 2024
Previous
123...192021...828384
Next