ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
Xiangxiang Chu
Renda Li
Yong Wang
257
1
0
01 Jul 2025
Object Retrieval for Visual Question Answering with Outside Knowledge
Object Retrieval for Visual Question Answering with Outside Knowledge
Shichao Kan
Yuhai Deng
Yixiong Liang
Lihui Cen
Zhe Qu
Linna Zhang
Zhihai He
Yigang Cen
90
0
0
01 Jul 2025
SP$^2$OT: Semantic-Regularized Progressive Partial Optimal Transport for Imbalanced Clustering
SP2^22OT: Semantic-Regularized Progressive Partial Optimal Transport for Imbalanced Clustering
Chuyu Zhang
Hui Ren
Xuming He
OT
86
1
0
01 Jul 2025
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
Qian Feng
Dawei Zhou
Hanbin Zhao
Chao Zhang
Jiahua Dong
Dengxin Dai
Hui Qian
VLMCLL
83
5
0
01 Jul 2025
Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Jiahao Xia
Min Xu
Wenjian Huang
Jianguo Zhang
Haimin Zhang
Chunxia Xiao
CVBMFedML
180
0
0
01 Jul 2025
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
Yi Xin
Jianjiang Yang
Haodi Zhou
Junlong Du
Qi Qin
...
Bin Fu
Xiaokang Yang
Guangtao Zhai
Ming-Hsuan Yang
Xiaohong Liu
VLM
174
86
0
01 Jul 2025
Few-Shot Generalized Category Discovery With Retrieval-Guided Decision Boundary Enhancement
Few-Shot Generalized Category Discovery With Retrieval-Guided Decision Boundary Enhancement
Yunhan Ren
Feng Luo
Siyu Huang
12
0
0
20 Jun 2025
Class Agnostic Instance-level Descriptor for Visual Instance Search
Class Agnostic Instance-level Descriptor for Visual Instance Search
Qi-Ying Sun
Wan-Lei Zhao
Yi-Bo Miao
Chong-Wah Ngo
OCL
22
0
0
20 Jun 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
23
0
0
20 Jun 2025
Bridging Brain with Foundation Models through Self-Supervised Learning
Hamdi Altaheri
Fakhri Karray
Md. Milon Islam
S M Taslim Uddin Raju
Amir-Hossein Karimi
14
0
0
19 Jun 2025
Reliable Few-shot Learning under Dual Noises
Reliable Few-shot Learning under Dual Noises
Ji Zhang
Jingkuan Song
Lianli Gao
N. Sebe
Heng Tao Shen
NoLa
22
0
0
19 Jun 2025
MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning
MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning
Leonid Ivanov
Vasily Yuryev
Dmitry Yudin
7
0
0
18 Jun 2025
Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation
Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation
Hanbit Oh
Andrea M. Salcedo-Vázquez
I. Ramirez-Alpizar
Y. Domae
15
0
0
18 Jun 2025
Tactile Beyond Pixels: Multisensory Touch Representations for Robot Manipulation
Tactile Beyond Pixels: Multisensory Touch Representations for Robot Manipulation
Carolina Higuera
Akash Sharma
Taosha Fan
Chaithanya Krishna Bodduluri
Byron Boots
...
Mike Lambeta
Tingfan Wu
Zixi Liu
Francois Robert Hogan
Mustafa Mukadam
20
0
0
17 Jun 2025
Latent Action Diffusion for Cross-Embodiment Manipulation
Latent Action Diffusion for Cross-Embodiment Manipulation
Erik Bauer
Elvis Nava
Robert K. Katzschmann
20
0
0
17 Jun 2025
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection
Zhijing Wan
Zhixiang Wang
Zheng Wang
Xin Xu
Shiníchi Satoh
29
0
0
17 Jun 2025
Discrete JEPA: Learning Discrete Token Representations without Reconstruction
Discrete JEPA: Learning Discrete Token Representations without Reconstruction
Junyeob Baek
Hosung Lee
Christopher Hoang
Mengye Ren
Sungjin Ahn
22
0
0
17 Jun 2025
Exploring Non-contrastive Self-supervised Representation Learning for Image-based Profiling
Exploring Non-contrastive Self-supervised Representation Learning for Image-based Profiling
Siran Dai
Qianqian Xu
Peisong Wen
Yang Liu
Qingming Huang
20
0
0
17 Jun 2025
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast
Beilei Cui
Yiming Huang
Long Bai
Hongliang Ren
29
0
0
16 Jun 2025
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry
Junyoung Seo
Jisang Han
Jaewoo Jung
Siyoon Jin
Joungbin Lee
...
Takashi Shibuya
Donghoon Ahn
Shoukang Hu
Seungryong Kim
Yuki Mitsufuji
VGen
34
0
0
16 Jun 2025
Evolution of ReID: From Early Methods to LLM Integration
Evolution of ReID: From Early Methods to LLM Integration
Amran Bhuiyan
Mizanur Rahman
Md Tahmid Rahman Laskar
Aijun An
Jimmy Xiangji Huang
VLM
19
0
0
16 Jun 2025
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
Jeonghoon Park
Juyoung Lee
Chaeyeon Chung
Jaeseong Lee
Jaegul Choo
Jindong Gu
17
0
0
16 Jun 2025
Boundary-Aware Vision Transformer for Angiography Vascular Network Segmentation
Boundary-Aware Vision Transformer for Angiography Vascular Network Segmentation
Nabil Hezil
Suraj Singh
Vita V. Vlasova
Oleg Y. Rogov
Ahmed Bouridane
R. Hamoudi
ViTMedIm
16
0
0
15 Jun 2025
Generalized Category Discovery under the Long-Tailed Distribution
Generalized Category Discovery under the Long-Tailed Distribution
Bingchen Zhao
Kai Han
5
0
0
14 Jun 2025
EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning
EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning
Huaijie Wang
De Cheng
Lingfeng He
Yan Li
Jie Li
Nannan Wang
X. Gao
CLL
25
0
0
14 Jun 2025
Uncertainty Awareness Enables Efficient Labeling for Cancer Subtyping in Digital Pathology
Uncertainty Awareness Enables Efficient Labeling for Cancer Subtyping in Digital Pathology
Nirhoshan Sivaroopan
Chamuditha Jayanga Galappaththige
Chalani Ekanayake
Hasindri Watawana
Ranga Rodrigo
Chamira U. S. Edussooriya
D. Wadduwage
13
0
0
13 Jun 2025
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging
Haoyu Dong
Yuwen Chen
H. Gu
Nicholas Konz
Yaqian Chen
Qihang Li
Maciej A. Mazurowski
MedImVLM
22
0
0
13 Jun 2025
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
Hsi-Che Lin
Yu-Chu Yu
Kai-Po Chang
Y. Wang
71
0
0
13 Jun 2025
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success
Che Wang
Jeroen van Baar
Chaitanya Mitash
Shuai-Peng Li
Dylan Randle
Weiyao Wang
Sumedh Sontakke
Kostas E. Bekris
Kapil Katyal
SSL
104
1
0
12 Jun 2025
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization
Mario Barbara
Alaa Maalouf
134
0
0
12 Jun 2025
Improving Out-of-Distribution Detection via Dynamic Covariance Calibration
Improving Out-of-Distribution Detection via Dynamic Covariance Calibration
Kaiyu Guo
Zijian Wang
Tan Pan
Brian C. Lovell
Mahsa Baktashmotlagh
OODD
96
0
0
11 Jun 2025
Attention, Please! Revisiting Attentive Probing for Masked Image Modeling
Attention, Please! Revisiting Attentive Probing for Masked Image Modeling
Bill Psomas
Dionysis Christopoulos
Eirini Baltzi
Ioannis Kakogeorgiou
Tilemachos Aravanis
N. Komodakis
Konstantinos Karantzalos
Yannis Avrithis
Giorgos Tolias
56
0
0
11 Jun 2025
Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks
Achim Hekler
Lukas Kuhn
Florian Buettner
UQCV
77
0
0
11 Jun 2025
Accurate and efficient zero-shot 6D pose estimation with frozen foundation models
Andrea Caraffa
Davide Boscaini
Fabio Poiesi
82
0
0
11 Jun 2025
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation
Siyu Chen
Ting Han
Chengzheng Fu
Changshe Zhang
Chaolei Wang
Jinhe Su
Guorong Cai
Meiliu Wu
ObjDVLM
93
0
0
11 Jun 2025
A theoretical framework for self-supervised contrastive learning for continuous dependent data
A theoretical framework for self-supervised contrastive learning for continuous dependent data
Alexander Marusov
Alexander Yuhay
Alexey Zaytsev
67
0
0
11 Jun 2025
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20th^{th}th century Urban Landscapes with Satellite Imageries
Tianxiang Hao
Lixian Zhang
Yingjia Zhang
Mengxuan Chen
Jinxiao Zhang
Haohuan Fu
63
0
0
11 Jun 2025
EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks
Athinoulla Konstantinou
Georgios Leontidis
Mamatha Thota
A. Durrant
3DPC
72
0
0
11 Jun 2025
Only-Style: Stylistic Consistency in Image Generation without Content Leakage
Tilemachos Aravanis
P. Filntisis
Petros Maragos
George Retsinas
72
0
0
11 Jun 2025
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai
A. Chowdhury
Zihe Wang
Sooyoung Jeon
Lemeng Wang
Jiacheng Hou
Jihyung Kil
Wei-Lun Chao
CoGe
50
0
0
10 Jun 2025
Fine-Grained Spatially Varying Material Selection in Images
Julia Guerrero-Viu
Michael Fischer
Iliyan Georgiev
Elena Garces
Diego F. F. Gutierrez
B. Masiá
Valentin Deschaintre
20
0
0
10 Jun 2025
JAFAR: Jack up Any Feature at Any Resolution
JAFAR: Jack up Any Feature at Any Resolution
Paul Couairon
Loick Chambon
Louis Serrano
Jean-Emmanuel Haugeard
Matthieu Cord
Nicolas Thome
MDE
40
0
0
10 Jun 2025
Revolutionizing Clinical Trials: A Manifesto for AI-Driven Transformation
Revolutionizing Clinical Trials: A Manifesto for AI-Driven Transformation
M. Schaar
Richard W. Peck
E. McKinney
Jim Weatherall
Stuart Bailey
...
Rafik Salama
Christina Gunther
Francesca Frau
Antoine Pugeat
Ramon Hernandez
MedIm
63
6
0
10 Jun 2025
SensorLM: Learning the Language of Wearable Sensors
SensorLM: Learning the Language of Wearable Sensors
Yuwei Zhang
Kumar Ayush
Siyuan Qiao
A. Heydari
Girish Narayanswamy
...
Shwetak N. Patel
Cecilia Mascolo
Xin Liu
Daniel J. McDuff
Yuzhe Yang
46
0
0
10 Jun 2025
Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization
Zhiyi Zhu
Xiaoyu Wu
Youwei Lu
31
0
0
10 Jun 2025
Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance
June Suk Choi
Kyungmin Lee
Sihyun Yu
Yisol Choi
Jinwoo Shin
Kimin Lee
DiffMVGen
24
0
0
10 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TSOffRLAI4CE
34
0
0
10 Jun 2025
Foundation Models in Medical Imaging -- A Review and Outlook
Foundation Models in Medical Imaging -- A Review and Outlook
Vivien van Veldhuizen
Vanessa Botha
C. Lu
Melis Erdal Cesur
Kevin Groot Lipman
...
Cees Snoek
Lodewyk Wessels
Ritse Mann
Eric Marcus
Jonas Teuwen
MedImVLMAI4CE
60
0
0
10 Jun 2025
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation
Ziyao Huang
Zixiang Zhou
Juan Cao
Yifeng Ma
Yi Chen
...
Hongmei Wang
Qin Lin
Yuan Zhou
Qinglin Lu
Fan Tang
VGen
30
0
0
10 Jun 2025
UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
Yihe Tang
Wenlong Huang
Yingke Wang
Chengshu Li
Roy Yuan
Ruohan Zhang
Jiajun Wu
Li Fei-Fei
36
0
0
10 Jun 2025
1234...828384
Next