ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
110
28
0
29 Jul 2024
SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow
  Prediction
SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction
cCaughan Koksal
Ghazal Ghazaei
Felix Holm
Azade Farshad
Nassir Navab
MedIm
87
3
0
29 Jul 2024
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang
Karl Schmeckpeper
Brandon B. May
M. Minniti
Tarik Kelestemur
David Watkins
Laura Herlant
VLM
101
24
0
29 Jul 2024
Diffusion Feedback Helps CLIP See Better
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang
Quan-Sen Sun
Fan Zhang
Yepeng Tang
Jing Liu
Xinlong Wang
VLM
138
17
0
29 Jul 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend
  Temporal Attention
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu
Yuanzhi Liang
Linchao Zhu
Yi Yang
DiffMVGen
116
32
0
29 Jul 2024
Contextuality Helps Representation Learning for Generalized Category
  Discovery
Contextuality Helps Representation Learning for Generalized Category Discovery
Tingzhang Luo
Mingxuan Du
Jiatao Shi
Xinxiang Chen
Bingchen Zhao
Shaoguang Huang
72
4
0
29 Jul 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
177
2
0
29 Jul 2024
Large-scale cervical precancerous screening via AI-assisted cytology
  whole slide image analysis
Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis
Honglin Li
Yusuan Sun
Chenglu Zhu
Yunlong Zhang
Shichuan Zhang
...
Pingyi Chen
Jingxiong Li
Sunyi Zheng
Can Cui
Lin Yang
85
3
0
28 Jul 2024
HRP: Human Affordances for Robotic Pre-Training
HRP: Human Affordances for Robotic Pre-Training
Mohan Kumar Srirama
Sudeep Dasari
Shikhar Bahl
Abhinav Gupta
101
19
0
26 Jul 2024
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category
  Discovery
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
Fernando Julio Cendra
Bingchen Zhao
Kai Han
VLMCLL
102
6
0
26 Jul 2024
SHIC: Shape-Image Correspondences with no Keypoint Supervision
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
3DPC3DH3DV
70
3
0
26 Jul 2024
Deep Companion Learning: Enhancing Generalization Through Historical
  Consistency
Deep Companion Learning: Enhancing Generalization Through Historical Consistency
Ruizhao Zhu
Venkatesh Saligrama
FedML
87
0
0
26 Jul 2024
Boosting Cross-Domain Point Classification via Distilling Relational
  Priors from 2D Transformers
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers
Longkun Zou
Wanru Zhu
Ke Chen
Lihua Guo
K. Guo
Kui Jia
Yaowei Wang
3DPCViT
84
0
0
26 Jul 2024
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
Pulkit Kumar
Namitha Padmanabhan
Luke Luo
Sai Saketh Rambhatla
Abhinav Shrivastava
93
4
0
25 Jul 2024
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey
  Interactions in Animal Videos
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos
Zsófia Katona
Seyed Sahand Mohamadi Ziabari
Fatemeh Karimi Nejadasl
54
1
0
25 Jul 2024
DINOv2 Rocks Geological Image Analysis: Classification, Segmentation,
  and Interpretability
DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability
Florent Brondolo
Samuel Beaussant
AI4CE
86
1
0
25 Jul 2024
Leveraging Foundation Models via Knowledge Distillation in Multi-Object
  Tracking: Distilling DINOv2 Features to FairMOT
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT
Niels G. Faber
Seyed Sahand Mohamadi Ziabari
Fatemeh Karimi Nejadasl
95
3
0
25 Jul 2024
Exploring the Effect of Dataset Diversity in Self-Supervised Learning
  for Surgical Computer Vision
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision
Tim J. M. Jaspers
Ronald L.P.D. de Jong
Yasmina Alkhalil
Tijn Zeelenberg
C. H. Kusters
...
Franciscus Hendericus Aäron Bakker
J P Ruurda
Willem M. Brinkman
Peter H. N. de With
Fons van der Sommen
83
3
0
25 Jul 2024
Balancing Complementarity and Consistency via Delayed Activation in
  Incomplete Multi-view Clustering
Balancing Complementarity and Consistency via Delayed Activation in Incomplete Multi-view Clustering
Bo Li
98
1
0
25 Jul 2024
Revisiting Machine Unlearning with Dimensional Alignment
Revisiting Machine Unlearning with Dimensional Alignment
Seonguk Seo
Dongwan Kim
Bohyung Han
MU
58
1
0
25 Jul 2024
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su
Shihao Ji
90
0
0
24 Jul 2024
PEEKABOO: Hiding parts of an image for unsupervised object localization
PEEKABOO: Hiding parts of an image for unsupervised object localization
Hasib Zunair u
24 A.BenHamza
SSL
130
0
0
24 Jul 2024
Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D
  Medical Image Classification?
Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification?
Johannes Kiechle
Daniel M. Lang
Stefan M. Fischer
Lina Felsner
J. Peeken
Julia A. Schnabel
MedIm
70
0
0
24 Jul 2024
Contrastive Learning Is Not Optimal for Quasiperiodic Time Series
Contrastive Learning Is Not Optimal for Quasiperiodic Time Series
A. Atienza
J. Bardram
S. Puthusserypady
BDLAI4TS
85
2
0
24 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
YunLong Yu
Jiale Cao
Yanwei Pang
Jungong Han
Xuelong Li
CLL
142
5
0
24 Jul 2024
SINDER: Repairing the Singular Defects of DINOv2
SINDER: Repairing the Singular Defects of DINOv2
Haoqian Wang
Tong Zhang
Mathieu Salzmann
57
4
0
23 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
75
3
0
23 Jul 2024
A Multi-view Mask Contrastive Learning Graph Convolutional Neural
  Network for Age Estimation
A Multi-view Mask Contrastive Learning Graph Convolutional Neural Network for Age Estimation
Yiping Zhang
Yuntao Shou
Tao Meng
Wei Ai
Keqin Li
CVBM
110
10
0
23 Jul 2024
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal
  Large Language Model
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
Yiwei Ma
Zhibin Wang
Xiaoshuai Sun
Weihuang Lin
Qiang-feng Zhou
Jiayi Ji
Rongrong Ji
MLLMVLM
105
2
0
23 Jul 2024
Reconstructing Training Data From Real World Models Trained with
  Transfer Learning
Reconstructing Training Data From Real World Models Trained with Transfer Learning
Yakir Oz
Gilad Yehudai
Gal Vardi
Itai Antebi
Michal Irani
Niv Haim
67
3
0
22 Jul 2024
CarFormer: Self-Driving with Learned Object-Centric Representations
CarFormer: Self-Driving with Learned Object-Centric Representations
Shadi S. Hamdan
Fatma Guney
3DPCOCL
98
3
0
22 Jul 2024
Towards Latent Masked Image Modeling for Self-Supervised Visual
  Representation Learning
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning
Yibing Wei
Abhinav Gupta
Pedro Morgado
SSL
75
8
0
22 Jul 2024
MILAN: Milli-Annotations for Lidar Semantic Segmentation
MILAN: Milli-Annotations for Lidar Semantic Segmentation
Nermin Samet
Gilles Puy
Oriane Siméoni
Renaud Marlet
3DPC
82
0
0
22 Jul 2024
Disentangling spatio-temporal knowledge for weakly supervised object
  detection and segmentation in surgical video
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video
Guiqiu Liao
M. Jogan
Sai Koushik
Eric Eaton
Daniel A. Hashimoto
VOS
99
2
0
22 Jul 2024
Predicting the Best of N Visual Trackers
Predicting the Best of N Visual Trackers
B. Alawode
S. Javed
Arif Mahmood
Jirí Matas
107
1
0
22 Jul 2024
Not All Pairs are Equal: Hierarchical Learning for
  Average-Precision-Oriented Video Retrieval
Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval
Yang Liu
Qianqian Xu
Peisong Wen
Siran Dai
Qingming Huang
100
1
0
22 Jul 2024
SIGMA:Sinkhorn-Guided Masked Video Modeling
SIGMA:Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi
Michael Dorkenwald
Fida Mohammad Thoker
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
96
7
0
22 Jul 2024
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Fudong Lin
Jiadong Lou
Xu Yuan
Nianfeng Tzeng
ViTAAML
88
2
0
22 Jul 2024
CP-Prompt: Composition-Based Cross-modal Prompting for
  Domain-Incremental Continual Learning
CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning
Yu Feng
Zhen Tian
Zhonghong Ou
Zongfu Han
Haoran Luo
Guangwei Zhang
Meina Song
CLLVLM
63
8
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
168
9
0
22 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
112
6
0
21 Jul 2024
Assessing Sample Quality via the Latent Space of Generative Models
Assessing Sample Quality via the Latent Space of Generative Models
Jingyi Xu
Hieu M. Le
Dimitris Samaras
MedIm
101
3
0
21 Jul 2024
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and
  Semantically-Rich Vision-Language Models
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
Md Zarif Hossain
Ahmed Imteaj
VLMAAML
66
6
0
20 Jul 2024
Large-vocabulary forensic pathological analyses via prototypical
  cross-modal contrastive learning
Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning
Chen Shen
Chunfeng Lian
Wanqing Zhang
Fan Wang
Jianhua Zhang
...
Hongshu Mu
Hao Wu
Xinggong Liang
Jianhua Ma
Zhenyuan Wang
109
1
0
20 Jul 2024
$\infty$-Brush: Controllable Large Image Synthesis with Diffusion Models
  in Infinite Dimensions
∞\infty∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
Minh-Quan Le
Alexandros Graikos
Srikar Yellapragada
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
96
10
0
20 Jul 2024
On Learning Discriminative Features from Synthesized Data for
  Self-Supervised Fine-Grained Visual Recognition
On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition
Zihu Wang
Lingqiao Liu
Scott Ricardo Figueroa Weston
Samuel Tian
Peng Li
76
2
0
19 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
62
2
0
19 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized
  Generation
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
94
3
0
18 Jul 2024
Temporal Representation Learning for Stock Similarities and Its
  Applications in Investment Management
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoon-Jeong Hwang
Stefan Zohren
Yongjae Lee
AIFin
73
1
0
18 Jul 2024
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder
  Architectures
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures
Hao Lu
Wenze Liu
Hongtao Fu
Zhiguo Cao
59
3
0
18 Jul 2024
Previous
123...242526...828384
Next