ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown
Title
Training-Free Semantic Segmentation via LLM-Supervision
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
103
3
0
31 Mar 2024
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Yang Luo
Zhineng Chen
Peng Zhou
Zuxuan Wu
Xieping Gao
Yu-Gang Jiang
SSL
88
4
0
31 Mar 2024
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
Jiantao Wu
Shentong Mo
Sara Atito
Zhenhua Feng
Josef Kittler
Muhammad Awais
84
3
0
31 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and
  Intra-Class Regions for Weakly-Supervised Semantic Segmentation
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
104
2
0
30 Mar 2024
Bayesian Exploration of Pre-trained Models for Low-shot Image
  Classification
Bayesian Exploration of Pre-trained Models for Low-shot Image Classification
Yibo Miao
Yu Lei
Feng Zhou
Zhijie Deng
VLMUQCVBDL
107
3
0
30 Mar 2024
Image-to-Image Matching via Foundation Models: A New Perspective for
  Open-Vocabulary Semantic Segmentation
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang
Rui Sun
Naisong Luo
Yuwen Pan
Tianzhu Zhang
VLM
81
10
0
30 Mar 2024
InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning
InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning
Yan-Shuo Liang
Wu-Jun Li
CLL
139
53
0
30 Mar 2024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion
  Models
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
Barbara Toniella Corradini
Mustafa Shukor
Paul Couairon
Guillaume Couairon
Franco Scarselli
Matthieu Cord
DiffMVLM
125
6
0
29 Mar 2024
InstantSplat: Sparse-view Gaussian Splatting in Seconds
InstantSplat: Sparse-view Gaussian Splatting in Seconds
Zhiwen Fan
Wenyan Cong
Kairun Wen
Kevin Wang
Jian Zhang
...
Boris Ivanovic
Marco Pavone
Georgios Pavlakos
Zhangyang Wang
Yue Wang
3DGS
133
1
0
29 Mar 2024
GaussianCube: A Structured and Explicit Radiance Representation for 3D
  Generative Modeling
GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
Bowen Zhang
Yiji Cheng
Jiaolong Yang
Chunyu Wang
Feng Zhao
Yansong Tang
Dong Chen
Baining Guo
3DGS
146
10
0
28 Mar 2024
Situation Awareness for Driver-Centric Driving Style Adaptation
Situation Awareness for Driver-Centric Driving Style Adaptation
Johann Haselberger
Bonifaz Stuhr
Bernhard Schick
Steffen Müller
70
1
0
28 Mar 2024
The Bad Batches: Enhancing Self-Supervised Learning in Image
  Classification Through Representative Batch Curation
The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation
Ozgu Goksu
Nicolas Pugeault
SSL
64
0
0
28 Mar 2024
Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics
Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics
Norman Di Palo
Edward Johns
118
37
0
28 Mar 2024
A Two-Phase Recall-and-Select Framework for Fast Model Selection
A Two-Phase Recall-and-Select Framework for Fast Model Selection
Jianwei Cui
Wenhang Shi
Honglin Tao
Wei Lu
Xiaoyong Du
104
0
0
28 Mar 2024
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu
Chirui Chang
Peng Dai
Yang-tian Sun
Xiaojuan Qi
3DGS
112
3
0
28 Mar 2024
DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context
  in Editable Face Generation
DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation
Haonan Lin
Mengmeng Wang
Yan Chen
Wenbin An
Yuzhe Yao
Guang Dai
Qianying Wang
Yong-Jin Liu
Jingdong Wang
DiffM
83
4
0
28 Mar 2024
MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck
MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck
Liangjiang Wen
Xiasi Wang
Jianzhuang Liu
Zenglin Xu
59
3
0
28 Mar 2024
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma
Peihao Wang
Zhiwen Fan
Zhangyang Wang
Hao Su
R. Ramamoorthi
VLM
95
8
0
27 Mar 2024
UniDepth: Universal Monocular Metric Depth Estimation
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Daniel Gehrig
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLMMDE
180
144
0
27 Mar 2024
Generative Multi-modal Models are Good Class-Incremental Learners
Generative Multi-modal Models are Good Class-Incremental Learners
Xusheng Cao
Haori Lu
Linlan Huang
Xialei Liu
Ming-Ming Cheng
CLL
95
15
0
27 Mar 2024
Branch-Tuning: Balancing Stability and Plasticity for Continual
  Self-Supervised Learning
Branch-Tuning: Balancing Stability and Plasticity for Continual Self-Supervised Learning
Wenzhuo Liu
Fei Zhu
Cheng-Lin Liu
CLL
124
2
0
27 Mar 2024
ShapeGrasp: Zero-Shot Task-Oriented Grasping with Large Language Models
  through Geometric Decomposition
ShapeGrasp: Zero-Shot Task-Oriented Grasping with Large Language Models through Geometric Decomposition
Samuel Li
Sarthak Bhagat
Joseph Campbell
Yaqi Xie
Woojun Kim
Katia Sycara
Simon Stepputtis
LM&Ro
88
15
0
26 Mar 2024
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
Alexandre Eymaël
Renaud Vandeghen
A. Cioppa
Silvio Giancola
Guohao Li
Marc Van Droogenbroeck
ViT
83
8
0
26 Mar 2024
Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles
  from 3D Cell Painting Images
Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles from 3D Cell Painting Images
Vivek Gopalakrishnan
Jingzhe Ma
Zhiyong Xie
48
0
0
26 Mar 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using
  Heuristics-Guided Segmentation
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
Jiahao Chen
Yipeng Qin
Lingjie Liu
Jiangbo Lu
Guanbin Li
79
13
0
26 Mar 2024
Neural Clustering based Visual Representation Learning
Neural Clustering based Visual Representation Learning
Guikun Chen
Xia Li
Yi Yang
Wenguan Wang
SSL
107
10
0
26 Mar 2024
Decoding the visual attention of pathologists to reveal their level of
  expertise
Decoding the visual attention of pathologists to reveal their level of expertise
Souradeep Chakraborty
Dana Perez
Paul Friedman
Natallia Sheuka
Constantin Friedman
Oksana Yaskiv
Rajarsi R. Gupta
G. Zelinsky
Joel H. Saltz
Dimitris Samaras
MedIm
68
0
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in
  Diffusion Transformer
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
131
26
0
25 Mar 2024
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
Yang Chen
Yingwei Pan
Haibo Yang
Ting Yao
Tao Mei
DiffM
91
20
0
25 Mar 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
256
12
0
25 Mar 2024
Towards Large-Scale Training of Pathology Foundation Models
Towards Large-Scale Training of Pathology Foundation Models
kaiko.ai
N. Aben
Edwin D. de Jong
Ioannis Gatopoulos
Nicolas Kanzig
Mikhail Karasikov
Axel Lagré
Roman Moser
J. Doorn
Fei Tang
MedImAI4CE
89
12
0
24 Mar 2024
latentSplat: Autoencoding Variational Gaussians for Fast Generalizable
  3D Reconstruction
latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction
Christopher Wewer
Kevin Raj
Eddy Ilg
Bernt Schiele
J. E. Lenssen
3DGS
125
66
0
24 Mar 2024
Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble
Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble
Chenhui Xu
Fuxun Yu
Zirui Xu
Nathan Inkawhich
Xiang Chen
OODD
86
6
0
24 Mar 2024
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for
  Faster Inference
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference
Tanvir Mahmud
Burhaneddin Yaman
Chun-Hao Liu
Diana Marculescu
130
3
0
24 Mar 2024
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian
  Splatting
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting
Jun Guo
Xiaojian Ma
Yue Fan
Huaping Liu
Qing Li
3DGS
119
31
0
22 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
115
19
0
22 Mar 2024
Neural Plasticity-Inspired Multimodal Foundation Model for Earth
  Observation
Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation
Zhitong Xiong
Yi Wang
Fahong Zhang
Adam J. Stewart
Joelle Hanna
Damian Borth
Ioannis Papoutsis
B. L. Saux
Gustau Camps-Valls
Xiao Xiang Zhu
AI4CE
117
18
0
22 Mar 2024
Selectively Informative Description can Reduce Undesired Embedding
  Entanglements in Text-to-Image Personalization
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization
Jimyeong Kim
Jungwon Park
Wonjong Rhee
DiffM
102
5
0
22 Mar 2024
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Raza Yunus
J. E. Lenssen
Michael Niemeyer
Yiyi Liao
Christian Rupprecht
Christian Theobalt
Gerard Pons-Moll
Jia-Bin Huang
Vladislav Golyanik
Eddy Ilg
144
26
0
22 Mar 2024
Towards a Comprehensive, Efficient and Promptable Anatomic Structure
  Segmentation Model using 3D Whole-body CT Scans
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans
Heng Guo
Jianfeng Zhang
Jiaxing Huang
Tony C. W. Mok
Dazhou Guo
Ke Yan
Le Lu
Dakai Jin
Minfeng Xu
MedIm
31
6
0
22 Mar 2024
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition
  Integration
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
Zhichao Wei
Qingkun Su
Long Qin
Weizhi Wang
DiffM
101
6
0
22 Mar 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT
  Descriptors
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Saksham Suri
Matthew Walmer
Kamal Gupta
Abhinav Shrivastava
87
7
0
21 Mar 2024
Hierarchical Text-to-Vision Self Supervised Alignment for Improved
  Histopathology Representation Learning
Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning
Hasindri Watawana
Kanchana Ranasinghe
Tariq Mahmood
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
SSL
70
5
0
21 Mar 2024
Implicit Style-Content Separation using B-LoRA
Implicit Style-Content Separation using B-LoRA
Yarden Frenkel
Yael Vinker
Ariel Shamir
Daniel Cohen-Or
MoMeOffRL
103
47
0
21 Mar 2024
Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling
Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling
Chengxu Zhuang
Evelina Fedorenko
Jacob Andreas
69
2
0
21 Mar 2024
DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single
  Video
DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video
Narek Tumanyan
Assaf Singer
Shai Bagon
Tali Dekel
MQ
100
32
0
21 Mar 2024
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion
  Descriptors
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
Nikolaos Tsagkas
Jack Rome
S. Ramamoorthy
Oisin Mac Aodha
Chris Xiaoxuan Lu
54
8
0
21 Mar 2024
Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D
  Pose Estimation
Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
F. D. Felice
A. Remus
Stefano Gasperini
Benjamin Busam
Lionel Ott
Federico Tombari
Roland Siegwart
C. Avizzano
DiffM
54
10
0
21 Mar 2024
Unsupervised Audio-Visual Segmentation with Modality Alignment
Unsupervised Audio-Visual Segmentation with Modality Alignment
Swapnil Bhosale
Haosen Yang
Diptesh Kanojia
Jiangkang Deng
Xiatian Zhu
VOS
82
6
0
21 Mar 2024
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic
  Segmentation
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
106
8
0
21 Mar 2024
Previous
123...353637...828384
Next