arXiv: 2104.14294
Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
Swap Path Network for Robust Person Search Pre-training
Lucas Jaffe
A. Zakhor
3DPC
112
0
0
06 Dec 2024
Birth and Death of a Rose
Chen Geng
Yunzhi Zhang
Shangzhe Wu
Jiajun Wu
AI4CE
118
2
0
06 Dec 2024
ARTeFACT: Benchmarking Segmentation Models on Diverse Analogue Media Damage
D. Ivanova
Marco Aversa
Paul Henderson
John Williamson
129
0
0
05 Dec 2024
UnZipLoRA: Separating Content and Style from a Single Image
Chang Liu
Viraj Shah
Aiyu Cui
Svetlana Lazebnik
141
4
0
05 Dec 2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Ziwei Huang
Wanggui He
Quanyu Long
Yandi Wang
Haoyuan Li
...
Fangxun Shu
Long Chen
Hao Jiang
Leilei Gan
EGVM
521
4
0
05 Dec 2024
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing
Jinbin Bai
Wei Chow
L. Yang
Hefei Ling
Juncheng Billy Li
Hao Zhang
Shuicheng Yan
187
10
0
05 Dec 2024
FLAIR: VLM with Fine-grained Language-informed Image Representations
Rui Xiao
Sanghwan Kim
Mariana-Iuliana Georgescu
Zeynep Akata
Stephan Alaniz
VLM, CLIP
138
4
0
04 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewięźlikowski
Randall Balestriero
Wojciech Jasiński
Marek Śmieja
Bartosz Zieliński
223
1
0
04 Dec 2024
GUESS: Generative Uncertainty Ensemble for Self Supervision
S. Mohamadi
Gianfranco Doretto
Donald Adjeroh
UQCV
137
0
0
03 Dec 2024
Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation
Yiftach Edelstein
Or Patashnik
Dana Cohen-Bar
Lihi Zelnik-Manor
138
0
0
03 Dec 2024
Direct Coloring for Self-Supervised Enhanced Feature Decoupling
S. Mohamadi
Gianfranco Doretto
Donald Adjeroh
153
0
0
03 Dec 2024
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
Savya Khosla
S. Vallecorsa
Alex Schwing
Derek Hoiem
132
2
0
02 Dec 2024
Efficient Semantic Communication Through Transformer-Aided Compression
Matin Mortaheb
M. A. Khojastepour
S. Ulukus
111
0
0
02 Dec 2024
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion
Kai He
Chin-Hsuan Wu
Igor Gilitschenski
DiffM, 3DGS
127
0
0
02 Dec 2024
Gen-SIS: Generative Self-augmentation Improves Self-supervised Learning
Varun Belagali
Srikar Yellapragada
Alexandros Graikos
S. Kapse
Zilinghan Li
Tarak Nandi
Ravi K. Madduri
Prateek Prasanna
Joel H. Saltz
Dimitris Samaras
DiffM
144
2
0
02 Dec 2024
Occam's LGS: An Efficient Approach for Language Gaussian Splatting
Jiahuan Cheng
Jan-Nico Zaech
Luc Van Gool
Danda Pani Paudel
3DGS
152
0
0
02 Dec 2024
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Sanghwan Kim
Rui Xiao
Mariana-Iuliana Georgescu
Stephan Alaniz
Zeynep Akata
VLM
346
3
0
02 Dec 2024
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Jiahao Cui
Hui Li
Yun Zhan
Hanlin Shang
K. Cheng
Yuqi Ma
Shan Mu
Hang Zhou
Jingdong Wang
Siyu Zhu
ViT, VGen
201
9
0
01 Dec 2024
Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective
Jiangmeng Li
Zehua Zang
Qirui Ji
Chuxiong Sun
Jingyao Wang
Junge Zhang
Changwen Zheng
Gang Hua
Hui Xiong
SSL
149
0
0
30 Nov 2024
FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation
Chang Won Lee
Selina Leveugle
Svetlana Stolpner
Chris Langley
Paul Grouchy
Jonathan Kelly
Steven Waslander
124
0
0
29 Nov 2024
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
Yeonguk Yu
Minhwan Ko
Sungho Shin
Kangmin Kim
K. Lee
NoLa
120
2
0
29 Nov 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM, VLM, 3DV
271
1
0
29 Nov 2024
Explaining the Impact of Training on Vision Models via Activation Clustering
Ahcène Boubekki
Samuel G. Fadel
Sebastian Mair
286
0
0
29 Nov 2024
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
Weinan Zhang
Lu Zhang
Ping Hu
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
3DGS
134
2
0
29 Nov 2024
T-3DGS: Removing Transient Objects for 3D Scene Reconstruction
Vadim Pryadilshchikov
Alexander Markin
Artem Komarichev
Ruslan Rakhimov
Peter Wonka
Evgeny Burnaev
3DGS
195
4
0
29 Nov 2024
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
Shwetha Ram
T. Neiman
Qianli Feng
Andrew Stuart
S. D. Tran
Trishul Chilimbi
133
2
0
28 Nov 2024
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Luca Barsellotti
Lorenzo Bianchi
Nicola Messina
F. Carrara
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Rita Cucchiara
VLM
120
4
0
28 Nov 2024
SAMa: Material-aware 3D Selection and Segmentation
Michael Fischer
Iliyan Georgiev
Thibault Groueix
Vladimir G. Kim
Tobias Ritschel
Valentin Deschaintre
3DV
140
1
0
28 Nov 2024
PP-SSL : Priority-Perception Self-Supervised Learning for Fine-Grained Recognition
ShuaiHeng Li
Qing Cai
Fan Zhang
Hao Fei
Yangyang Shu
Ziqiang Liu
Haoyang Li
Lingqiao Liu
127
0
0
28 Nov 2024
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
Rong-Cheng Tu
Wenhao Sun
Zhao Jin
Jingyi Liao
Jiaxing Huang
Dacheng Tao
VGen, DiffM
171
7
0
28 Nov 2024
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li
Mingwang Xu
Yun Zhan
Shan Mu
Jiaye Li
...
Yukang Chen
Tan Chen
Mao Ye
Jingdong Wang
Siyu Zhu
VGen
210
7
0
28 Nov 2024
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
Haijie Li
Y. Wu
Jiarui Meng
Qiankun Gao
Zhiyao Zhang
Ronggang Wang
Jian Zhang
ISeg
157
4
0
28 Nov 2024
Tracking Progress Towards Sustainable Development Goal 6 Using Satellite Imagery
Othmane Echchabi
Nizar Talty
Josh Manto
Aya Lahlou
Ka Leung Lam
113
0
0
28 Nov 2024
Reconstructing Animals and the Wild
Peter Kulits
Michael J. Black
Silvia Zuffi
90
0
0
27 Nov 2024
Exponential Moving Average of Weights in Deep Learning: Dynamics and Benefits
Daniel Morales-Brotons
Thijs Vogels
Hadrien Hendrikx
193
23
0
27 Nov 2024
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Shengqu Cai
Eric Ryan Chan
Yunzhi Zhang
Leonidas Guibas
Jiajun Wu
Gordon Wetzstein
132
13
0
27 Nov 2024
Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks
Chen Zhou
Peng Cheng
Sihang Li
Yize Zhang
Yibo Yan
Xiaojun Jia
Yanyan Xu
Kaidi Wang
Xiaochun Cao
148
0
0
27 Nov 2024
PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis
Zak Buzzard
Konstantin Hemker
Nikola Simidjievski
M. Jamnik
MedIm
107
0
0
27 Nov 2024
RoMo: Robust Motion Segmentation Improves Structure from Motion
Lily Goli
S. Sabour
Mark J. Matthews
Marcus A. Brubaker
Dmitry Lagun
Alec Jacobson
David J. Fleet
Saurabh Saxena
Andrea Tagliasacchi
VOS
173
5
0
27 Nov 2024
Evaluating Vision-Language Models as Evaluators in Path Planning
Mohamed Aghzal
Xiang Yue
Erion Plaku
Ziyu Yao
LRM
232
1
0
27 Nov 2024
RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model
Huiyang Hu
Peijin Wang
Hanbo Bi
Boyuan Tong
Zehua Wang
...
Ziqi Zhang
Yaowei Wang
QiXiang Ye
Kun Fu
Xian Sun
299
0
0
27 Nov 2024
Spatially Visual Perception for End-to-End Robotic Learning
Travis Davies
Jiahuan Yan
Xiang Chen
Yu Tian
Yueting Zhuang
Yiqi Huang
Luhui Hu
173
1
0
26 Nov 2024
Reward Incremental Learning in Text-to-Image Generation
Maorong Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
EGVM
132
0
0
26 Nov 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Junyuan Deng
Wei Yin
Xiaoyang Guo
Qian Zhang
Xiaotao Hu
Weiqiang Ren
Xiaoxiao Long
P. Tan
DiffM, MDE
155
1
0
26 Nov 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models
Yunzhe Hu
Difan Zou
Dong Xu
133
1
0
26 Nov 2024
MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection
Juefei He
Yuexing Peng
Wei Li
Junchuan Yu
Daqing Ge
Wei Xiang
85
0
0
26 Nov 2024
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Chanyoung Kim
Dayun Ju
Woojung Han
Ming-Hsuan Yang
Seong Jae Hwang
VLM, VOS
272
1
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Yansen Wang
Ming-Hsuan Yang
VLM
171
3
0
26 Nov 2024
COBRA: A Continual Learning Approach to Vision-Brain Understanding
Xuan-Bac Nguyen
Arabinda Kumar Choudhary
Pawan Sinha
Xin Li
Khoa Luu
CLL
141
0
0
25 Nov 2024
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Xuweiyi Chen
Markus Marks
Zezhou Cheng
173
0
0
25 Nov 2024