ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown
Title
VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition
VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition
Yun-Jin Li
M. Gladkova
Yan Xia
Rui Wang
Daniel Cremers
114
5
0
21 Mar 2024
On Pretraining Data Diversity for Self-Supervised Learning
On Pretraining Data Diversity for Self-Supervised Learning
Hasan Hammoud
Tuhin Das
Fabio Pizzati
Philip Torr
Adel Bibi
Guohao Li
155
3
0
20 Mar 2024
Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a
  Compact Representation
Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation
Hugues Thomas
Jian Zhang
78
1
0
20 Mar 2024
MTP: Advancing Remote Sensing Foundation Model via Multi-Task
  Pretraining
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining
Di Wang
Jing Zhang
Minqiang Xu
Lin Liu
Dongsheng Wang
...
Chengxi Han
Haonan Guo
Bo Du
Dacheng Tao
Lefei Zhang
83
53
0
20 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
109
28
0
20 Mar 2024
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
Jing Zhang
Irving Fang
Juexiao Zhang
Hao Wu
Akshat Kaushik
Alice Rodriguez
Hanwen Zhao
Zhuo Zheng
Radu Iovita
Chen Feng
73
5
0
19 Mar 2024
TAPTR: Tracking Any Point with Transformers as Detection
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li
Hao Zhang
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Lei Zhang
91
20
0
19 Mar 2024
Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater
  Environments
Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater Environments
Kurran Singh
Jungseok Hong
Nick Rypkema
John J. Leonard
82
2
0
19 Mar 2024
ViTGaze: Gaze Following with Interaction Features in Vision Transformers
ViTGaze: Gaze Following with Interaction Features in Vision Transformers
Yuehao Song
Xinggang Wang
Jingfeng Yao
Wenyu Liu
Jinglin Zhang
Xiangmin Xu
ViT
85
3
0
19 Mar 2024
Selective, Interpretable, and Motion Consistent Privacy Attribute
  Obfuscation for Action Recognition
Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition
Filip Ilic
Henghui Zhao
Thomas Pock
Richard P. Wildes
PICVAAML
70
3
0
19 Mar 2024
Learning Cross-view Visual Geo-localization without Ground Truth
Learning Cross-view Visual Geo-localization without Ground Truth
Haoyuan Li
Chang Xu
Wen Yang
Huai Yu
Gui-Song Xia
96
11
0
19 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
68
21
0
19 Mar 2024
Pretraining Codomain Attention Neural Operators for Solving Multiphysics
  PDEs
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs
Md Ashiqur Rahman
Robert Joseph George
Mogab Elleithy
Daniel Leibovici
Zong-Yi Li
...
Julius Berner
Raymond A. Yeh
Jean Kossaifi
Kamyar Azizzadenesheli
A. Anandkumar
AI4CE
131
23
0
19 Mar 2024
NTK-Guided Few-Shot Class Incremental Learning
NTK-Guided Few-Shot Class Incremental Learning
Jingren Liu
Zhong Ji
Yanwei Pang
YunLong Yu
CLL
97
4
0
19 Mar 2024
Do Generated Data Always Help Contrastive Learning?
Do Generated Data Always Help Contrastive Learning?
Yifei Wang
Jizhe Zhang
Yisen Wang
DiffM
109
26
0
19 Mar 2024
OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation
OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation
Junhao Cai
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qifeng Chen
DiffM
87
8
0
19 Mar 2024
ADAPT to Robustify Prompt Tuning Vision Transformers
ADAPT to Robustify Prompt Tuning Vision Transformers
Masih Eskandar
Tooba Imtiaz
Zifeng Wang
Jennifer Dy
VPVLMVLMAAML
98
0
0
19 Mar 2024
Zero-Shot Image Feature Consensus with Deep Functional Maps
Zero-Shot Image Feature Consensus with Deep Functional Maps
Xinle Cheng
Congyue Deng
Adam W. Harley
Yixin Zhu
Leonidas Guibas
89
5
0
18 Mar 2024
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion
  Models
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han
Filippos Kokkinos
Philip Torr
VGen
143
42
0
18 Mar 2024
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Ce Zhang
Simon Stepputtis
Joseph Campbell
Katia Sycara
Yaqi Xie
112
13
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
124
135
0
18 Mar 2024
GenView: Enhancing View Quality with Pretrained Generative Model for
  Self-Supervised Learning
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li
Yibo Yang
Hefei Ling
Jianlong Wu
Yue Yu
Guohao Li
Min Zhang
SSL
103
6
0
18 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot
  Video Editing
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffMVGen
111
18
0
18 Mar 2024
Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction
Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction
Tobias Hallmen
Fabian Deuser
Norbert Oswald
Elisabeth André
84
2
0
18 Mar 2024
HVDistill: Transferring Knowledge from Images to Point Clouds via
  Unsupervised Hybrid-View Distillation
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation
Sha Zhang
Jiajun Deng
Lei Bai
Houqiang Li
Wanli Ouyang
Yanyong Zhang
3DPC
101
8
0
18 Mar 2024
End-to-end multi-modal product matching in fashion e-commerce
End-to-end multi-modal product matching in fashion e-commerce
Sándor Tóth
Stephen Wilson
Alexia Tsoukara
Enric Moreu
Anton Masalovich
Lars Roemheld
127
0
0
18 Mar 2024
Siamese Learning with Joint Alignment and Regression for
  Weakly-Supervised Video Paragraph Grounding
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
128
5
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
119
6
0
18 Mar 2024
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
VLM
58
4
0
17 Mar 2024
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic
  Segmentation
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
DiffM
94
8
0
17 Mar 2024
Correcting misinformation on social media with a large language model
Correcting misinformation on social media with a large language model
Xinyi Zhou
Ashish Sharma
Amy X. Zhang
Tim Althoff
KELM
91
5
0
17 Mar 2024
CGI-DM: Digital Copyright Authentication for Diffusion Models via
  Contrasting Gradient Inversion
CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
Xiaoyu Wu
Yang Hua
Chumeng Liang
Jiaru Zhang
Hao Wang
Tao Song
Haibing Guan
82
6
0
17 Mar 2024
Recent Advances in 3D Gaussian Splatting
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
163
106
0
17 Mar 2024
A Versatile Framework for Multi-scene Person Re-identification
A Versatile Framework for Multi-scene Person Re-identification
Wei-Shi Zheng
Junkai Yan
Yi-Xing Peng
VLM
74
6
0
17 Mar 2024
Self-supervised co-salient object detection via feature correspondence
  at multiple scales
Self-supervised co-salient object detection via feature correspondence at multiple scales
Souradeep Chakraborty
Dimitris Samaras
89
4
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with
  Diffusion Models
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
89
10
0
17 Mar 2024
Tokensome: Towards a Genetic Vision-Language GPT for Explainable and
  Cognitive Karyotyping
Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping
Haoxi Zhang
Xinxu Zhang
Yuanxin Lin
Maiqi Wang
Yi Lai
Yu Wang
Linfeng Yu
Yufeng Xu
Ran Cheng
E. Szczerbicki
86
0
0
17 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Zhen Chen
Jing Shao
Yixuan Yuan
VGenMedIm
127
41
0
17 Mar 2024
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields
Yash Bhalgat
Iro Laina
João F. Henriques
Andrew Zisserman
Andrea Vedaldi
97
17
0
16 Mar 2024
SelfIE: Self-Interpretation of Large Language Model Embeddings
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen
Carl Vondrick
Chengzhi Mao
77
27
0
16 Mar 2024
RetMIL: Retentive Multiple Instance Learning for Histopathological Whole
  Slide Image Classification
RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification
Hongbo Chu
Qiehe Sun
Jiawen Li
Yuxuan Chen
Lizhong Zhang
Tian Guan
Anjia Han
Yonghong He
58
5
0
16 Mar 2024
Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples
Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples
Ziqi Zhou
Minghui Li
Wei Liu
Shengshan Hu
Yechao Zhang
Wei Wan
Lulu Xue
Leo Yu Zhang
Dezhong Yao
Hai Jin
SILMAAML
114
11
0
16 Mar 2024
StableGarment: Garment-Centric Generation via Stable Diffusion
StableGarment: Garment-Centric Generation via Stable Diffusion
Rui Wang
Hailong Guo
Jiaming Liu
Huaxia Li
Haibo Zhao
Xu Tang
Yao Hu
Hao Tang
Peipei Li
DiffM
66
16
0
16 Mar 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Stephanie Fu
Mark Hamilton
Laura E. Brandt
Axel Feldmann
Zhoutong Zhang
William T. Freeman
MDE
99
51
0
15 Mar 2024
Few-Shot Image Classification and Segmentation as Visual Question
  Answering Using Vision-Language Models
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models
Tian Meng
Yang Tao
Ruilin Lyu
Wuliang Yin
VLM
86
1
0
15 Mar 2024
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities
  Transfer
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
Mengying Lin
Yaran Chen
Dong Zhao
Zhaoran Wang
118
2
0
15 Mar 2024
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
Enguang Wang
Zhimao Peng
Zhengyuan Xie
Fei Yang
Xialei Liu
Ming-Ming Cheng
137
3
0
15 Mar 2024
GroupContrast: Semantic-aware Self-supervised Representation Learning
  for 3D Understanding
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Chengyao Wang
Li Jiang
Xiaoyang Wu
Zhuotao Tian
Bohao Peng
Hengshuang Zhao
Jiaya Jia
3DPCSSL
140
17
0
14 Mar 2024
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary
  Robotic Grasping
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Yuhang Zheng
Xiangyu Chen
Yupeng Zheng
Songen Gu
Runyi Yang
...
Chao Yang
Dawei Wang
Zhen Chen
Xiaoxiao Long
Meiqing Wang
113
47
0
14 Mar 2024
EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning
EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning
Jongsuk Kim
Hyeongkeun Lee
Kyeongha Rho
Junmo Kim
Joon Son Chung
64
6
0
14 Mar 2024
Previous
123...363738...828384
Next