arXiv:2104.14294
Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Domain-Invariant Representation Learning of Bird Sounds
Ilyass Moummad
Romain Serizel
Emmanouil Benetos
Nicolas Farrugia
SSL
103
2
0
13 Sep 2024
Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense
Jeremy A. Styborski
Mingzhi Lyu
Yunpeng Huang
Adams Kong
113
0
0
13 Sep 2024
VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Hanning Chen
Yang Ni
Wenjun Huang
Yezi Liu
SungHeon Jeong
Fei Wen
Nathaniel D. Bastian
Hugo Latapie
Mohsen Imani
VLM
85
4
0
13 Sep 2024
Autoregressive Sequence Modeling for 3D Medical Image Representation
Siwen Wang
Churan Wang
Fei Gao
Lixian Su
Fandong Zhang
Yizhou Wang
Yizhou Yu
MedIm
127
1
0
13 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
He Zhang
Yizhi Song
Nathan Jacobs
DiffM
156
9
0
13 Sep 2024
Click2Mask: Local Editing with Dynamic Mask Generation
Omer Regev
Omri Avrahami
Dani Lischinski
DiffM
118
2
0
12 Sep 2024
MagicStyle: Portrait Stylization Based on Reference Image
Zhaoli Deng
Kaibin Zhou
Fanyi Wang
Zhenpeng Mi
DiffM
103
1
0
12 Sep 2024
Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee
Minsoo Kang
Bohyung Han
DiffM VLM
43
3
0
12 Sep 2024
Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?
Kerem Cekmeceli
Meva Himmetoglu
G. I. Tombak
A. Susmelj
Ertunc Erdil
E. Konukoglu
MedIm
59
3
0
12 Sep 2024
SURGIVID: Annotation-Efficient Surgical Video Object Discovery
Çağhan Köksal
Ghazal Ghazaei
Nassir Navab
63
1
0
12 Sep 2024
Foundation Models Boost Low-Level Perceptual Similarity Metrics
Abhijay Ghildyal
Nabajeet Barman
Saman Zadtootaghaj
99
4
0
11 Sep 2024
Token Turing Machines are Efficient Vision Models
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiruvathukal
James C. Davis
Yung-Hsiang Lu
181
0
0
11 Sep 2024
Self-Masking Networks for Unsupervised Adaptation
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
84
2
0
11 Sep 2024
Unsupervised Point Cloud Registration with Self-Distillation
Christian Lowens
Thorben Funke
André Wagner
Alexandru Paul Condurache
3DPC
93
1
0
11 Sep 2024
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Sijie Zhao
Wenbo Hu
Xiaodong Cun
Yong Zhang
Xiaoyu Li
Zhe Kong
Xiangjun Gao
Muyao Niu
Ying Shan
VGen DiffM MDE
99
11
0
11 Sep 2024
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations
Keumgang Cha
Donggeun Yu
Junghoon Seo
VLM
76
1
0
11 Sep 2024
What to align in multimodal contrastive learning?
Benoit Dufumier
J. Castillo-Navarro
D. Tuia
Jean-Philippe Thiran
156
4
0
11 Sep 2024
Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks
Debjyoti Mondal
Rahul Mishra
Chandan Pandey
76
0
0
10 Sep 2024
High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study
Shijie Chang
Lihe Zhang
Huchuan Lu
VLM
69
1
0
10 Sep 2024
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding
Ji Ha Jang
H. Seo
Se Young Chun
93
3
0
10 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
131
0
0
10 Sep 2024
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
Amin Karimi Monsefi
Kishore Prakash Sailaja
Ali Alilooee
Ser-Nam Lim
R. Ramnath
VLM
102
9
0
10 Sep 2024
Evaluating Multiview Object Consistency in Humans and Image Models
Tyler Bonnen
Stephanie Fu
Yutong Bai
Thomas P. O'Connell
Yoni Friedman
Nancy Kanwisher
J. Tenenbaum
Alexei A. Efros
47
6
0
09 Sep 2024
Leveraging Object Priors for Point Tracking
Bikram Boote
Anh Thai
Wenqi Jia
Ozgur Kara
Stefan Stojanov
James M. Rehg
Sangmin Lee
3DPC
63
0
0
09 Sep 2024
ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL
Safwen Naimi
W. Bouachir
Guillaume-Alexandre Bilodeau
ViT
68
3
0
09 Sep 2024
NeIn: Telling What You Don't Want
Nhat-Tan Bui
Dinh-Hieu Hoang
Quoc-Huy Trinh
Minh-Triet Tran
Truong Nguyen
Susan Gauch
146
2
0
09 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
78
1
0
08 Sep 2024
Explicit Mutual Information Maximization for Self-Supervised Learning
Lele Chang
Peilin Liu
Qinghai Guo
Fei Wen
SSL
87
0
0
07 Sep 2024
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
Changlin Li
Jiawei Zhang
Sihao Lin
Zongxin Yang
Junwei Liang
Xiaodan Liang
Xiaojun Chang
VLM
70
0
0
06 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
72
0
0
06 Sep 2024
Organized Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
VOS OCL
144
1
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
140
23
0
05 Sep 2024
CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently
Jonathan Zalach
Inbal Gazy
Assaf Avinoam
Ron Sinai
Eran Shmuel
Inbar Gilboa
Christine Swisher
Naim Matasci
Reva Basho
David B. Agus
76
0
0
04 Sep 2024
Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning
Raphael Lafargue
Luke Smith
Franck Vermet
Mathias Löwe
Ian Reid
Vincent Gripon
Jack Valmadre
82
0
0
04 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
124
2
0
04 Sep 2024
LinFusion: 1 GPU, 1 Minute, 16K Image
Songhua Liu
Weihao Yu
Zhenxiong Tan
Xinchao Wang
121
16
0
03 Sep 2024
Dual Advancement of Representation Learning and Clustering for Sparse and Noisy Images
Wenlin Li
Yucheng Xu
Xiaoqing Zheng
Suoya Han
Jun Wang
Xiaobo Sun
103
1
0
03 Sep 2024
Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization
Avraham Chapman
Haiming Xu
Lingqiao Liu
80
0
0
03 Sep 2024
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Wenlong Huang
Chen Wang
Yongqian Li
Ruohan Zhang
Li Fei-Fei
134
115
0
03 Sep 2024
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
Xiaolong Wang
Zhi-Qi Cheng
Jue Wang
Xiaojiang Peng
DiffM
53
0
0
02 Sep 2024
VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization
Yixuan Zhou
Xing Xu
Zhe Sun
Jingkuan Song
A. Cichocki
Heng Tao Shen
123
1
0
02 Sep 2024
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Anas Mahmoud
Ali Harakeh
Steven Waslander
70
0
0
01 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas Guibas
DD
91
5
0
30 Aug 2024
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers
Asifullah Khan
A. Sohail
Mustansar Fiaz
Mehdi Hassan
Tariq Habib Afridi
...
Muhammad Zaigham Zaheer
Kamran Ali
Tangina Sultana
Ziaurrehman Tanoli
Naeem Akhter
274
5
0
30 Aug 2024
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
Moreno D'Incà
E. Peruzzo
Massimiliano Mancini
Xingqian Xu
Humphrey Shi
N. Sebe
103
0
0
29 Aug 2024
Identifying Terrain Physical Parameters from Vision -- Towards Physical-Parameter-Aware Locomotion and Navigation
Jiaqi Chen
Jonas Frey
Ruyi Zhou
Takahiro Miki
Georg Martius
Marco Hutter
102
11
0
29 Aug 2024
Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
107
0
0
29 Aug 2024
SSDM: Scalable Speech Dysfluency Modeling
Jiachen Lian
Xuanru Zhou
Z. Ezzes
Jet M J Vonk
Brittany Morin
D. Baquirin
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
AuLLM
113
4
0
29 Aug 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
116
0
0
29 Aug 2024
CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization
Feize Wu
Yun Pang
Junyi Zhang
Lianyu Pang
Jian Yin
Baoquan Zhao
Qing Li
Xudong Mao
DiffM
86
5
0
28 Aug 2024