ResearchTrend.AI
Emerging Properties in Self-Supervised Vision Transformers


29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 1,271 papers shown
Title
OmniGlue: Generalizable Feature Matching with Foundation Model Guidance
Hanwen Jiang
Arjun Karpur
Bingyi Cao
Qixing Huang
André Araujo
VLM
36
29
0
21 May 2024
Personalized Residuals for Concept-Driven Text-to-Image Generation
Cusuh Ham
Matthew Fisher
James Hays
Nicholas I. Kolkin
Yuchen Liu
Richard Y. Zhang
Tobias Hinz
DiffM
50
7
0
21 May 2024
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
Zeyu Zhang
Yiran Wang
Biao Wu
Shuo Chen
Zhiyuan Zhang
Shiya Huang
Wenbo Zhang
Meng Fang
Ling-Hao Chen
Yang Zhao
VGen
46
6
0
18 May 2024
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll
Iramm Hamdard
Lukas Namgyu Rößler
Felix Geisler
Muhammed Bayram
...
Miguel de Campos
Anushervon Tabarov
Yitian Yang
Hanno Gottschalk
J. Marius Zöllner
42
5
0
13 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
44
2
0
11 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
37
14
0
08 May 2024
$M^2D$NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields
N. Wang
Lefei Zhang
Angel X Chang
55
0
0
08 May 2024
BenthicNet: A global compilation of seafloor images for deep learning applications
Scott C. Lowe
B. Misiuk
Isaac Xu
Shakhboz Abdulazizov
A. R. Baroi
...
Jordan A. Thomson
Brittany R. Wilson
Melisa C. Wong
Craig J. Brown
Thomas Trappenberg
49
3
0
08 May 2024
A Review on Discriminative Self-supervised Learning Methods in Computer Vision
Nikolaos Giakoumoglou
Tania Stathaki
Athanasios Gkelias
SSL
64
1
0
08 May 2024
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Minh-Triet Tran
Adrian de Luis
Haitao Liao
Ying Huang
Roy McCann
Alan Mantooth
Jack Cothren
Ngan Le
90
0
0
07 May 2024
Telextiles: End-to-end Remote Transmission of Fabric Tactile Sensation
Takekazu Kitagishi
Yuichi Hiroi
Yuna Watanabe
Yuta Itoh
Jun Rekimoto
24
5
0
06 May 2024
Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images
Yik San Cheng
Runkai Zhao
Heng Wang
Hanchuan Peng
Weidong Cai
ViT
21
1
0
04 May 2024
CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation
Chenying Liu
C. Albrecht
Yi Wang
Xiao Xiang Zhu
65
2
0
02 May 2024
X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models
Emmanuelle Bourigault
Abdullah Hamdi
Amir Jamaludin
MedIm
56
2
0
30 Apr 2024
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLM
LRM
95
139
0
29 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
39
15
0
28 Apr 2024
Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class
Mazda Moayeri
Michael G. Rabbat
Mark Ibrahim
Diane Bouchacourt
VLM
52
1
0
25 Apr 2024
Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Ayumu Saito
Prachi Kudeshia
Jiju Poovvancheri
3DPC
45
7
0
25 Apr 2024
Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition
Yu Wang
Sanping Zhou
Kun Xia
Le Wang
42
0
0
25 Apr 2024
MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis
Jiaxin Zhuang
Linshan Wu
Qiong Wang
V. Vardhanabhuti
Lin Luo
Hao Chen
57
4
0
24 Apr 2024
Understanding Hyperbolic Metric Learning through Hard Negative Sampling
Yun Yue
Fangzhou Lin
Guanyi Mou
Ziming Zhang
SSL
30
1
0
23 Apr 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
38
7
0
22 Apr 2024
A Multimodal Automated Interpretability Agent
Tamar Rott Shaham
Sarah Schwettmann
Franklin Wang
Achyuta Rajaram
Evan Hernandez
Jacob Andreas
Antonio Torralba
34
18
0
22 Apr 2024
MeshLRM: Large Reconstruction Model for High-Quality Meshes
Xinyue Wei
Kai Zhang
Sai Bi
Hao Tan
Fujun Luan
Valentin Deschaintre
Kalyan Sunkavalli
Hao Su
Zexiang Xu
AI4CE
110
73
0
18 Apr 2024
Contrastive Mean-Shift Learning for Generalized Category Discovery
Sua Choi
Dahyun Kang
Minsu Cho
29
10
0
15 Apr 2024
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
Yuwei Tang
Zhenyi Lin
Qilong Wang
Pengfei Zhu
Qinghua Hu
36
11
0
13 Apr 2024
Towards Sim-to-Real Industrial Parts Classification with Synthetic Dataset
Xiaomeng Zhu
Talha Bilal
Pär Mårtensson
Lars Hanson
Mårten Björkman
A. Maki
41
11
0
12 Apr 2024
OmniSat: Self-Supervised Modality Fusion for Earth Observation
Guillaume Astruc
Nicolas Gonthier
Clement Mallet
Loic Landrieu
38
25
0
12 Apr 2024
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Hao Lu
Jiaqi Tang
Xinli Xu
Xu Cao
Yunpeng Zhang
Guoqing Wang
Dalong Du
Hao Chen
Ying-Cong Chen
35
3
0
10 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
47
5
0
09 Apr 2024
Self-Explainable Affordance Learning with Embodied Caption
Zhipeng Zhang
Zhimin Wei
Guolei Sun
Peng Wang
Luc Van Gool
50
3
0
08 Apr 2024
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning
Mahsa Ehsanpour
Ian Reid
Hamid Rezatofighi
ViT
37
0
0
08 Apr 2024
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology
Valentin Koch
S. J. Wagner
Salome Kazeminia
Ece Sancar
Matthias Hehr
Julia A. Schnabel
Tingying Peng
Carsten Marr
AI4CE
MedIm
27
5
0
07 Apr 2024
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang
Younggeun Lee
Seungjun Oh
Eunbyung Park
VGen
35
1
0
07 Apr 2024
RoNet: Rotation-oriented Continuous Image Translation
Yi Li
Xinxiong Xie
Lina Lei
Haiyan Fu
Yanqing Guo
3DH
44
0
0
06 Apr 2024
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
39
1
0
04 Apr 2024
JUICER: Data-Efficient Imitation Learning for Robotic Assembly
Lars Ankile
Anthony Simeonov
Idan Shenfeld
Pulkit Agrawal
LM&Ro
42
15
0
04 Apr 2024
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
Sho Inayoshi
Aji Resindra Widya
Satoshi Ozaki
Junji Otsuka
Takeshi Ohashi
3DH
52
1
0
04 Apr 2024
Specularity Factorization for Low-Light Enhancement
A. S. Baslamisli
Noah Snavely
47
4
0
02 Apr 2024
A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification
Quanwei Liu
Yanni Dong
Tao Huang
Lefei Zhang
Bo Du
VLM
39
1
0
02 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
Barbara Toniella Corradini
Mustafa Shukor
Paul Couairon
Guillaume Couairon
Franco Scarselli
Matthieu Cord
DiffM
VLM
45
4
0
29 Mar 2024
The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation
Ozgu Goksu
Nicolas Pugeault
SSL
37
0
0
28 Mar 2024
Decoding the visual attention of pathologists to reveal their level of expertise
Souradeep Chakraborty
Dana Perez
Paul Friedman
Natallia Sheuka
Constantin Friedman
Oksana Yaskiv
Rajarsi R. Gupta
G. Zelinsky
Joel H. Saltz
Dimitris Samaras
MedIm
39
0
0
25 Mar 2024
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
Yang Chen
Yingwei Pan
Haibo Yang
Ting Yao
Tao Mei
DiffM
42
18
0
25 Mar 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
57
9
0
25 Mar 2024
Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning
Hasindri Watawana
Kanchana Ranasinghe
Tariq Mahmood
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
SSL
43
4
0
21 Mar 2024
VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition
Yun-Jin Li
M. Gladkova
Yan Xia
Rui Wang
Daniel Cremers
37
5
0
21 Mar 2024
NTK-Guided Few-Shot Class Incremental Learning
Jingren Liu
Zhong Ji
Yanwei Pang
YunLong Yu
CLL
39
3
0
19 Mar 2024
ADAPT to Robustify Prompt Tuning Vision Transformers
Masih Eskandar
Tooba Imtiaz
Zifeng Wang
Jennifer Dy
VPVLM
VLM
AAML
38
0
0
19 Mar 2024