ResearchTrend.AI
Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown
Title
Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification
Lucas Dedieu
Nicolas Nerrienet
A. Nivaggioli
Clara Simmat
Marceau Clavel
Arnaud Gauthier
Stéphane Sockeel
Rémy Peyret
NoLa
78
1
0
11 Apr 2024
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
Poulami Sinhamahapatra
Franziska Schwaiger
Shirsha Bose
Huiyu Wang
Karsten Roscher
Stephan Guennemann
79
1
0
11 Apr 2024
BRAVE: Broadening the visual encoding of vision-language models
Oğuzhan Fatih Kar
A. Tonioni
Petra Poklukar
Achin Kulshrestha
Amir Zamir
Federico Tombari
MLLM, VLM
80
32
0
10 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
82
9
0
10 Apr 2024
Wild Visual Navigation: Fast Traversability Learning via Pre-Trained Models and Online Self-Supervision
Matías Mattamala
Jonas Frey
Piotr Libera
Nived Chebrolu
Georg Martius
Cesar Cadena
Marco Hutter
Maurice F. Fallon
SSL
79
9
0
10 Apr 2024
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Hao Lu
Jiaqi Tang
Xinli Xu
Xu Cao
Yunpeng Zhang
Guoqing Wang
Dalong Du
Hao Chen
Ying-Cong Chen
86
3
0
10 Apr 2024
How to Craft Backdoors with Unlabeled Data Alone?
Yifei Wang
Wenhan Ma
Stefanie Jegelka
Yisen Wang
SyDa
61
0
0
10 Apr 2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
Luca Barsellotti
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
VLM, DiffM
95
14
0
09 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
88
6
0
09 Apr 2024
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping
Anas Gouda
Max Schwarz
Christopher Reining
Sven Behnke
Alice Kirchheim
VLM
60
0
0
09 Apr 2024
GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields
Arnab Dey
Di Yang
Rohith Agaram
A. Dantcheva
Andrew I. Comport
Srinath Sridhar
Jean Martinet
3DH
57
0
0
09 Apr 2024
HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields
Arnab Dey
Di Yang
A. Dantcheva
Jean Martinet
3DH
33
0
0
09 Apr 2024
Learning 3D-Aware GANs from Unposed Images with Template Feature Field
Xinya Chen
Hanlei Guo
Yanrui Bin
Shangzhan Zhang
Yuanbo Yang
Yue Wang
Yujun Shen
Yiyi Liao
3DH
66
3
0
08 Apr 2024
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Kunpeng Song
Yizhe Zhu
Bingchen Liu
Qing Yan
A. Elgammal
Xiao Yang
DiffM
84
22
0
08 Apr 2024
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Matteo Farina
Massimiliano Mancini
Elia Cunegatti
Gaowen Liu
Giovanni Iacca
Elisa Ricci
VLM
84
2
0
08 Apr 2024
Self-Explainable Affordance Learning with Embodied Caption
Zhipeng Zhang
Zhimin Wei
Guolei Sun
Peng Wang
Luc Van Gool
92
6
0
08 Apr 2024
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning
Mahsa Ehsanpour
Ian Reid
Hamid Rezatofighi
ViT
78
0
0
08 Apr 2024
CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery
Sai Bhargav Rongali
Sarthak Mehrotra
Ankit Jha
C. MohamadHassanN
Shirsha Bose
Tanisha Gupta
Mainak Singha
Biplab Banerjee
96
2
0
08 Apr 2024
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
Zhiqi Huang
Hui Xiong
Haoyu Wang
Longguang Wang
Zhiheng Li
DiffM
65
0
0
08 Apr 2024
MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation
Jiaxiu Jiang
Yabo Zhang
Kailai Feng
Xiaohe Wu
Wangmeng Zuo
DiffM
113
12
0
08 Apr 2024
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology
Valentin Koch
S. J. Wagner
Salome Kazeminia
Ece Sancar
Matthias Hehr
Julia A. Schnabel
Tingying Peng
Carsten Marr
AI4CE, MedIm
88
8
0
07 Apr 2024
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang
Younggeun Lee
Seungjun Oh
Eunbyung Park
VGen
89
1
0
07 Apr 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas
Julien Philip
Kai Zhang
Sai Bi
Fujun Luan
Guohao Li
Kalyan Sunkavalli
DiffM
75
3
0
06 Apr 2024
Cluster-based Video Summarization with Temporal Context Awareness
Hai-Dang Huynh-Lam
Ngoc-Phuong Ho-Thi
Minh-Triet Tran
Trung-Truc Huynh-Le
51
2
0
06 Apr 2024
RoNet: Rotation-oriented Continuous Image Translation
Yi Li
Xinxiong Xie
Lina Lei
Haiyan Fu
Yanqing Guo
3DH
86
0
0
06 Apr 2024
Vision Transformers in Domain Adaptation and Generalization: A Study of Robustness
Shadi Alijani
Jamil Fayyad
Homayoun Najjaran
OOD
118
1
0
05 Apr 2024
LOSS-SLAM: Lightweight Open-Set Semantic Simultaneous Localization and Mapping
Kurran Singh
Tim Magoun
John J. Leonard
128
1
0
05 Apr 2024
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
117
1
0
04 Apr 2024
Test Time Training for Industrial Anomaly Segmentation
Alex Costanzino
Pierluigi Zama Ramirez
Mirko Del Moro
Agostino Aiezzo
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
77
0
0
04 Apr 2024
JUICER: Data-Efficient Imitation Learning for Robotic Assembly
Lars Ankile
Anthony Simeonov
Idan Shenfeld
Pulkit Agrawal
LM&Ro
123
19
0
04 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
134
28
0
04 Apr 2024
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng
Daniel Wang
Fengyu Yang
Hyoungseob Park
Yangchao Wu
Stefano Soatto
Byung-Woo Hong
Dong Lao
Alex Wong
MDE
136
28
0
04 Apr 2024
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
Sho Inayoshi
Aji Resindra Widya
Satoshi Ozaki
Junji Otsuka
Takeshi Ohashi
3DH
151
1
0
04 Apr 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
84
3
0
03 Apr 2024
A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
103
8
0
03 Apr 2024
Masked Completion via Structured Diffusion with White-Box Transformers
Druv Pai
Ziyang Wu
Sam Buchanan
Yaodong Yu
Yi-An Ma
72
14
0
03 Apr 2024
3D Congealing: 3D-Aware Image Alignment in the Wild
Yunzhi Zhang
Zizhang Li
Amit Raj
Andreas Engelhardt
Yuanzhen Li
Tingbo Hou
Jiajun Wu
Varun Jampani
3DV
76
0
0
02 Apr 2024
BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition
A. Haliassos
Andreas Zinonos
Rodrigo Mira
Stavros Petridis
Maja Pantic
VLM, SSL, AI4TS
87
13
0
02 Apr 2024
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation
V. Srivastav
Keqi Chen
N. Padoy
93
8
0
02 Apr 2024
Specularity Factorization for Low-Light Enhancement
A. S. Baslamisli
Noah Snavely
86
4
0
02 Apr 2024
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
Gaurish Thakkar
Sherzod Hakimov
Marko Tadić
65
4
0
02 Apr 2024
A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification
Quanwei Liu
Yanni Dong
Wei Chen
Lefei Zhang
Bo Du
VLM
89
3
0
02 Apr 2024
Diffusion Deepfake
Chaitali Bhattacharyya
Hanxiao Wang
Feng Zhang
Sung-Ha Kim
Xiatian Zhu
73
5
0
02 Apr 2024
Can Biases in ImageNet Models Explain Generalization?
Paul Gavrikov
J. Keuper
OOD, VLM
67
15
0
01 Apr 2024
Measuring Style Similarity in Diffusion Models
Gowthami Somepalli
Anubhav Gupta
Kamal Gupta
Shramay Palta
Micah Goldblum
Jonas Geiping
Abhinav Shrivastava
Tom Goldstein
EGVM
103
42
0
01 Apr 2024
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
105
18
0
01 Apr 2024
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing
Ri-Zhao Qiu
Ge Yang
Weijia Zeng
Xiaolong Wang
3DGS
69
27
0
01 Apr 2024
SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining
Chull Hwan Song
Taebaek Hwang
Jooyoung Yoon
Shunghyun Choi
Yeong Hyeon Gu
52
5
0
01 Apr 2024
Machine Learning Robustness: A Primer
Houssem Ben Braiek
Foutse Khomh
AAML, OOD
106
8
0
01 Apr 2024
Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance
G. Nam
Byeongho Heo
Juho Lee
VLM
78
7
0
01 Apr 2024