Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,176 papers shown
Title
Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification
Lucas Dedieu
Nicolas Nerrienet
A. Nivaggioli
Clara Simmat
Marceau Clavel
Arnaud Gauthier
Stéphane Sockeel
Rémy Peyret
NoLa
78
1
0
11 Apr 2024
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
Poulami Sinhamahapatra
Franziska Schwaiger
Shirsha Bose
Huiyu Wang
Karsten Roscher
Stephan Guennemann
79
1
0
11 Apr 2024
BRAVE: Broadening the visual encoding of vision-language models
Ouguzhan Fatih Kar
A. Tonioni
Petra Poklukar
Achin Kulshrestha
Amir Zamir
Federico Tombari
MLLM
VLM
80
32
0
10 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
82
9
0
10 Apr 2024
Wild Visual Navigation: Fast Traversability Learning via Pre-Trained Models and Online Self-Supervision
Matías Mattamala
Jonas Frey
Piotr Libera
Nived Chebrolu
Georg Martius
Cesar Cadena
Marco Hutter
Maurice F. Fallon
SSL
79
9
0
10 Apr 2024
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Hao Lu
Jiaqi Tang
Xinli Xu
Xu Cao
Yunpeng Zhang
Guoqing Wang
Dalong Du
Hao Chen
Ying-Cong Chen
86
3
0
10 Apr 2024
How to Craft Backdoors with Unlabeled Data Alone?
Yifei Wang
Wenhan Ma
Stefanie Jegelka
Yisen Wang
SyDa
61
0
0
10 Apr 2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
Luca Barsellotti
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
VLM
DiffM
95
14
0
09 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
88
6
0
09 Apr 2024
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping
Anas Gouda
Max Schwarz
Christopher Reining
Sven Behnke
Alice Kirchheim
VLM
60
0
0
09 Apr 2024
GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields
Arnab Dey
Di Yang
Rohith Agaram
A. Dantcheva
Andrew I. Comport
Srinath Sridhar
Jean Martinet
3DH
57
0
0
09 Apr 2024
HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields
Arnab Dey
Di Yang
A. Dantcheva
Jean Martinet
3DH
33
0
0
09 Apr 2024
Learning 3D-Aware GANs from Unposed Images with Template Feature Field
Xinya Chen
Hanlei Guo
Yanrui Bin
Shangzhan Zhang
Yuanbo Yang
Yue Wang
Yujun Shen
Yiyi Liao
3DH
66
3
0
08 Apr 2024
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Kunpeng Song
Yizhe Zhu
Bingchen Liu
Qing Yan
A. Elgammal
Xiao Yang
DiffM
84
22
0
08 Apr 2024
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Matteo Farina
Massimiliano Mancini
Elia Cunegatti
Gaowen Liu
Giovanni Iacca
Elisa Ricci
VLM
84
2
0
08 Apr 2024
Self-Explainable Affordance Learning with Embodied Caption
Zhipeng Zhang
Zhimin Wei
Guolei Sun
Peng Wang
Luc Van Gool
92
6
0
08 Apr 2024
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning
Mahsa Ehsanpour
Ian Reid
Hamid Rezatofighi
ViT
78
0
0
08 Apr 2024
CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery
Sai Bhargav Rongali
Sarthak Mehrotra
Ankit Jha
C. MohamadHassanN
Shirsha Bose
Tanisha Gupta
Mainak Singha
Biplab Banerjee
96
2
0
08 Apr 2024
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
Zhiqi Huang
Hui Xiong
Haoyu Wang
Longguang Wang
Zhiheng Li
DiffM
65
0
0
08 Apr 2024
MC
2
^2
2
: Multi-concept Guidance for Customized Multi-concept Generation
Jiaxiu Jiang
Yabo Zhang
Kailai Feng
Xiaohe Wu
Wangmeng Zuo
DiffM
113
12
0
08 Apr 2024
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology
Valentin Koch
S. J. Wagner
Salome Kazeminia
Ece Sancar
Matthias Hehr
Julia A. Schnabel
Tingying Peng
Carsten Marr
AI4CE
MedIm
88
8
0
07 Apr 2024
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang
Younggeun Lee
Seungjun Oh
Eunbyung Park
VGen
89
1
0
07 Apr 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas
Julien Philip
Kai Zhang
Sai Bi
Fujun Luan
Guohao Li
Kalyan Sunkavalli
DiffM
75
3
0
06 Apr 2024
Cluster-based Video Summarization with Temporal Context Awareness
Hai-Dang Huynh-Lam
Ngoc-Phuong Ho-Thi
Minh-Triet Tran
Trung-Truc Huynh-Le
51
2
0
06 Apr 2024
RoNet: Rotation-oriented Continuous Image Translation
Yi Li
Xinxiong Xie
Lina Lei
Haiyan Fu
Yanqing Guo
3DH
86
0
0
06 Apr 2024
Vision Transformers in Domain Adaptation and Generalization: A Study of Robustness
Shadi Alijani
Jamil Fayyad
Homayoun Najjaran
OOD
118
1
0
05 Apr 2024
LOSS-SLAM: Lightweight Open-Set Semantic Simultaneous Localization and Mapping
Kurran Singh
Tim Magoun
John J. Leonard
128
1
0
05 Apr 2024
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
117
1
0
04 Apr 2024
Test Time Training for Industrial Anomaly Segmentation
Alex Costanzino
Pierluigi Zama Ramirez
Mirko Del Moro
Agostino Aiezzo
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
77
0
0
04 Apr 2024
JUICER: Data-Efficient Imitation Learning for Robotic Assembly
Lars Ankile
Anthony Simeonov
Idan Shenfeld
Pulkit Agrawal
LM&Ro
123
19
0
04 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
134
28
0
04 Apr 2024
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng
Daniel Wang
Fengyu Yang
Hyoungseob Park
Yangchao Wu
Stefano Soatto
Byung-Woo Hong
Dong Lao
Alex Wong
MDE
136
28
0
04 Apr 2024
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
Sho Inayoshi
Aji Resindra Widya
Satoshi Ozaki
Junji Otsuka
Takeshi Ohashi
3DH
151
1
0
04 Apr 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
84
3
0
03 Apr 2024
A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
103
8
0
03 Apr 2024
Masked Completion via Structured Diffusion with White-Box Transformers
Druv Pai
Ziyang Wu
Sam Buchanan
Yaodong Yu
Yi-An Ma
72
14
0
03 Apr 2024
3D Congealing: 3D-Aware Image Alignment in the Wild
Yunzhi Zhang
Zizhang Li
Amit Raj
Andreas Engelhardt
Yuanzhen Li
Tingbo Hou
Jiajun Wu
Varun Jampani
3DV
76
0
0
02 Apr 2024
BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition
A. Haliassos
Andreas Zinonos
Rodrigo Mira
Stavros Petridis
Maja Pantic
VLM
SSL
AI4TS
87
13
0
02 Apr 2024
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation
V. Srivastav
Keqi Chen
N. Padoy
93
8
0
02 Apr 2024
Specularity Factorization for Low-Light Enhancement
A. S. Baslamisli
Noah Snavely. Intrinsic
86
4
0
02 Apr 2024
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
Gaurish Thakkar
Sherzod Hakimov
Marko Tadić
65
4
0
02 Apr 2024
A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification
Quanwei Liu
Yanni Dong
Wei Chen
Lefei Zhang
Bo Du
VLM
89
3
0
02 Apr 2024
Diffusion Deepfake
Chaitali Bhattacharyya
Hanxiao Wang
Feng Zhang
Sung-Ha Kim
Xiatian Zhu
73
5
0
02 Apr 2024
Can Biases in ImageNet Models Explain Generalization?
Paul Gavrikov
J. Keuper
OOD
VLM
67
15
0
01 Apr 2024
Measuring Style Similarity in Diffusion Models
Gowthami Somepalli
Anubhav Gupta
Kamal Gupta
Shramay Palta
Micah Goldblum
Jonas Geiping
Abhinav Shrivastava
Tom Goldstein
EGVM
103
42
0
01 Apr 2024
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
105
18
0
01 Apr 2024
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing
Ri-Zhao Qiu
Ge Yang
Weijia Zeng
Xiaolong Wang
3DGS
69
27
0
01 Apr 2024
SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining
Chull Hwan Song
Taebaek Hwang
Jooyoung Yoon
Shunghyun Choi
Yeong Hyeon Gu
52
5
0
01 Apr 2024
Machine Learning Robustness: A Primer
Houssem Ben Braiek
Foutse Khomh
AAML
OOD
106
8
0
01 Apr 2024
Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance
G. Nam
Byeongho Heo
Juho Lee
VLM
78
7
0
01 Apr 2024
Previous
1
2
3
...
34
35
36
...
82
83
84
Next