ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers
v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXiv (abs)PDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
Open World DETR: Transformer based Open World Object Detection
Open World DETR: Transformer based Open World Object Detection
Na Dong
Yongqiang Zhang
Mingli Ding
G. Lee
83
12
0
06 Dec 2022
Self-Supervised Audio-Visual Speech Representations Learning By
  Multimodal Self-Distillation
Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation
Jing-Xuan Zhang
Genshun Wan
Zhenhua Ling
Jia Pan
Jianqing Gao
Cong Liu
SSL
81
13
0
06 Dec 2022
Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases
Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases
Mazda Moayeri
Wenxiao Wang
Sahil Singla
Soheil Feizi
169
16
0
05 Dec 2022
PEANUT: Predicting and Navigating to Unseen Targets
PEANUT: Predicting and Navigating to Unseen Targets
Albert J. Zhai
Shenlong Wang
82
23
0
05 Dec 2022
One-shot Implicit Animatable Avatars with Model-based Priors
One-shot Implicit Animatable Avatars with Model-based Priors
Yangyi Huang
Hongwei Yi
Weiyang Liu
Haofan Wang
Boxi Wu
Wenxiao Wang
Binbin Lin
Debing Zhang
Deng Cai
3DH
122
33
0
05 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
70
14
0
05 Dec 2022
3D-LatentMapper: View Agnostic Single-View Reconstruction of 3D Shapes
3D-LatentMapper: View Agnostic Single-View Reconstruction of 3D Shapes
Alara Dirik
Pinar Yanardag
3DV
26
1
0
05 Dec 2022
Joint Self-Supervised Image-Volume Representation Learning with
  Intra-Inter Contrastive Clustering
Joint Self-Supervised Image-Volume Representation Learning with Intra-Inter Contrastive Clustering
D. M. Nguyen
Hoangvu Nguyen
M. T. N. Truong
T. Cao
Binh Duc Nguyen
Nhat Ho
Paul Swoboda
Shadi Albarqouni
P. Xie
Daniel Sonntag
SSL
76
21
0
04 Dec 2022
Self-supervised AutoFlow
Self-supervised AutoFlow
Hsin-Ping Huang
Charles Herrmann
Junhwa Hur
Erika Lu
Kyle Sargent
Austin Stone
Ming-Hsuan Yang
Deqing Sun
117
9
0
04 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual
  Representation
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
57
13
0
03 Dec 2022
A Domain-specific Perceptual Metric via Contrastive Self-supervised
  Representation: Applications on Natural and Medical Images
A Domain-specific Perceptual Metric via Contrastive Self-supervised Representation: Applications on Natural and Medical Images
Hongwei Bran Li
Chinmay Prabhakar
Suprosanna Shit
Johannes C. Paetzold
Tamaz Amiranashvili
Jianguo Zhang
Daniel Rueckert
Juan Eugenio Iglesias
Benedikt Wiestler
Bjoern Menze
OODSSL
64
3
0
03 Dec 2022
PROB: Probabilistic Objectness for Open World Object Detection
PROB: Probabilistic Objectness for Open World Object Detection
O. Zohar
Kuan-Chieh Wang
Serena Yeung
83
63
0
02 Dec 2022
PASTA: Proportional Amplitude Spectrum Training Augmentation for
  Syn-to-Real Domain Generalization
PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization
Prithvijit Chattopadhyay
Kartik Sarangmath
Vivek Vijaykumar
Judy Hoffman
143
29
0
02 Dec 2022
Improving Zero-Shot Models with Label Distribution Priors
Improving Zero-Shot Models with Label Distribution Priors
Jonathan Kahana
Niv Cohen
Yedid Hoshen
VLM
136
14
0
01 Dec 2022
Hyperbolic Contrastive Learning for Visual Representations beyond
  Objects
Hyperbolic Contrastive Learning for Visual Representations beyond Objects
Songwei Ge
Shlok Kumar Mishra
Simon Kornblith
Chun-Liang Li
David Jacobs
OCLSSL
129
57
0
01 Dec 2022
Parametric Information Maximization for Generalized Category Discovery
Parametric Information Maximization for Generalized Category Discovery
Florent Chiaroni
Jose Dolz
Imtiaz Masud Ziko
A. Mitiche
Ismail Ben Ayed
92
18
0
01 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
Spatio-Temporal Crop Aggregation for Video Representation Learning
Sepehr Sameni
Simon Jenni
Paolo Favaro
103
3
0
30 Nov 2022
Hierarchical Transformer for Survival Prediction Using Multimodality
  Whole Slide Images and Genomics
Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics
Chunyuan Li
Xinliang Zhu
Jiawen Yao
Junzhou Huang
MedIm
64
13
0
29 Nov 2022
SparsePose: Sparse-View Camera Pose Regression and Refinement
SparsePose: Sparse-View Camera Pose Regression and Refinement
Samarth Sinha
Jason Y. Zhang
Andrea Tagliasacchi
Igor Gilitschenski
David B. Lindell
88
44
0
29 Nov 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip Torr
Song Bai
UQCVNoLa
74
5
0
29 Nov 2022
A Visual Active Search Framework for Geospatial Exploration
A Visual Active Search Framework for Geospatial Exploration
Anindya Sarkar
Michael Lanier
Scott Alfeld
Jiarui Feng
Roman Garnett
Nathan Jacobs
Yevgeniy Vorobeychik
92
7
0
28 Nov 2022
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose
  Visual Representation
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation
Jiangyong Huang
William Zhu
Baoxiong Jia
Zan Wang
Xiaojian Ma
Qing Li
Siyuan Huang
123
5
0
28 Nov 2022
A Light Touch Approach to Teaching Transformers Multi-view Geometry
A Light Touch Approach to Teaching Transformers Multi-view Geometry
Yash Bhalgat
Joao F. Henriques
Andrew Zisserman
ViT
101
6
0
28 Nov 2022
Learning Dense Object Descriptors from Multiple Views for Low-shot
  Category Generalization
Learning Dense Object Descriptors from Multiple Views for Low-shot Category Generalization
Stefan Stojanov
Anh Thai
Zixuan Huang
James M. Rehg
101
2
0
28 Nov 2022
Leveraging Image Matching Toward End-to-End Relative Camera Pose
  Regression
Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression
Fadi Khatib
Yuval Margalit
Meirav Galun
Ronen Basri
67
2
0
27 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary
  Semantic Segmentation
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
122
153
0
27 Nov 2022
Dynamic Feature Pruning and Consolidation for Occluded Person
  Re-Identification
Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification
Yuteng Ye
Hang Zhou
Jiale Cai
Chenxing Gao
Youjia Zhang
Junle Wang
Qiang Hu
Junqing Yu
Wei Yang
69
6
0
27 Nov 2022
A Unified Framework for Contrastive Learning from a Perspective of
  Affinity Matrix
A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix
Wenbin Li
Meihao Kong
Xuesong Yang
Lei Wang
Jing Huo
Yang Gao
Jiebo Luo
40
0
0
26 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic
  Segmentation
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
89
20
0
26 Nov 2022
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video
  Representation Learning
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning
Pritam Sarkar
Ali Etemad
112
23
0
25 Nov 2022
Adaptive Attention Link-based Regularization for Vision Transformers
Adaptive Attention Link-based Regularization for Vision Transformers
Heegon Jin
Jongwon Choi
ViT
87
0
0
25 Nov 2022
Self-Supervised Learning based on Heat Equation
Self-Supervised Learning based on Heat Equation
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
77
4
0
23 Nov 2022
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
R. Burgert
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffMVLM
88
38
0
23 Nov 2022
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event
  Classification
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification
Sara Atito
Muhammad Awais
Wenwu Wang
Mark D. Plumbley
J. Kittler
ViT
71
11
0
23 Nov 2022
Unsupervised 3D Keypoint Discovery with Multi-View Geometry
Unsupervised 3D Keypoint Discovery with Multi-View Geometry
S. Honari
Chen Zhao
Mathieu Salzmann
Pascal Fua
3DH
70
1
0
23 Nov 2022
Reason from Context with Self-supervised Learning
Reason from Context with Self-supervised Learning
Xinyu Liu
Ankur Sikarwar
Gabriel Kreiman
Zenglin Shi
Mengmi Zhang
ReLMLRM
94
1
0
23 Nov 2022
PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields
  Based on Few-shot Inputs
PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields Based on Few-shot Inputs
Young Chun Ahn
Seokhwan Jang
Sungheon Park
Ji-Yeon Kim
Nahyup Kang
93
12
0
23 Nov 2022
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token
  Migration
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
Yunjie Tian
Lingxi Xie
Jihao Qiu
Jianbin Jiao
Yaowei Wang
Qi Tian
Qixiang Ye
ViT
98
7
0
23 Nov 2022
MagicPony: Learning Articulated 3D Animals in the Wild
MagicPony: Learning Articulated 3D Animals in the Wild
Shangzhe Wu
Ruining Li
Tomas Jakab
Christian Rupprecht
Andrea Vedaldi
ViT
84
77
0
22 Nov 2022
On the Transferability of Visual Features in Generalized Zero-Shot
  Learning
On the Transferability of Visual Features in Generalized Zero-Shot Learning
Paola Cascante-Bonilla
Leonid Karlinsky
James Smith
Yanjun Qi
Vicente Ordonez
75
2
0
22 Nov 2022
Exemplar-free Continual Learning of Vision Transformers via Gated
  Class-Attention and Cascaded Feature Drift Compensation
Exemplar-free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation
Marco Cotogni
Fei Yang
C. Cusano
Andrew D. Bagdanov
Joost van de Weijer
CLL
93
0
0
22 Nov 2022
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural
  Radiance Fields
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
J. Kelly
Marcus A. Brubaker
Igor Gilitschenski
Alex Levinshtein
103
116
0
22 Nov 2022
Towards Automated Polyp Segmentation Using Weakly- and Semi-Supervised
  Learning and Deformable Transformers
Towards Automated Polyp Segmentation Using Weakly- and Semi-Supervised Learning and Deformable Transformers
Guangyu Ren
Michalis Lazarou
Jing Yuan
Tania Stathaki
ViTMedIm
54
9
0
21 Nov 2022
Last-Mile Embodied Visual Navigation
Last-Mile Embodied Visual Navigation
Justin Wasserman
Karmesh Yadav
Girish Chowdhary
Abhi Gupta
Unnat Jain
108
34
0
21 Nov 2022
Parametric Classification for Generalized Category Discovery: A Baseline
  Study
Parametric Classification for Generalized Category Discovery: A Baseline Study
Xin Wen
Bingchen Zhao
Xiaojuan Qi
94
76
0
21 Nov 2022
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space
  Viewpoint
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
Hongyu Liu
Yibing Song
Qifeng Chen
DiffM
96
21
0
21 Nov 2022
Unifying Vision-Language Representation Space with Single-tower
  Transformer
Unifying Vision-Language Representation Space with Single-tower Transformer
Jiho Jang
Chaerin Kong
D. Jeon
Seonhoon Kim
Nojun Kwak
113
21
0
21 Nov 2022
MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection
MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection
D. Coccomini
Giorgos Kordopatis-Zilos
Giuseppe Amato
R. Caldelli
Fabrizio Falchi
Symeon Papadopoulos
Claudio Gennaro
93
16
0
20 Nov 2022
Rethinking Batch Sample Relationships for Data Representation: A
  Batch-Graph Transformer based Approach
Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer based Approach
Xixi Wang
Bowei Jiang
Tianlin Li
Bin Luo
ViT
111
5
0
19 Nov 2022
Bidirectional Generation of Structure and Properties Through a Single
  Molecular Foundation Model
Bidirectional Generation of Structure and Properties Through a Single Molecular Foundation Model
Jinho Chang
Jong Chul Ye
AI4CE
67
36
0
19 Nov 2022
Previous
123...676869...828384
Next