ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.14294
  4. Cited By
Emerging Properties in Self-Supervised Vision Transformers

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
ArXivPDFHTML

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 1,249 papers shown
Title
FreeSOLO: Learning to Segment Objects without Annotations
FreeSOLO: Learning to Segment Objects without Annotations
Xinlong Wang
Zhiding Yu
Shalini De Mello
Jan Kautz
Anima Anandkumar
Chunhua Shen
J. Álvarez
ISeg
SSeg
24
112
0
24 Feb 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu
Shalini De Mello
Sifei Liu
Wonmin Byeon
Thomas Breuel
Jan Kautz
Xinyu Wang
ViT
VLM
189
499
0
22 Feb 2022
Assessing the State of Self-Supervised Human Activity Recognition using
  Wearables
Assessing the State of Self-Supervised Human Activity Recognition using Wearables
H. Haresamudram
Irfan Essa
Thomas Plötz
SSL
42
86
0
22 Feb 2022
CaMEL: Mean Teacher Learning for Image Captioning
CaMEL: Mean Teacher Learning for Image Captioning
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
35
27
0
21 Feb 2022
A Self-Supervised Descriptor for Image Copy Detection
A Self-Supervised Descriptor for Image Copy Detection
Ed Pizzi
Sreya . Dutta Roy
Sugosh Nagavara Ravindra
Priya Goyal
Matthijs Douze
SSL
34
117
0
21 Feb 2022
Vision Models Are More Robust And Fair When Pretrained On Uncurated
  Images Without Supervision
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Priya Goyal
Quentin Duval
Isaac Seessel
Mathilde Caron
Ishan Misra
Levent Sagun
Armand Joulin
Piotr Bojanowski
VLM
SSL
26
110
0
16 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Meta Knowledge Distillation
Meta Knowledge Distillation
Jihao Liu
Boxiao Liu
Hongsheng Li
Yu Liu
18
25
0
16 Feb 2022
ScoreNet: Learning Non-Uniform Attention and Augmentation for
  Transformer-Based Histopathological Image Classification
ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification
Thomas Stegmüller
Behzad Bozorgtabar
A. Spahr
Jean-Philippe Thiran
ViT
MedIm
21
42
0
15 Feb 2022
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho
Sunghwan Hong
Seung Wook Kim
ViT
27
34
0
14 Feb 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
Multi-Modal Knowledge Graph Construction and Application: A Survey
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Penglei Sun
Xuwu Wang
Yanghua Xiao
N. Yuan
28
154
0
11 Feb 2022
Energy-Based Contrastive Learning of Visual Representations
Energy-Based Contrastive Learning of Visual Representations
Beomsu Kim
Jong Chul Ye
20
16
0
10 Feb 2022
Point-Level Region Contrast for Object Detection Pre-Training
Point-Level Region Contrast for Object Detection Pre-Training
Yutong Bai
Xinlei Chen
Alexander Kirillov
Alan Yuille
Alexander C. Berg
3DPC
28
50
0
09 Feb 2022
Automated Distance Estimation for Wildlife Camera Trapping
Automated Distance Estimation for Wildlife Camera Trapping
Peter Johanns
T. Haucke
Volker Steinhage
23
17
0
09 Feb 2022
Distillation with Contrast is All You Need for Self-Supervised Point
  Cloud Representation Learning
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning
Kexue Fu
Peng Gao
Renrui Zhang
Hongsheng Li
Yu Qiao
Manning Wang
SSL
3DPC
22
23
0
09 Feb 2022
MaskGIT: Masked Generative Image Transformer
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
40
622
0
08 Feb 2022
Results and findings of the 2021 Image Similarity Challenge
Results and findings of the 2021 Image Similarity Challenge
Zoe Papakipos
Giorgos Tolias
Tomás Jenícek
Ed Pizzi
Shuhei Yokoo
...
Sanjay V. Addicam
S. M. Papadakis
Cristian Canton Ferrer
Ondřej Chum
Matthijs Douze
13
13
0
08 Feb 2022
How to Understand Masked Autoencoders
How to Understand Masked Autoencoders
Shuhao Cao
Peng-Tao Xu
David A. Clifton
29
40
0
08 Feb 2022
Transformers in Self-Supervised Monocular Depth Estimation with Unknown
  Camera Intrinsics
Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics
Arnav Varma
Hemang Chawla
Bahram Zonooz
Elahe Arani
ViT
MDE
36
49
0
07 Feb 2022
Machine Translation from Signed to Spoken Languages: State of the Art
  and Challenges
Machine Translation from Signed to Spoken Languages: State of the Art and Challenges
Mathieu De Coster
D. Shterionov
Mieke Van Herreweghe
J. Dambre
SLR
13
40
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
  Sequence-to-Sequence Learning Framework
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
53
850
0
07 Feb 2022
Bootstrapped Representation Learning for Skeleton-Based Action
  Recognition
Bootstrapped Representation Learning for Skeleton-Based Action Recognition
Olivier Moliner
Sangxia Huang
Kalle Åström
SSL
27
13
0
04 Feb 2022
AtmoDist: Self-supervised Representation Learning for Atmospheric
  Dynamics
AtmoDist: Self-supervised Representation Learning for Atmospheric Dynamics
Sebastian Hoffmann
C. Lessig
AI4Cl
24
8
0
02 Feb 2022
Mars Terrain Segmentation with Less Labels
Mars Terrain Segmentation with Less Labels
Edwin Y. Goh
Jingdao Chen
Brian Wilson
13
28
0
01 Feb 2022
Learning Super-Features for Image Retrieval
Learning Super-Features for Image Retrieval
Philippe Weinzaepfel
Thomas Lucas
Diane Larlus
Yannis Kalantidis
SupR
VLM
33
45
0
31 Jan 2022
Visual Representation Learning with Self-Supervised Attention for
  Low-Label High-data Regime
Visual Representation Learning with Self-Supervised Attention for Low-Label High-data Regime
Prarthana Bhattacharyya
Chenge Li
Xiaonan Zhao
István Fehérvári
Jason Sun
ViT
34
2
0
22 Jan 2022
Self-supervised Video Representation Learning with Cascade Positive
  Retrieval
Self-supervised Video Representation Learning with Cascade Positive Retrieval
Cheng-En Wu
Farley Lai
Yujie Hu
Asim Kadav
SSL
AI4TS
30
3
0
20 Jan 2022
Leveraging Real Talking Faces via Self-Supervision for Robust Forgery
  Detection
Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
A. Haliassos
Rodrigo Mira
Stavros Petridis
M. Pantic
CVBM
40
126
0
18 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Pushing the limits of self-supervised ResNets: Can we outperform
  supervised learning without labels on ImageNet?
Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?
Nenad Tomašev
Ioana Bica
Brian McWilliams
Lars Buesing
Razvan Pascanu
Charles Blundell
Jovana Mitrović
SSL
90
81
0
13 Jan 2022
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
Daiqing Li
Huan Ling
Seung Wook Kim
Karsten Kreis
Adela Barriuso
Sanja Fidler
Antonio Torralba
36
103
0
12 Jan 2022
Generalized Category Discovery
Generalized Category Discovery
S. Vaze
Kai Han
Andrea Vedaldi
Andrew Zisserman
38
188
0
07 Jan 2022
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors
  in MRI Images
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images
Ali Hatamizadeh
V. Nath
Yucheng Tang
Dong Yang
H. Roth
Daguang Xu
ViT
MedIm
21
1,059
0
04 Jan 2022
Splicing ViT Features for Semantic Appearance Transfer
Splicing ViT Features for Semantic Appearance Transfer
Narek Tumanyan
Omer Bar-Tal
Shai Bagon
Tali Dekel
DiffM
21
173
0
02 Jan 2022
Optimal Representations for Covariate Shift
Optimal Representations for Covariate Shift
Yangjun Ruan
Yann Dubois
Chris J. Maddison
OOD
25
68
0
31 Dec 2021
Augmenting Convolutional networks with attention-based aggregation
Augmenting Convolutional networks with attention-based aggregation
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Piotr Bojanowski
Armand Joulin
Gabriel Synnaeve
Hervé Jégou
ViT
38
47
0
27 Dec 2021
SLIP: Self-supervision meets Language-Image Pre-training
SLIP: Self-supervision meets Language-Image Pre-training
Norman Mu
Alexander Kirillov
David A. Wagner
Saining Xie
VLM
CLIP
60
479
0
23 Dec 2021
Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Alaaeldin El-Nouby
Gautier Izacard
Hugo Touvron
Ivan Laptev
Hervé Jégou
Edouard Grave
SSL
27
149
0
20 Dec 2021
Learning with Label Noise for Image Retrieval by Selecting Interactions
Learning with Label Noise for Image Retrieval by Selecting Interactions
Sarah Ibrahimi
Arnaud Sors
Rafael Sampaio de Rezende
S. Clinchant
NoLa
VLM
24
16
0
20 Dec 2021
High Fidelity Visualization of What Your Self-Supervised Representation
  Knows About
High Fidelity Visualization of What Your Self-Supervised Representation Knows About
Florian Bordes
Randall Balestriero
Pascal Vincent
DiffM
25
61
0
16 Dec 2021
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
88
655
0
16 Dec 2021
HODOR: High-level Object Descriptors for Object Re-segmentation in Video
  Learned from Static Images
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
VOS
27
25
0
16 Dec 2021
Ensembling Off-the-shelf Models for GAN Training
Ensembling Off-the-shelf Models for GAN Training
Nupur Kumari
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
34
86
0
16 Dec 2021
Unsupervised Dense Information Retrieval with Contrastive Learning
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
38
808
0
16 Dec 2021
Deep Hash Distillation for Image Retrieval
Deep Hash Distillation for Image Retrieval
Young Kyun Jang
Geonmo Gu
ByungSoo Ko
Isaac Kang
N. Cho
21
34
0
16 Dec 2021
FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time
  Wildland Fire Smoke Detection
FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection
Anshuman Dewangan
Yash Pande
Hans-Werner Braun
F. Vernon
Ismael Pérez
I. Altintas
G. Cottrell
M. H. Nguyen
14
45
0
16 Dec 2021
Towards General and Efficient Active Learning
Towards General and Efficient Active Learning
Yichen Xie
M. Tomizuka
Wei Zhan
VLM
35
10
0
15 Dec 2021
Deep ViT Features as Dense Visual Descriptors
Deep ViT Features as Dense Visual Descriptors
Shirzad Amir
Yossi Gandelsman
Shai Bagon
Tali Dekel
MDE
ViT
36
273
0
10 Dec 2021
Label, Verify, Correct: A Simple Few Shot Object Detection Method
Label, Verify, Correct: A Simple Few Shot Object Detection Method
Prannay Kaul
Weidi Xie
Andrew Zisserman
ObjD
17
81
0
10 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIP
VLM
40
687
0
08 Dec 2021
Previous
123...22232425
Next