arXiv:2104.14294
Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation
Chang Liu
Giulia Rizzoli
Pietro Zanuttigh
Fu Li
Yi Niu
CLL
122
2
0
18 Jul 2024
Multi-sentence Video Grounding for Long Video Generation
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Wenwu Zhu
DiffM
71
0
0
18 Jul 2024
OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping
Meng Li
Qi Zhao
Shuchang Lyu
Chunlei Wang
Yujing Ma
Guangliang Cheng
Chenguang Yang
113
5
0
18 Jul 2024
Audio-visual Generalized Zero-shot Learning the Easy Way
Shentong Mo
Pedro Morgado
61
5
0
18 Jul 2024
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
Carlos Hinojosa
Shuming Liu
Guohao Li
67
2
0
17 Jul 2024
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan
Weidi Xie
RALM
141
14
0
17 Jul 2024
InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction
Xulong Wang
Siyan Dong
Youyi Zheng
Yanchao Yang
91
1
0
17 Jul 2024
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks
Antoni Kowalczuk
Jan Dubiński
Atiyeh Ashari Ghomi
Yi Sui
George Stein
Jiapeng Wu
Jesse C. Cresswell
Franziska Boenisch
Adam Dziedzic
SSL
AAML
75
3
0
17 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
90
1
0
17 Jul 2024
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Mengcheng Lan
Chaofeng Chen
Yiping Ke
Xinjiang Wang
Xue Jiang
Wayne Zhang
VLM
117
29
0
17 Jul 2024
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features
Luc P.J. Strater
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
83
9
0
17 Jul 2024
R+X: Retrieval and Execution from Everyday Human Videos
Georgios Papagiannis
Norman Di Palo
Pietro Vitiello
Edward Johns
144
18
0
17 Jul 2024
Generalized Coverage for More Robust Low-Budget Active Learning
Wonho Bae
Junhyug Noh
Danica J. Sutherland
129
4
0
16 Jul 2024
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification
Markus Marks
Manuel Knott
Neehar Kondapaneni
Elijah Cole
T. Defraeye
Fernando Pérez-Cruz
Pietro Perona
SSL
125
5
0
16 Jul 2024
CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning
Erum Mushtaq
D. Yaldiz
Yavuz Faruk Bakman
Jie Ding
Chenyang Tao
Dimitrios Dimitriadis
A. Avestimehr
CLL
85
1
0
16 Jul 2024
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
Abdullah Rashwan
Yeqing Li
EGVM
57
5
0
16 Jul 2024
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi
H. Song
Jaechul Kim
Taehyeong Kim
Hoseok Do
3DGS
105
23
0
16 Jul 2024
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression
Jinming Liu
Ruoyu Feng
Yunpeng Qi
Qiuyu Chen
Zhibo Chen
Wenjun Zeng
Xin Jin
89
2
0
16 Jul 2024
DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training
Guillermo Jiménez-Pérez
Pedro Osório
Josef Cersovsky
Javier Montalt-Tordera
Jens Hooge
Steffen Vogler
Sadegh Mohammadi
MedIm
94
2
0
16 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
91
6
0
16 Jul 2024
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Yanqin Jiang
Chaohui Yu
Chenjie Cao
Fan Wang
Weiming Hu
Jin Gao
VGen
80
19
0
16 Jul 2024
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel
Ron Shapira Weber
Nir Mualem
Shahaf E. Finder
Oren Freifeld
171
2
0
16 Jul 2024
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas
Georgios Tzimiropoulos
Ioannis Patras
SSL
104
0
0
15 Jul 2024
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences
Soroush Mehraban
Mohammad Javad Rajabi
Babak Taati
3DPC
102
1
0
15 Jul 2024
DINO Pre-training for Vision-based End-to-end Autonomous Driving
Shubham Juneja
P. Daniušis
Virginijus Marcinkevičius
93
2
0
15 Jul 2024
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Rining Wu
Feixiang Zhou
Ziwei Yin
Jian K. Liu
70
0
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
96
0
0
15 Jul 2024
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning
A. A. Salah
Itir Onal Ertugrul
CVBM
178
5
0
15 Jul 2024
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels
Yinsheng He
Xingyu Li
Roger J. Zemp
VLM
98
0
0
14 Jul 2024
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification
Sreeraj Ramachandran
A. Rattani
74
1
0
14 Jul 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
110
1
0
14 Jul 2024
CLOVER: Context-aware Long-term Object Viewpoint- and Environment-Invariant Representation Learning
Dongmyeong Lee
Amanda Adkins
Joydeep Biswas
99
0
0
12 Jul 2024
3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
Anh Thai
Weiyao Wang
Hao Tang
Stefan Stojanov
Matt Feiszli
James M. Rehg
3DPC
104
6
0
12 Jul 2024
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Sahil Jain
Avik Kuthiala
P. Sethi
Prakanshul Saxena
3DGS
74
5
0
12 Jul 2024
iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning
Tom Fischer
Yaoyao Liu
Artur Jesslen
Noor Ahmed
Prakhar Kaushik
Angtian Wang
Alan Yuille
Adam Kortylewski
Eddy Ilg
CLL
AI4CE
81
1
0
12 Jul 2024
On the Role of Discrete Tokenization in Visual Representation Learning
Tianqi Du
Yifei Wang
Yisen Wang
101
7
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
134
5
0
12 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
81
6
0
12 Jul 2024
Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT
Jie Zheng
Ru Wen
Haiqin Hu
Lina Wei
Kui Su
Wei Chen
Chen Liu
Jun Wang
91
1
0
12 Jul 2024
Bora: Biomedical Generalist Video Generation Model
Weixiang Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Xiang Li
Lifang He
Quanzheng Li
Lichao Sun
VGen
MedIm
83
9
0
12 Jul 2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
64
1
0
11 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schönlieb
Angelica I Aviles-Rivero
VLM
85
2
0
11 Jul 2024
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
Yanfu Yan
Nathan Cooper
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
87
8
0
11 Jul 2024
Paving the way toward foundation models for irregular and unaligned Satellite Image Time Series
Iris Dumeur
Silvia Valero
Jordi Inglada
111
3
0
11 Jul 2024
WildGaussians: 3D Gaussian Splatting in the Wild
Jonáš Kulhánek
Songyou Peng
Zuzana Kukelova
Marc Pollefeys
Torsten Sattler
3DGS
161
51
0
11 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
114
16
0
11 Jul 2024
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
Pu Feng
Junkang Liang
Size Wang
Xin Yu
Xin Ji
Yiting Chen
Kui Zhang
Rongye Shi
Wenjun Wu
122
7
0
11 Jul 2024
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search
Kirill Paramonov
Jia-Xing Zhong
Umberto Michieli
J. Moon
Mete Ozay
121
2
0
10 Jul 2024
Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Kun-Hsuan Wu
Zhiguo Jiang
Kunming Tang
Jun Shi
Fengying Xie
Wei Wang
Haibo Wu
Yushan Zheng
43
1
0
10 Jul 2024
Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining
Tianfang Sun
Zhizhong Zhang
Xin Tan
Yanyun Qu
Yuan Xie
109
0
0
10 Jul 2024