Emerging Properties in Self-Supervised Vision Transformers

arXiv:2104.14294, versions v1 and v2 (latest)
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown
Title
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert
Sainbayar Sukhbaatar
DiJia Su
Qinqing Zheng
Paul Mcvay
Michael Rabbat
Yuandong Tian
121
65
0
21 Feb 2024
Unsupervised Concept Discovery Mitigates Spurious Correlations
Md Rifat Arefin
Yan Zhang
A. Baratin
Francesco Locatello
Irina Rish
Dianbo Liu
Kenji Kawaguchi
104
6
0
20 Feb 2024
Aria Everyday Activities Dataset
Zhaoyang Lv
Nickolas Charron
Pierre Moulon
Alexander Gamino
Cheng Peng
...
Yuyang Zou
Richard Newcombe
Jakob Julian Engel
Xiaqing Pan
Carl Ren
58
12
0
20 Feb 2024
A Touch, Vision, and Language Dataset for Multimodal Alignment
Letian Fu
Gaurav Datta
Huang Huang
Will Panitch
Jaimyn Drake
Joseph Ortiz
Mustafa Mukadam
Mike Lambeta
Roberto Calandra
Ken Goldberg
VLM
97
43
0
20 Feb 2024
DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models
Norman Di Palo
Edward Johns
LM&Ro
106
30
0
20 Feb 2024
Slot-VLM: SlowFast Slots for Video-Language Modeling
Jiaqi Xu
Cuiling Lan
Wenxuan Xie
Xuejin Chen
Yan Lu
MLLM, VLM
46
7
0
20 Feb 2024
Visual Style Prompting with Swapping Self-Attention
Jaeseok Jeong
Junho Kim
Yunjey Choi
Gayoung Lee
Youngjung Uh
DiffM
90
43
0
20 Feb 2024
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai
Wankou Yang
60
5
0
20 Feb 2024
Visual Reasoning in Object-Centric Deep Neural Networks: A Comparative Cognition Approach
Guillermo Puebla
Jeffrey S. Bowers
OCL
86
0
0
20 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
214
64
0
20 Feb 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
James Oldfield
Markos Georgopoulos
Grigorios G. Chrysos
Christos Tzelepis
Yannis Panagakis
M. Nicolaou
Jiankang Deng
Ioannis Patras
MoE
128
10
0
19 Feb 2024
Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification
Sebastian Doerrich
Tobias Archut
Francesco Di Salvo
Christian Ledig
61
4
0
19 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM, VLM
139
64
0
19 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong
Jianfu Zhang
DiffM
107
3
0
19 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
134
13
0
17 Feb 2024
Revisiting Feature Prediction for Learning Visual Representations from Video
Adrien Bardes
Q. Garrido
Jean Ponce
Xinlei Chen
Michael G. Rabbat
Yann LeCun
Mahmoud Assran
Nicolas Ballas
MDE, VLM
160
87
0
15 Feb 2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Rui Yang
Xiaoman Pan
Feng Luo
Shuang Qiu
Han Zhong
Dong Yu
Jianshu Chen
233
83
0
15 Feb 2024
SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention
Romain Ilbert
Ambroise Odonnat
Vasilii Feofanov
Aladin Virmaux
Giuseppe Paolo
Themis Palpanas
I. Redko
AI4TS
128
29
0
15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
111
44
0
15 Feb 2024
Magic-Me: Identity-Specific Video Customized Diffusion
Ze Ma
Daquan Zhou
Chun-Hsiao Yeh
Xue-She Wang
Xiuyu Li
Huanrui Yang
Zhen Dong
Kurt Keutzer
Jiashi Feng
VGen, DiffM
86
32
0
14 Feb 2024
Affine transformation estimation improves visual self-supervised learning
David Torpey
Richard Klein
SSL
46
1
0
14 Feb 2024
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation
Luke Melas-Kyriazi
Iro Laina
Christian Rupprecht
Natalia Neverova
Andrea Vedaldi
Oran Gafni
Filippos Kokkinos
3DGS
106
68
0
13 Feb 2024
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
Michael Fischer
Zhengqin Li
Thu Nguyen-Phuoc
Aljaz Bozic
Zhao Dong
Carl S. Marshall
Tobias Ritschel
81
11
0
13 Feb 2024
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection
Colin Decourt
R. V. Rullen
D. Salle
Thomas Oberlin
SSL
70
0
0
13 Feb 2024
Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification
Yuning Huang
Jingchen Zou
Lanxi Meng
Xin Yue
Qing Zhao
Jianqiang Li
Changwei Song
Gabriel Jimenez
Shaowu Li
Guanghui Fu
104
13
0
12 Feb 2024
Discriminative Adversarial Unlearning
Rohan Sharma
Shijie Zhou
Kaiyi Ji
Changyou Chen
MU
76
1
0
10 Feb 2024
A self-supervised framework for learning whole slide representations
X. Hou
Cheng Jiang
A. Kondepudi
Yiwei Lyu
Asadur Chowdury
Honglak Lee
Todd C. Hollon
MedIm
96
6
0
09 Feb 2024
ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling
Siming Yan
Min Bai
Weifeng Chen
Xiong Zhou
Qixing Huang
Erran L. Li
VLM
57
20
0
09 Feb 2024
AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers
Reduan Achtibat
Sayed Mohammad Vakilzadeh Hatefi
Maximilian Dreyer
Aakriti Jain
Thomas Wiegand
Sebastian Lapuschkin
Wojciech Samek
100
37
0
08 Feb 2024
Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Zhili Liu
Kai Chen
Jianhua Han
Lanqing Hong
Hang Xu
Zhenguo Li
James T. Kwok
MoE
188
25
0
08 Feb 2024
InCoRo: In-Context Learning for Robotics Control with Feedback Loops
Jiaqiang Ye Zhu
Carla Gomez Cano
David Vazquez Bermudez
Michal Drozdzal
LRM
77
8
0
07 Feb 2024
OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
Guibiao Liao
Kaichen Zhou
Zhenyu Bao
Kanglin Liu
Qing Li
VLM
66
23
0
07 Feb 2024
GSN: Generalisable Segmentation in Neural Radiance Field
Vinayak Gupta
Rahul Goel
Dhawal Sirikonda
P. J. Narayanan
70
1
0
07 Feb 2024
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang
Kush S. Bhatia
Hermann Kumbong
Christopher Ré
80
54
0
06 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffM, VGen
126
66
0
06 Feb 2024
Human-Like Geometric Abstraction in Large Pre-trained Neural Networks
Declan Campbell
Sreejan Kumar
Tyler Giallanza
Thomas Griffiths
Jonathan D. Cohen
GNN, OCL
68
3
0
06 Feb 2024
Pre-training of Lightweight Vision Transformers on Small Datasets with Minimally Scaled Images
Jen Hong Tan
ViT
26
3
0
06 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
82
11
0
06 Feb 2024
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
Shengcao Cao
Dhiraj Joshi
Liangyan Gui
Yu Wang
92
11
0
05 Feb 2024
Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations
Stefan Sylvius Wagner
Stefan Harmeling
73
2
0
05 Feb 2024
Good Teachers Explain: Explanation-Enhanced Knowledge Distillation
Amin Parchami-Araghi
Moritz Bohle
Sukrut Rao
Bernt Schiele
FAtt
61
4
0
05 Feb 2024
Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing
Zihan Ma
Yongshang Li
Ronggui Ma
Chen Liang
64
2
0
05 Feb 2024
Enhancing Compositional Generalization via Compositional Feature Alignment
Haoxiang Wang
Haozhe Si
Huajie Shao
Han Zhao
115
2
0
05 Feb 2024
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen
Oier Mees
Aviral Kumar
Sergey Levine
VLM, LM&Ro
134
29
0
05 Feb 2024
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Haoyi Zhu
Yating Wang
Di Huang
Weicai Ye
Wanli Ouyang
Tong He
SSL, 3DPC
158
25
0
04 Feb 2024
Deep Spectral Improvement for Unsupervised Image Instance Segmentation
Farnoosh Arefi
Amir M. Mansourian
S. Kasaei
ISeg
95
1
0
04 Feb 2024
Review of multimodal machine learning approaches in healthcare
Felix H. Krones
Umar Marikkar
Guy Parsons
Adam Szmul
Adam Mahdi
129
34
0
04 Feb 2024
COMPRER: A Multimodal Multi-Objective Pretraining Framework for Enhanced Medical Image Representation
Guy Lutsker
H. Rossman
Nastya Godiva
E. Segal
MedIm
101
1
0
04 Feb 2024
Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation
P. Singh
Jacopo Cirrone
91
0
0
04 Feb 2024
Region-Based Representations Revisited
Michal Shlapentokh-Rothman
Ansel Blume
Yao Xiao
Yuqun Wu
TV Sethuraman
Heyi Tao
Jae Yong Lee
Wilfredo Torres
Yu-Xiong Wang
Derek Hoiem
124
12
0
04 Feb 2024