Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Title
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM CLIP
128
1
0
16 Oct 2024
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jiaxin Lu
Gang Hua
Qixing Huang
59
2
0
15 Oct 2024
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing
Zhiyuan Zhang
Dongdong Chen
J. Liao
DiffM
122
3
0
15 Oct 2024
Visual Fixation-Based Retinal Prosthetic Simulation
Yuli Wu
Do Dinh Tan Nguyen
Henning Konermann
Rüveyda Yilmaz
Peter Walter
Johannes Stegmaier
54
0
0
15 Oct 2024
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
Kun Ding
Ying Wang
Gaofeng Meng
Shiming Xiang
VLM
78
0
0
15 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
73
0
0
15 Oct 2024
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
Jiayi Lin
Jiabo Huang
Jian Hu
S. Gong
DiffM VLM
105
0
0
15 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
Yingjun Shen
Haizhao Dai
Qihe Chen
Yan Zeng
Jiakai Zhang
Yuan Pei
Jingyi Yu
118
3
0
15 Oct 2024
Investigation of Speaker Representation for Target-Speaker Speech Processing
Takanori Ashihara
Takafumi Moriya
Shota Horiguchi
Junyi Peng
Tsubasa Ochiai
Marc Delcroix
Kohei Matsuura
Hiroshi Sato
66
1
0
15 Oct 2024
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
75
2
0
15 Oct 2024
Tackling Dimensional Collapse toward Comprehensive Universal Domain Adaptation
Hung-Chieh Fang
Po-Yi Lu
Hsuan-Tien Lin
39
0
0
15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
A. Amadou
Yanzhe Zhang
Sebastien Piat
Paul Klein
Ingo Schmuecking
Tiziano Passerini
Puneet Sharma
91
5
0
14 Oct 2024
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
Tianwei Xiong
Yuqing Wang
Daquan Zhou
Zhijie Lin
Jiashi Feng
Xihui Liu
VGen
106
10
0
14 Oct 2024
Artificial Intelligence-Based Triaging of Cutaneous Melanocytic Lesions
R. Lucassen
N. Stathonikos
Gerben E. Breimer
M. Veta
W. Blokx
34
2
0
14 Oct 2024
Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
Boheng Li
Yanhao Wei
Yankai Fu
Ziyi Wang
Yiming Li
Jie Zhang
Run Wang
Tianwei Zhang
DiffM AAML
67
11
0
14 Oct 2024
V2M: Visual 2-Dimensional Mamba for Image Representation Learning
Chengkun Wang
Wenzhao Zheng
Yuanhui Huang
Jie Zhou
Jiwen Lu
Mamba
37
2
0
14 Oct 2024
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
Shunsuke Sakai
Tatsuhito Hasegawa
Makoto Koshino
85
1
0
14 Oct 2024
Learning to Customize Text-to-Image Diffusion In Diverse Context
Taewook Kim
Wei Chen
Qiang Qiu
DiffM
60
2
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
265
7
0
14 Oct 2024
Exploring Semi-Supervised Learning for Online Mapping
Adam Lilja
Erik Wallin
Junsheng Fu
Lars Hammarstrand
SSL
137
1
0
14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Linshan Wu
Jiaxin Zhuang
Hao Chen
90
6
0
13 Oct 2024
TextMaster: Universal Controllable Text Edit
Aoqiang Wang
Jiangming Wang
Zhenyu Yan
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
56
2
0
13 Oct 2024
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Xilin He
Cheng Luo
Xiaole Xian
Bing Li
Siyang Song
Muhammad Haris Khan
Weicheng Xie
Linlin Shen
Zongyuan Ge
87
4
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
63
1
0
13 Oct 2024
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Yang Zhou
Hao Shao
Letian Wang
Steven Waslander
Hongsheng Li
Yu Liu
84
2
0
11 Oct 2024
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
Shengcao Cao
Liang-Yan Gui
Yu-Xiong Wang
85
3
0
10 Oct 2024
Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation
Zhiyi Pan
Wei-Nan Gao
Shan Liu
Ge Li
64
2
0
10 Oct 2024
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter
Yumiao Zhao
Bo Jiang
Xiao Wang
Qin Xu
Jin Tang
VLM
85
1
0
10 Oct 2024
FLIER: Few-shot Language Image Models Embedded with Latent Representations
Zhinuo Zhou
Peng Zhou
Xiaoyong Pan
VLM
40
0
0
10 Oct 2024
RNA: Video Editing with ROI-based Neural Atlas
Jaekyeong Lee
Geonung Kim
Sunghyun Cho
VGen
54
1
0
10 Oct 2024
O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out
Mısra Yavuz
Fatma Guney
74
0
0
10 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
126
9
0
10 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
150
11
0
10 Oct 2024
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features
Po-han Li
Sandeep Chinchali
Ufuk Topcu
118
2
0
10 Oct 2024
3D Vision-Language Gaussian Splatting
Qucheng Peng
Benjamin Planche
Zhongpai Gao
Meng Zheng
Anwesa Choudhuri
Terrence Chen
Chong Chen
Ziyan Wu
3DGS
84
6
0
10 Oct 2024
Chain-of-Sketch: Enabling Global Visual Reasoning
Aryo Lotfi
Enrico Fini
Samy Bengio
Moin Nabi
Emmanuel Abbe
LRM
92
0
0
10 Oct 2024
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
114
8
0
09 Oct 2024
Self-Supervised Learning for Real-World Object Detection: a Survey
Alina Ciocarlan
Sidonie Lefebvre
S. L. Hégarat-Mascle
Arnaud Woiselle
ObjD
94
1
0
09 Oct 2024
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Yukang Cao
Liang Pan
Kai Han
Kwan-Yee K. Wong
Ziwei Liu
VGen
129
6
0
09 Oct 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
Benyuan Meng
Qianqian Xu
Zitai Wang
Zhiyong Yang
Xiaochun Cao
Qingming Huang
96
0
0
09 Oct 2024
Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
Shijie Ma
Fei Zhu
Zhun Zhong
Wenzhuo Liu
Xu-Yao Zhang
Cheng-Lin Liu
CLL
87
9
0
09 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Stephen Hausler
Peyman Moghadam
SSL ViT
68
4
0
09 Oct 2024
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
Cheol Jun Cho
Nicholas Lee
Akshat Gupta
Dhruv Agarwal
Ethan Chen
Alan W Black
Gopala K. Anumanchipalli
91
4
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
184
102
0
09 Oct 2024
Towards Unsupervised Eye-Region Segmentation for Eye Tracking
Jiangfan Deng
Zhuang Jia
Zhaoxue Wang
Xiang Long
Daniel K. Du
57
1
0
08 Oct 2024
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
Kovvuri Sai Gopal Reddy
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
Snehasis Mukherjee
110
1
0
08 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD VLM
81
7
0
08 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
38
0
0
08 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
128
26
0
08 Oct 2024
Variable Bitrate Residual Vector Quantization for Audio Coding
Yunkee Chae
Woosung Choi
Yuhta Takida
Junghyun Koo
Yukara Ikemiya
...
K. Cheuk
Marco A. Martínez-Ramírez
Kyogu Lee
Wei-Hsiang Liao
Yuki Mitsufuji
146
2
0
08 Oct 2024