Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
128
1
0
16 Oct 2024
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jiaxin Lu
Gang Hua
Qixing Huang
59
2
0
15 Oct 2024
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing
Zhiyuan Zhang
Dongdong Chen
J. Liao
DiffM
122
3
0
15 Oct 2024
Visual Fixation-Based Retinal Prosthetic Simulation
Yuli Wu
Do Dinh Tan Nguyen
Henning Konermann
Rüveyda Yilmaz
Peter Walter
Johannes Stegmaier
54
0
0
15 Oct 2024
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
Kun Ding
Ying Wang
Gaofeng Meng
Shiming Xiang
VLM
78
0
0
15 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
73
0
0
15 Oct 2024
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
Jiayi Lin
Jiabo Huang
Jian Hu
S. Gong
DiffM
VLM
105
0
0
15 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
Yingjun Shen
Haizhao Dai
Qihe Chen
Yan Zeng
Jiakai Zhang
Yuan Pei
Jingyi Yu
118
3
0
15 Oct 2024
Investigation of Speaker Representation for Target-Speaker Speech Processing
Takanori Ashihara
Takafumi Moriya
Shota Horiguchi
Junyi Peng
Tsubasa Ochiai
Marc Delcroix
Kohei Matsuura
Hiroshi Sato
66
1
0
15 Oct 2024
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
75
2
0
15 Oct 2024
Tackling Dimensional Collapse toward Comprehensive Universal Domain Adaptation
Hung-Chieh Fang
Po-Yi Lu
Hsuan-Tien Lin
39
0
0
15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
A. Amadou
Yanzhe Zhang
Sebastien Piat
Paul Klein
Ingo Schmuecking
Tiziano Passerini
Puneet Sharma
91
5
0
14 Oct 2024
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
Tianwei Xiong
Yuqing Wang
Daquan Zhou
Zhijie Lin
Jiashi Feng
Xihui Liu
VGen
106
10
0
14 Oct 2024
Artificial Intelligence-Based Triaging of Cutaneous Melanocytic Lesions
R. Lucassen
N. Stathonikos
Gerben E. Breimer
M. Veta
W. Blokx
34
2
0
14 Oct 2024
Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
Boheng Li
Yanhao Wei
Yankai Fu
Ziyi Wang
Yiming Li
Jie Zhang
Run Wang
Tianwei Zhang
DiffM
AAML
67
11
0
14 Oct 2024
V2M: Visual 2-Dimensional Mamba for Image Representation Learning
Chengkun Wang
Wenzhao Zheng
Yuanhui Huang
Jie Zhou
Jiwen Lu
Mamba
37
2
0
14 Oct 2024
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
Shunsuke Sakai
Tatushito Hasegawa
Makoto Koshino
85
1
0
14 Oct 2024
Learning to Customize Text-to-Image Diffusion In Diverse Context
Taewook Kim
Wei Chen
Qiang Qiu
DiffM
60
2
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
265
7
0
14 Oct 2024
Exploring Semi-Supervised Learning for Online Mapping
Adam Lilja
Erik Wallin
Junsheng Fu
Lars Hammarstrand
SSL
137
1
0
14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Linshan Wu
Jiaxin Zhuang
Hao Chen
90
6
0
13 Oct 2024
TextMaster: Universal Controllable Text Edit
Aoqiang Wang
Jiangming Wang
Zhenyu Yan
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
56
2
0
13 Oct 2024
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Xilin He
Cheng Luo
Xiaole Xian
Bing Li
Siyang Song
Muhammad Haris Khan
Weicheng Xie
Linlin Shen
Zongyuan Ge
87
4
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
63
1
0
13 Oct 2024
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Yang Zhou
Hao Shao
Letian Wang
Steven Waslander
Hongsheng Li
Yu Liu
84
2
0
11 Oct 2024
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
Shengcao Cao
Liang-Yan Gui
Yu-Xiong Wang
85
3
0
10 Oct 2024
Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation
Zhiyi Pan
Wei-Nan Gao
Shan Liu
Ge Li
64
2
0
10 Oct 2024
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter
Yumiao Zhao
Bo Jiang
Xiao Wang
Qin Xu
Jin Tang
VLM
85
1
0
10 Oct 2024
FLIER: Few-shot Language Image Models Embedded with Latent Representations
Zhinuo Zhou
Peng Zhou
Xiaoyong Pan
VLM
40
0
0
10 Oct 2024
RNA: Video Editing with ROI-based Neural Atlas
Jaekyeong Lee
Geonung Kim
Sunghyun Cho
VGen
54
1
0
10 Oct 2024
O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out
Mısra Yavuz
Fatma Guney
74
0
0
10 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
126
9
0
10 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
150
11
0
10 Oct 2024
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features
Po-han Li
Sandeep Chinchali
Ufuk Topcu
118
2
0
10 Oct 2024
3D Vision-Language Gaussian Splatting
Qucheng Peng
Benjamin Planche
Zhongpai Gao
Meng Zheng
Anwesa Choudhuri
Terrence Chen
Chong Chen
Ziyan Wu
3DGS
84
6
0
10 Oct 2024
Chain-of-Sketch: Enabling Global Visual Reasoning
Aryo Lotfi
Enrico Fini
Samy Bengio
Moin Nabi
Emmanuel Abbe
LRM
92
0
0
10 Oct 2024
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
114
8
0
09 Oct 2024
Self-Supervised Learning for Real-World Object Detection: a Survey
Alina Ciocarlan
Sidonie Lefebvre
S. L. Hégarat-Mascle
Arnaud Woiselle
ObjD
94
1
0
09 Oct 2024
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Yukang Cao
Liang Pan
Kai Han
Kwan-Yee K. Wong
Ziwei Liu
VGen
129
6
0
09 Oct 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
Benyuan Meng
Qianqian Xu
Zitai Wang
Zhiyong Yang
Xiaochun Cao
Qingming Huang
96
0
0
09 Oct 2024
Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
Shijie Ma
Fei Zhu
Zhun Zhong
Wenzhuo Liu
Xu-Yao Zhang
Cheng-Lin Liu
CLL
87
9
0
09 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Stephen Hausler
Peyman Moghadam
SSL
ViT
68
4
0
09 Oct 2024
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
Cheol Jun Cho
Nicholas Lee
Akshat Gupta
Dhruv Agarwal
Ethan Chen
Alan W Black
Gopala K. Anumanchipalli
91
4
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
184
102
0
09 Oct 2024
Towards Unsupervised Eye-Region Segmentation for Eye Tracking
Jiangfan Deng
Zhuang Jia
Zhaoxue Wang
Xiang Long
Daniel K. Du
57
1
0
08 Oct 2024
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
Kovvuri Sai Gopal Reddy
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
Snehasis Mukherjee
110
1
0
08 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD
VLM
81
7
0
08 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
38
0
0
08 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
128
26
0
08 Oct 2024
Variable Bitrate Residual Vector Quantization for Audio Coding
Yunkee Chae
Woosung Choi
Yuhta Takida
Junghyun Koo
Yukara Ikemiya
...
K. Cheuk
Marco A. Martínez-Ramírez
Kyogu Lee
Wei-Hsiang Liao
Yuki Mitsufuji
146
2
0
08 Oct 2024
Previous
1
2
3
...
18
19
20
...
82
83
84
Next