v1v2 (latest)

Emerging Properties in Self-Supervised Vision Transformers

29 April 2021

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown

Title
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping Lucas Lehnert Sainbayar Sukhbaatar DiJia Su Qinqing Zheng Paul Mcvay Michael Rabbat Yuandong Tian 121 65 0 21 Feb 2024
Unsupervised Concept Discovery Mitigates Spurious Correlations Md Rifat Arefin Yan Zhang A. Baratin Francesco Locatello Irina Rish Dianbo Liu Kenji Kawaguchi 104 6 0 20 Feb 2024
Aria Everyday Activities Dataset Zhaoyang Lv Nickolas Charron Pierre Moulon Alexander Gamino Cheng Peng ... Yuyang Zou Richard Newcombe Jakob Julian Engel Xiaqing Pan Carl Ren 58 12 0 20 Feb 2024
A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu Gaurav Datta Huang Huang Will Panitch Jaimyn Drake Joseph Ortiz Mustafa Mukadam Mike Lambeta Roberto Calandra Ken Goldberg VLM 97 43 0 20 Feb 2024
DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models Norman Di Palo Edward Johns LM&Ro 106 30 0 20 Feb 2024
Slot-VLM: SlowFast Slots for Video-Language Modeling Jiaqi Xu Cuiling Lan Wenxuan Xie Xuejin Chen Yan Lu MLLM VLM 46 7 0 20 Feb 2024
Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong Junho Kim Yunjey Choi Gayoung Lee Youngjung Uh DiffM 90 43 0 20 Feb 2024
Object-level Geometric Structure Preserving for Natural Image Stitching Wenxiao Cai Wankou Yang 60 5 0 20 Feb 2024
Visual Reasoning in Object-Centric Deep Neural Networks: A Comparative Cognition Approach Guillermo Puebla Jeffrey S. Bowers OCL 86 0 0 20 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey Fabio Tosi Youming Zhang Ziren Gong Erik Sandström S. Mattoccia Martin R. Oswald Matteo Poggi 3DGS 214 64 0 20 Feb 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization James Oldfield Markos Georgopoulos Grigorios G. Chrysos Christos Tzelepis Yannis Panagakis M. Nicolaou Jiankang Deng Ioannis Patras MoE 128 10 0 19 Feb 2024
Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification Sebastian Doerrich Tobias Archut Francesco Di Salvo Christian Ledig 61 4 0 19 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey Davide Caffagni Federico Cocchi Luca Barsellotti Nicholas Moratelli Sara Sarto Lorenzo Baraldi Lorenzo Baraldi Marcella Cornia Rita Cucchiara LRM VLM 139 64 0 19 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image Yan Hong Jianfu Zhang DiffM 107 3 0 19 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review Thang-Anh-Quan Nguyen Amine Bourki Mátyás Macudzinski Anthony Brunel M. Bennamoun 134 13 0 17 Feb 2024
Revisiting Feature Prediction for Learning Visual Representations from Video Adrien Bardes Q. Garrido Jean Ponce Xinlei Chen Michael G. Rabbat Yann LeCun Mahmoud Assran Nicolas Ballas MDE VLM 160 87 0 15 Feb 2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang Xiaoman Pan Feng Luo Shuang Qiu Han Zhong Dong Yu Jianshu Chen 233 83 0 15 Feb 2024
SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention Romain Ilbert Ambroise Odonnat Vasilii Feofanov Aladin Virmaux Giuseppe Paolo Themis Palpanas I. Redko AI4TS 128 29 0 15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization Jisu Nam Heesu Kim Dongjae Lee Siyoon Jin Seungryong Kim Seunggyu Chang DiffM 111 44 0 15 Feb 2024
Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma Daquan Zhou Chun-Hsiao Yeh Xue-She Wang Xiuyu Li Huanrui Yang Zhen Dong Kurt Keutzer Jiashi Feng VGen DiffM 86 32 0 14 Feb 2024
Affine transformation estimation improves visual self-supervised learning David Torpey Richard Klein SSL 46 1 0 14 Feb 2024
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi Iro Laina Christian Rupprecht Natalia Neverova Andrea Vedaldi Oran Gafni Filippos Kokkinos 3DGS 106 68 0 13 Feb 2024
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs Michael Fischer Zhengqin Li Thu Nguyen-Phuoc Aljaz Bozic Zhao Dong Carl S. Marshall Tobias Ritschel 81 11 0 13 Feb 2024
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection Colin Decourt R. V. Rullen D. Salle Thomas Oberlin SSL 70 0 0 13 Feb 2024
Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification Yuning Huang Jingchen Zou Lanxi Meng Xin Yue Qing Zhao Jianqiang Li Changwei Song Gabriel Jimenez Shaowu Li Guanghui Fu 104 13 0 12 Feb 2024
Discriminative Adversarial Unlearning Rohan Sharma Shijie Zhou Kaiyi Ji Changyou Chen MU 76 1 0 10 Feb 2024
A self-supervised framework for learning whole slide representations X. Hou Cheng Jiang A. Kondepudi Yiwei Lyu Asadur Chowdury Honglak Lee Todd C. Hollon MedIm 96 6 0 09 Feb 2024
ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling Siming Yan Min Bai Weifeng Chen Xiong Zhou Qixing Huang Erran L. Li VLM 57 20 0 09 Feb 2024
AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers Reduan Achtibat Sayed Mohammad Vakilzadeh Hatefi Maximilian Dreyer Aakriti Jain Thomas Wiegand Sebastian Lapuschkin Wojciech Samek 100 37 0 08 Feb 2024
Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts Zhili Liu Kai Chen Jianhua Han Lanqing Hong Hang Xu Zhenguo Li James T. Kwok MoE 188 25 0 08 Feb 2024
InCoRo: In-Context Learning for Robotics Control with Feedback Loops Jiaqiang Ye Zhu Carla Gomez Cano David Vazquez Bermudez Michal Drozdzal LRM 77 8 0 07 Feb 2024
OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding Guibiao Liao Kaichen Zhou Zhenyu Bao Kanglin Liu Qing Li VLM 66 23 0 07 Feb 2024
GSN: Generalisable Segmentation in Neural Radiance Field Vinayak Gupta Rahul Goel Dhawal Sirikonda P. J. Narayanan 70 1 0 07 Feb 2024
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry Michael Zhang Kush S. Bhatia Hermann Kumbong Christopher Ré 80 54 0 06 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation Weiming Ren Harry Yang Ge Zhang Cong Wei Xinrun Du Stephen W. Huang Wenhu Chen DiffM VGen 126 66 0 06 Feb 2024
Human-Like Geometric Abstraction in Large Pre-trained Neural Networks Declan Campbell Sreejan Kumar Tyler Giallanza Thomas Griffiths Jonathan D. Cohen GNN OCL 68 3 0 06 Feb 2024
Pre-training of Lightweight Vision Transformers on Small Datasets with Minimally Scaled Images Jen Hong Tan ViT 26 3 0 06 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts Kun Wang Hao Wu Guibin Zhang Sihang Li Yuxuan Liang Yuankai Wu Roger Zimmermann Yang Wang 82 11 0 06 Feb 2024
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection Shengcao Cao Dhiraj Joshi Liangyan Gui Yu Wang 92 11 0 05 Feb 2024
Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations Stefan Sylvius Wagner Stefan Harmeling 73 2 0 05 Feb 2024
Good Teachers Explain: Explanation-Enhanced Knowledge Distillation Amin Parchami-Araghi Moritz Bohle Sukrut Rao Bernt Schiele FAtt 61 4 0 05 Feb 2024
Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing Zihan Ma Yongshang Li Ronggui Ma Chen Liang 64 2 0 05 Feb 2024
Enhancing Compositional Generalization via Compositional Feature Alignment Haoxiang Wang Haozhe Si Huajie Shao Han Zhao 115 2 0 05 Feb 2024
Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen Oier Mees Aviral Kumar Sergey Levine VLM LM&Ro 134 29 0 05 Feb 2024
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning Haoyi Zhu Yating Wang Di Huang Weicai Ye Wanli Ouyang Tong He SSL 3DPC 158 25 0 04 Feb 2024
Deep Spectral Improvement for Unsupervised Image Instance Segmentation Farnoosh Arefi Amir M. Mansourian S. Kasaei ISeg 95 1 0 04 Feb 2024
Review of multimodal machine learning approaches in healthcare "Felix H. Krones Umar Marikkar Guy Parsons Adam Szmul Adam Mahdi 129 34 0 04 Feb 2024
COMPRER: A Multimodal Multi-Objective Pretraining Framework for Enhanced Medical Image Representation Guy Lutsker H. Rossman Nastya Godiva E. Segal MedIm 101 1 0 04 Feb 2024
Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation P. Singh Jacopo Cirrone 91 0 0 04 Feb 2024
Region-Based Representations Revisited Michal Shlapentokh-Rothman Ansel Blume Yao Xiao Yuqun Wu TV Sethuraman Heyi Tao Jae Yong Lee Wilfredo Torres Yu-Xiong Wang Derek Hoiem 124 12 0 04 Feb 2024