Title
Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and Challenges Zhe Jiang 28 0 0 30 Oct 2023
Adversarial Attacks and Defenses in Large Language Models: Old and New Threats Leo Schwinn David Dobre Stephan Günnemann Gauthier Gidel AAML ELM 29 39 0 30 Oct 2023
A Survey on Knowledge Editing of Neural Networks Vittorio Mazzia Alessandro Pedrani Andrea Caciolai Kay Rottmann Davide Bernardi KELM 20 25 0 30 Oct 2023
HyPE: Attention with Hyperbolic Biases for Relative Positional Encoding Giorgio Angelotti 16 0 0 30 Oct 2023
Are Natural Domain Foundation Models Useful for Medical Image Classification? Joana Palés Huix Adithya Raju Ganeshan Johan Fredin Haslum Magnus P Soderberg Christos Matsoukas Kevin Smith OOD MedIm VLM 26 30 0 30 Oct 2023
Few-shot Hybrid Domain Adaptation of Image Generators Hengjia Li Yang Liu Linxuan Xia Yuqi Lin Tu Zheng Zheng Yang Wenxiao Wang Xiaohui Zhong Xiaobo Ren Xiaofei He 22 2 0 30 Oct 2023
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture Qianqian Shen Yunhan Zhao Nahyun Kwon Jeeeun Kim Yanan Li Shu Kong 28 2 0 30 Oct 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging Zitong S. Chen Chau Pham Siqi Wang Michael Doron Nikita Moshkov Bryan A. Plummer Juan C. Caicedo 30 11 0 30 Oct 2023
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach Ali Javidani Mohammad Amin Sadeghi Babak N. Araabi 30 0 0 28 Oct 2023
One-shot Localization and Segmentation of Medical Images with Foundation Models Deepa Anand Gurunath Reddy Vanika Singhal D. Shanbhag KS Shriram ... Dawei Gui R. Mullick Avinash Gopal Parminder Bhatia Taha A. Kass-Hout MedIm 52 13 0 28 Oct 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models Tsun-Hsuan Wang Alaa Maalouf Wei Xiao Yutong Ban Alexander Amini Guy Rosman S. Karaman Daniela Rus 27 42 0 26 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching Xinghui Li Jingyi Lu Kai Han V. Prisacariu DiffM 30 19 0 26 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar Gilles Puy Spyros Gidaris Alexandre Boulch Oriane Siméoni Corentin Sautier Patrick Pérez Andrei Bursuc Renaud Marlet 107 18 0 26 Oct 2023
Attribute Based Interpretable Evaluation Metrics for Generative Models Dongkyun Kim Mingi Kwon Youngjung Uh EGVM 40 2 0 26 Oct 2023
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation Qianxu Wang Haotong Zhang Congyue Deng Yang You Hao Dong Yixin Zhu Leonidas J. Guibas 29 18 0 25 Oct 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang Fang Li Narendra Ahuja 35 12 0 25 Oct 2023
Integrating View Conditions for Image Synthesis Jinbin Bai Zhen Dong Aosong Feng Xiao Zhang Tian-Chun Ye Kaicheng Zhou 67 13 0 24 Oct 2023
Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models Iman Nematollahi Kirill Yankov Wolfram Burgard Tim Welschehold 31 0 0 23 Oct 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations Yifeng Zhu Zhenyu Jiang Peter Stone Yuke Zhu 3DPC 29 45 0 22 Oct 2023
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application Bo Yuan Danpei Zhao 3DV CLL 38 10 0 22 Oct 2023
SILC: Improving Vision Language Pretraining with Self-Distillation Muhammad Ferjad Naeem Yongqin Xian Xiaohua Zhai Lukas Hoyer Luc Van Gool F. Tombari VLM 30 33 0 20 Oct 2023
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes Chengxu Zhuang Evelina Fedorenko Jacob Andreas 22 10 0 20 Oct 2023
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning Amey P. Pasarkar Adji Bousso Dieng 27 11 0 19 Oct 2023
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey Oriane Siméoni Éloi Zablocki Spyros Gidaris Gilles Puy Patrick Pérez 31 10 0 19 Oct 2023
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning Chen Jin Ryutaro Tanno Amrutha Saseendran Tom Diethe Philip Teare 21 2 0 18 Oct 2023
Functional Invariants to Watermark Large Transformers Pierre Fernandez Guillaume Couairon Teddy Furon Matthijs Douze 19 8 0 17 Oct 2023
Tracking and Mapping in Medical Computer Vision: A Review Adam Schmidt Omid Mohareri S. DiMaio Michael C. Yip Septimiu E. Salcudean 47 34 0 17 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models Lv Tang Peng-Tao Jiang Haoke Xiao Bo Li VLM 18 8 0 17 Oct 2023
Prototype-oriented Unsupervised Change Detection for Disaster Management Youngtack Oh Minseok Seo Do-Yun Kim Junghoon Seo 41 0 0 15 Oct 2023
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models Dongsheng Jiang Yuchen Liu Songlin Liu Jiné Zhao Hao Zhang Zhen Gao Xiaopeng Zhang Jin Li Hongkai Xiong MLLM VLM 41 34 0 13 Oct 2023
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video Shashanka Venkataramanan Mamshad Nayeem Rizve João Carreira Yuki M. Asano Yannis Avrithis SSL 39 18 0 12 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang Yunshuang Li Osbert Bastani Abhishek Gupta Dinesh Jayaraman Yecheng Jason Ma Luca Weihs 37 17 0 12 Oct 2023
Causal Unsupervised Semantic Segmentation Junho Kim Byung-Kwan Lee Yonghyun Ro 36 18 0 11 Oct 2023
Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images Gabriele Campanella Ricky Kwan Eugene Fluder Jennifer Zeng A. Stock ... Adam J. Schoenfeld Chad M. Vanderbilt P. Kovatch Carlos Cordon-Cardo Thomas J. Fuchs MedIm 63 25 0 10 Oct 2023
Self-supervised Object-Centric Learning for Videos Görkay Aydemir Weidi Xie Fatma Guney OCL VOS SSL 33 24 0 10 Oct 2023
A General Protocol to Probe Large Vision Models for 3D Physical Understanding Guanqi Zhan Chuanxia Zheng Weidi Xie Andrew Zisserman DiffM 26 14 0 10 Oct 2023
AttributionLab: Faithfulness of Feature Attribution Under Controllable Environments Yang Zhang Yawei Li Hannah Brown Mina Rezaei Bernd Bischl Philip Torr Ashkan Khakzar Kenji Kawaguchi OOD 55 1 0 10 Oct 2023
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models Fei Shen Hu Ye Jun Zhang Cong Wang Xiao Han Wei Yang DiffM 48 56 0 10 Oct 2023
Adaptive Multi-head Contrastive Learning Lei Wang Piotr Koniusz Tom Gedeon Liang Zheng 41 4 0 09 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models Chenzhuang Du Yue Zhao Chonghua Liao Jiacheng You Jie Fu Hang Zhao 47 2 0 08 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao Yangchao Wu Tian Yu Liu Alex Wong Stefano Soatto VOS 36 4 0 06 Oct 2023
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators Haiping Wang Yuan Liu Bing Wang Yujing Sun Zhenchao Dong Wenping Wang Bisheng Yang DiffM 38 11 0 05 Oct 2023
Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yi Ding Hao Tang Jen-Hao Rick Chang Liangchen Song Zhangyang Wang Liangliang Cao DiffM 43 10 0 04 Oct 2023
Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon Boyang Sun Marc Pollefeys Hermann Blum 20 5 0 04 Oct 2023
NOLA: Compressing LoRA using Linear Combination of Random Basis Soroush Abbasi Koohpayegani K. Navaneet Parsa Nooralinejad Soheil Kolouri Hamed Pirsiavash 40 12 0 04 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation Jialei Chen Daisuke Deguchi Chenkai Zhang Xu Zheng Hiroshi Murase VLM 19 9 0 03 Oct 2023
LEAP: Liberate Sparse-view 3D Modeling from Camera Poses Hanwen Jiang Zhenyu Jiang Yue Zhao Qixing Huang 34 37 0 02 Oct 2023
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video Xinhao Li Yuhan Zhu Limin Wang VLM 35 8 0 02 Oct 2023
HyMNet: a Multimodal Deep Learning System for Hypertension Classification using Fundus Photographs and Cardiometabolic Risk Factors Mohammed Baharoon Hessa Almatar Reema Alduhayan Tariq Aldebasi Badr O. Alahmadi Yahya Bokhari M. Alawad A. Almazroa Abdulrhman Aljouie 31 0 0 02 Oct 2023
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images Dominik A. Kloepfer Dylan Campbell João F. Henriques 3DPC 3DV 45 0 0 02 Oct 2023