2109.01903
Cited By
Robust fine-tuning of zero-shot models
4 September 2021
Mitchell Wortsman
Gabriel Ilharco
Jong Wook Kim
Mike Li
Simon Kornblith
Rebecca Roelofs
Raphael Gontijo-Lopes
Hannaneh Hajishirzi
Ali Farhadi
Hongseok Namkoong
Ludwig Schmidt
VLM
Papers citing
"Robust fine-tuning of zero-shot models"
50 / 514 papers shown
Title
Retaining and Enhancing Pre-trained Knowledge in Vision-Language Models with Prompt Ensembling
Donggeun Kim
Yujin Jo
Myungjoo Lee
Taesup Kim
VLM
78
0
0
10 Dec 2024
How to Merge Your Multimodal Models Over Time?
Sebastian Dziadzio
Vishaal Udandarao
Karsten Roth
Ameya Prabhu
Zeynep Akata
Samuel Albanie
Matthias Bethge
MoMe
98
3
0
09 Dec 2024
Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA
Haodong Lu
Chongyang Zhao
Jason Xue
Lina Yao
Kristen Moore
Dong Gong
VLM
KELM
CLL
88
3
0
01 Dec 2024
Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Kaican Li
Weiyan Xie
Yongxiang Huang
Didan Deng
Lanqing Hong
ZeLin Li
Ricardo Silva
N. Zhang
71
0
0
29 Nov 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMe
FedML
84
3
0
27 Nov 2024
FLEX-CLIP: Feature-Level GEneration Network Enhanced CLIP for X-shot Cross-modal Retrieval
Jingyou Xie
Jiayi Kuang
Zhenzhou Lin
Jiarui Ouyang
Zishuo Zhao
Ying Shen
VLM
CLIP
69
0
0
26 Nov 2024
Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation
Shambhavi Mishra
Julio Silva-Rodríguez
Ismail ben Ayed
M. Pedersoli
Jose Dolz
VLM
82
1
0
26 Nov 2024
Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics
Tian Bowen
Lai Songning
Wu Jiemin
Shuai Zhihao
Ge Shiming
Yue Yutao
MoMe
70
4
0
25 Nov 2024
LAGUNA: LAnguage Guided UNsupervised Adaptation with structured spaces
Anxhelo Diko
Antonino Furnari
Luigi Cinque
G. Farinella
95
0
0
23 Nov 2024
Improving OOD Generalization of Pre-trained Encoders via Aligned Embedding-Space Ensembles
Shuman Peng
Arash Khoeini
Sharan Vaswani
Martin Ester
70
0
0
20 Nov 2024
Joint Vision-Language Social Bias Removal for CLIP
Haoyu Zhang
Yangyang Guo
Mohan S. Kankanhalli
VLM
67
0
0
19 Nov 2024
Greenback Bears and Fiscal Hawks: Finance is a Jungle and Text Embeddings Must Adapt
Peter Anderson
Mano Vikash Janardhanan
Jason He
Wei Cheng
Charlie Flanagan
RALM
37
3
0
11 Nov 2024
Robust Fine-tuning of Zero-shot Models via Variance Reduction
B. Zhu
Jiequan Cui
H. Zhang
VLM
OODD
43
1
0
11 Nov 2024
Maximizing domain generalization in fetal brain tissue segmentation: the role of synthetic data generation, intensity clustering and real image fine-tuning
Vladyslav Zalevskyi
Thomas Sanchez
Margaux Roulet
Hélène Lajous
Jordina Aviles Verdera
Jana Hutter
Hamza Kebiri
Meritxell Bach Cuadra
OOD
48
1
0
11 Nov 2024
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models
Maya Varma
Jean-Benoit Delbrouck
Zhihong Chen
Akshay S. Chaudhari
C. Langlotz
VLM
46
6
0
06 Nov 2024
Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
Bharat Yalavarthi
N. Ratha
35
0
0
06 Nov 2024
Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Junjiao Tian
Chengyue Huang
Z. Kira
36
1
0
03 Nov 2024
Black-Box Forgetting
Yusuke Kuwana
Yuta Goto
Takashi Shibata
Go Irie
CLL
MU
VLM
28
0
0
01 Nov 2024
Model merging with SVD to tie the Knots
George Stoica
Pratik Ramesh
B. Ecsedi
Leshem Choshen
Judy Hoffman
MoMe
36
9
0
25 Oct 2024
Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Xingyu Zhu
B. Zhu
Yi Tan
Shuo Wang
Y. Hao
Hanwang Zhang
VLM
VPVLM
32
1
0
25 Oct 2024
In Search of the Successful Interpolation: On the Role of Sharpness in CLIP Generalization
Alireza Abdollahpoorrostam
23
0
0
21 Oct 2024
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection
Andrea Appiani
Cigdem Beyan
CLIP
VLM
28
0
0
18 Oct 2024
Fine-Tuning Pre-trained Language Models for Robust Causal Representation Learning
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
33
0
0
18 Oct 2024
Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion
Yijun Liang
Shweta Bhardwaj
Dinesh Manocha
45
0
0
17 Oct 2024
Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
Jacob Morrison
Noah A. Smith
Hannaneh Hajishirzi
Pang Wei Koh
Jesse Dodge
Pradeep Dasigi
KELM
MoMe
CLL
39
1
0
16 Oct 2024
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng
Zifeng Wang
Yike Wang
Sayna Ebrahimi
Hamid Palangi
...
Nathalie Rauschmayr
Yejin Choi
Yulia Tsvetkov
Chen-Yu Lee
Tomas Pfister
MoMe
35
3
0
15 Oct 2024
Large Model for Small Data: Foundation Model for Cross-Modal RF Human Activity Recognition
Yuxuan Weng
Guoquan Wu
Tianyue Zheng
Yanbing Yang
Jun Luo
21
5
0
13 Oct 2024
Debiasing Vison-Language Models with Text-Only Training
Yunfan Yang
Chaoquan Jiang
Zhiyu Lin
Jinlin Xiao
Jiaming Zhang
Jitao Sang
VLM
28
1
0
12 Oct 2024
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
Anh-Quan Cao
M. Jaritz
Matthieu Guillaumin
Raoul de Charette
Loris Bazzani
VLM
CLIP
49
2
0
10 Oct 2024
Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization
Chengtao Jian
Kai Yang
Yang Jiao
AI4TS
34
3
0
09 Oct 2024
Diversity-Rewarded CFG Distillation
Geoffrey Cideron
A. Agostinelli
Johan Ferret
Sertan Girgin
Romuald Elie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
39
2
0
08 Oct 2024
QT-DoG: Quantization-aware Training for Domain Generalization
Saqib Javed
Hieu Le
Mathieu Salzmann
OOD
MQ
28
1
0
08 Oct 2024
Wolf2Pack: The AutoFusion Framework for Dynamic Parameter Fusion
Bowen Tian
Songning Lai
Yutao Yue
MoMe
30
0
0
08 Oct 2024
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Youngtaek Oh
Jae-Won Cho
Dong-Jin Kim
In So Kweon
Junmo Kim
VLM
CoGe
CLIP
27
4
0
07 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Joey Tianyi Zhou
Tsendsuren Munkhdalai
MoMe
46
15
0
04 Oct 2024
MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection
Niki Nezakati
Md Kaykobad Reza
Ameya Patil
Mashhour Solh
M. Salman Asif
27
1
0
03 Oct 2024
Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
Shuoyuan Wang
Yixuan Li
Hongxin Wei
VLM
51
2
0
03 Oct 2024
SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation
Mucong Ding
Bang An
Yuancheng Xu
Anirudh Satheesh
Furong Huang
24
1
0
03 Oct 2024
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation
Changdae Oh
Yixuan Li
Kyungwoo Song
Sangdoo Yun
Dongyoon Han
OOD
MoMe
45
4
0
03 Oct 2024
Toward a Holistic Evaluation of Robustness in CLIP Models
Weijie Tu
Weijian Deng
Tom Gedeon
VLM
38
5
0
02 Oct 2024
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL
SSL
24
2
0
01 Oct 2024
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models
Shitian Zhao
Renrui Zhang
Xu Luo
Yan Wang
Shanghang Zhang
Peng Gao
18
0
0
01 Oct 2024
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Da-Wei Zhou
Zi-Wen Cai
Han-Jia Ye
Lijun Zhang
De-Chuan Zhan
CLL
AI4CE
76
2
0
01 Oct 2024
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin
Chaehyun Kim
Sunghwan Hong
Seokju Cho
Anurag Arnab
Paul Hongsuck Seo
Seungryong Kim
VLM
34
1
0
30 Sep 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
35
6
0
26 Sep 2024
Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks
Roberto Alcover-Couso
Juan C. Sanmiguel
Marcos Escudero-Viñolo
Jose M. Martínez
FedML
MoMe
28
1
0
24 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
117
3
0
23 Sep 2024
LARE: Latent Augmentation using Regional Embedding with Vision-Language Model
Kosuke Sakurai
Tatsuya Ishii
Ryotaro Shimizu
Linxin Song
Masayuki Goto
VLM
26
0
0
19 Sep 2024
LPT++: Efficient Training on Mixture of Long-tailed Experts
Bowen Dong
Pan Zhou
W. Zuo
VLM
39
0
0
17 Sep 2024
Finetuning CLIP to Reason about Pairwise Differences
Dylan Sam
Devin Willmott
João Dias Semedo
J. Zico Kolter
VLM
71
3
0
15 Sep 2024