
FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts

Main: 9 pages · Appendix: 5 pages · Bibliography: 3 pages · 11 figures · 8 tables
Abstract

Parameter-Efficient Fine-Tuning (PEFT) methods have emerged as a widely adopted strategy for adapting pre-trained Large Language Models (LLMs) to downstream tasks, significantly reducing memory and computational costs. However, most existing PEFT techniques deploy LoRA adapters uniformly across all layers, disregarding the intrinsic heterogeneity of layer contributions and task-specific rank requirements. This uniform paradigm leads to redundant parameter allocation and suboptimal adaptation efficiency. To address these limitations, we propose FLoE, a novel PEFT framework with two key innovations: (i) a Fisher information-guided importance scoring mechanism that dynamically identifies task-critical transformer layers for MoE-based low-rank adaptation, enabling sparse adapter deployment; and (ii) a Bayesian optimization-driven rank allocator that automatically determines optimal LoRA ranks for a given dataset without exhaustive grid search. Extensive experiments across diverse LLMs and benchmarks show that FLoE achieves strong efficiency-accuracy trade-offs, making it particularly advantageous in resource-constrained environments that require rapid adaptation.
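
The Fisher-guided layer-selection step admits a compact illustration. Below is a minimal sketch that scores each transformer block by the mean squared gradient of the task loss over its parameters (a diagonal Fisher approximation) and keeps the top-k layers for sparse adapter placement. It assumes a Hugging Face-style causal LM whose blocks are exposed under model.model.layers and a dataloader yielding batches that include labels; the names fisher_layer_scores and select_layers and the top-k rule are illustrative assumptions, not FLoE's actual implementation.

import torch

def fisher_layer_scores(model, dataloader, device="cuda"):
    """Approximate per-layer Fisher importance as the mean squared gradient
    of the task loss with respect to each transformer block's parameters."""
    model.to(device)
    model.train()
    layers = model.model.layers            # assumed location of transformer blocks
    scores = torch.zeros(len(layers))

    for batch in dataloader:
        batch = {k: v.to(device) for k, v in batch.items()}
        model.zero_grad()
        loss = model(**batch).loss         # standard causal-LM loss (labels in batch)
        loss.backward()
        for i, block in enumerate(layers):
            grads = [p.grad.pow(2).mean() for p in block.parameters()
                     if p.grad is not None]
            if grads:
                scores[i] += torch.stack(grads).mean().item()

    return scores / max(len(dataloader), 1)

def select_layers(scores, k=8):
    """Keep the k highest-scoring layers for sparse LoRA/MoE adapter placement."""
    return torch.topk(scores, k).indices.tolist()

In this sketch, only the selected layers would receive low-rank expert adapters; the remaining layers stay frozen, which is what yields the sparse deployment described in the abstract.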

@article{wang2025_2506.00495,
  title={FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts},
  author={Xinyi Wang and Lirong Gao and Haobo Wang and Yiming Zhang and Junbo Zhao},
  journal={arXiv preprint arXiv:2506.00495},
  year={2025}
}