Limited-Resource Adapters Are Regularizers, Not Linguists

Cross-lingual transfer from related high-resource languages is a well-established strategy to enhance low-resource language technologies. Prior work has shown that adapters show promise for, e.g., improving low-resource machine translation (MT). In this work, we investigate an adapter souping method combined with cross-attention fine-tuning of a pre-trained MT model to leverage language transfer for three low-resource Creole languages, which exhibit relatedness to different language groups across distinct linguistic dimensions. Our approach improves performance substantially over baselines. However, we find that linguistic relatedness -- or even a lack thereof -- does not covary meaningfully with adapter performance. Surprisingly, our cross-attention fine-tuning approach appears equally effective with randomly initialized adapters, implying that the benefit of adapters in this setting lies in parameter regularization, and not in meaningful information transfer. We provide analysis supporting this regularization hypothesis. Our findings underscore the reality that neural language processing involves many success factors, and that not all neural methods leverage linguistic knowledge in intuitive ways.
View on arXiv@article{fekete2025_2505.24525, title={ Limited-Resource Adapters Are Regularizers, Not Linguists }, author={ Marcell Fekete and Nathaniel R. Robinson and Ernests Lavrinovics and E. Djeride Jean-Baptiste and Raj Dabre and Johannes Bjerva and Heather Lent }, journal={arXiv preprint arXiv:2505.24525}, year={ 2025 } }