Model Merging with Functional Dual Anchors

24 October 2025

Main: 9 Pages

15 Figures

Bibliography: 4 Pages

8 Tables

Appendix: 10 Pages

Abstract

Model merging is an efficient post-training strategy for integrating knowledge from multiple finetuned checkpoints of a shared foundation model. Existing methods operate in the parameter space, combining task vectors to mitigate conflicts, but remain constrained by parameter inconsistencies. We propose Functional Dual Anchors (FDAs), a framework that instead models the input-representation space. FDAs are synthetic inputs whose induced gradients align with task vectors, capturing task-specific functional shifts relative to the pretrained model. This perspective bridges joint multi-task training and post-hoc merging, offering both robustness and flexibility. We further introduce a principled initialization scheme and show that FDAs are complementary to parameter-space model merging. Comprehensive experiments demonstrate the effectiveness of FDAs in model merging.
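The idea of "synthetic inputs whose induced gradients align with task vectors" can be illustrated with a toy sketch. The version below is not the paper's algorithm. It assumes a linear model `f(x) = W x`, a squared-distance loss between the current model's output and the finetuned model's output on the anchor, and a plain finite-difference ascent on cosine similarity. All function names (`task_vector`, `induced_update`, `alignment`, `fit_anchor`) and the example weights are hypothetical.

```python
import math

def matvec(A, x):
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def task_vector(W0, W1):
    # Task vector: finetuned weights minus pretrained weights.
    return [[b - a for a, b in zip(r0, r1)] for r0, r1 in zip(W0, W1)]

def induced_update(W0, W1, x):
    # Negative gradient, evaluated at W = W0, of the assumed loss
    # 0.5 * ||W x - W1 x||^2 with respect to W. It equals ((W1 - W0) x) x^T.
    residual = [b - a for a, b in zip(matvec(W0, x), matvec(W1, x))]
    return [[r * xj for xj in x] for r in residual]

def alignment(W0, W1, x):
    # Cosine similarity between the anchor-induced update and the task vector.
    g, tau = induced_update(W0, W1, x), task_vector(W0, W1)
    dot = sum(a * b for rg, rt in zip(g, tau) for a, b in zip(rg, rt))
    g_sq = sum(a * a for row in g for a in row)
    tau_sq = sum(a * a for row in tau for a in row)
    return dot / math.sqrt(g_sq * tau_sq)

def fit_anchor(W0, W1, x, steps=200, lr=0.5, eps=1e-5):
    # Finite-difference gradient ascent on the alignment score.
    # The score does not depend on the scale of x, so x is renormalised each step.
    for _ in range(steps):
        grad = []
        for i in range(len(x)):
            up = x[:i] + [x[i] + eps] + x[i + 1:]
            down = x[:i] + [x[i] - eps] + x[i + 1:]
            grad.append((alignment(W0, W1, up) - alignment(W0, W1, down)) / (2 * eps))
        x = [xi + lr * gi for xi, gi in zip(x, grad)]
        norm = math.sqrt(sum(xi * xi for xi in x))
        x = [xi / norm for xi in x]
    return x, alignment(W0, W1, x)

# Hypothetical 2x2 "pretrained" and "finetuned" weights.
W0 = [[1.0, 0.0], [0.0, 1.0]]
W1 = [[3.0, 0.0], [0.0, 1.5]]
x0 = [0.6, 0.8]

start_score = alignment(W0, W1, x0)
anchor, final_score = fit_anchor(W0, W1, x0)
print(round(start_score, 3), round(final_score, 3))
```

In this linear toy case a single anchor induces a rank-one update, so its alignment with the task vector is capped at the ratio of the task vector's largest singular value to its Frobenius norm. Getting closer would take several anchors, which this sketch does not attempt.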
