2306.01708
TIES-Merging: Resolving Interference When Merging Models
Neural Information Processing Systems (NeurIPS), 2023
2 June 2023
Prateek Yadav
Derek Tam
Leshem Choshen
Colin Raffel
Joey Tianyi Zhou
MoMe
ArXiv (abs)
PDF
HTML
HuggingFace (14 upvotes)
Github (179★)
Papers citing
"TIES-Merging: Resolving Interference When Merging Models"
50 / 356 papers shown
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
Atsuki Yamaguchi
Terufumi Morishita
Aline Villavicencio
Nikolaos Aletras
CLL
KELM
ELM
303
1
0
04 Dec 2025
TRINITY: An Evolved LLM Coordinator
Jinglue Xu
Qi Sun
Peter Schwendeman
Stefan Nielsen
Edoardo Cetin
Yujin Tang
LLMAG
314
0
0
04 Dec 2025
Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation
Junghwan Park
Woojin Cho
J. Heo
Darongsae Kwon
Kookjin Lee
122
0
0
02 Dec 2025
An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation
Daiki Shirafuji
Tatsuhiko Saito
Yasutomo Kimura
MoMe
KELM
170
0
0
02 Dec 2025
Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Kuangpu Guo
Yuhe Ding
Jian Liang
Zilei Wang
Ran He
MoMe
150
0
0
01 Dec 2025
From Coefficients to Directions: Rethinking Model Merging with Directional Alignment
Zhikang Chen
Sen Cui
Deheng Ye
Min Zhang
Gang Niu
Yu Zhang
Masashi Sugiyama
Tingting Zhu
MoMe
236
0
0
29 Nov 2025
A Systematic Study of In-the-Wild Model Merging for Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
386
3
0
26 Nov 2025
Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning
Zhen Zeng
Leijiang Gu
Zhangling Duan
Feng-Qiang Li
Zenglin Shi
Cees G. M. Snoek
Meng Wang
KELM
MU
CLL
326
1
0
25 Nov 2025
MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent
Yuxia Fu
Zhizhen Zhang
Y. Zhang
Zijian Wang
Zi-Rui Huang
Yadan Luo
MoMe
410
5
0
24 Nov 2025
Escaping Optimization Stagnation: Taking Steps Beyond Task Arithmetic via Difference Vectors
Jinping Wang
Zhiqiang Gao
Dinggen Zhang
Zhiwu Xie
MoMe
343
0
0
22 Nov 2025
MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images
Doanh C. Bui
Ba-Hung Ngo
H. Pham
Khang Phuoc-Quy Nguyen
Maï K. Nguyen
Y. Nakashima
CLL
MoMe
VLM
368
0
0
17 Nov 2025
A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs
Prakrit Timilsina
Anuj Nepal
Rajan Kadel
Robin Doss
MoMe
140
0
0
17 Nov 2025
Defending Unauthorized Model Merging via Dual-Stage Weight Protection
Wei-Jia Chen
Min-Yen Tsai
Cheng-Yi Lee
Chia-Mu Yu
MoMe
AAML
454
0
0
14 Nov 2025
Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective
Justin Lee
Zheda Mai
Jinsu Yoo
Chongyu Fan
Cheng Zhang
Wei-Lun Chao
DiffM
VLM
277
2
0
11 Nov 2025
Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures
Suqing Wang
Ziyang Ma
Xinyi Li
Zuchao Li
206
0
0
09 Nov 2025
Steering Language Models with Weight Arithmetic
Constanza Fierro
Fabien Roger
MoMe
LLMSV
618
5
0
07 Nov 2025
Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models
Davide Marincione
Donato Crisostomi
Roberto Dessi
Emanuele Rodolà
Emanuele Rossi
MoMe
AI4CE
VLM
388
1
0
07 Nov 2025
Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
Kentaro Ueda
François Portet
H. Suwa
Keiichi Yasumoto
CLL
MoMe
442
1
0
04 Nov 2025
Parameterized Prompt for Incremental Object Detection
Zijia An
Boyu Diao
R. Liu
Libo Huang
Chuanguang Yang
Fei Wang
Zhulin An
Yongjun Xu
CLL
VLM
319
0
0
31 Oct 2025
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
Raza Imam
Hu Wang
Dwarikanath Mahapatra
Mohammad Yaqub
MoMe
328
0
0
31 Oct 2025
WeaveRec: An LLM-Based Cross-Domain Sequential Recommendation Framework with Model Merging
Min Hou
Xin Liu
Le Wu
Chenyi He
Hao Liu
Z. Li
Xin Li
Si Wei
MoMe
370
0
0
30 Oct 2025
World Simulation with Video Foundation Models for Physical AI
Nvidia
A. M. Ali
Junjie Bai
Maciej Bala
Yogesh Balaji
...
Jing Zhang
Qinsheng Zhang
Kaiwen Zheng
Andrew Zhu
Yuke Zhu
VGen
PINN
683
64
0
28 Oct 2025
Eigen-Value: Efficient Domain-Robust Data Valuation via Eigenvalue-Based Approach
Youngjun Choi
Joonseong Kang
Sungjun Lim
Kyungwoo Song
TDI
298
0
0
27 Oct 2025
Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Yuxin Pan
Zhiguang Cao
Chengyang Gu
Liu Liu
Peilin Zhao
Yize Chen
Fangzhen Lin
225
2
0
24 Oct 2025
Model Merging with Functional Dual Anchors
Kexuan Shi
Yandong Wen
Weiyang Liu
MoMe
322
2
0
24 Oct 2025
Mapping Post-Training Forgetting in Language Models at Scale
Jackson Harmon
Andreas Hochlehnert
Matthias Bethge
Ameya Prabhu
CLL
KELM
235
2
0
20 Oct 2025
Hierarchical Federated Unlearning for Large Language Models
Yisheng Zhong
Zhengbang Yang
Zhuangdi Zhu
MU
273
1
0
19 Oct 2025
MIN-Merging: Merge the Important Neurons for Model Merging
Yunfei Liang
MoMe
608
0
0
18 Oct 2025
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Shih-yang Liu
Xin Dong
Ximing Lu
Shizhe Diao
Mingjie Liu
...
Yu Wang
K.-T. Cheng
Yejin Choi
Jan Kautz
Pavlo Molchanov
160
18
0
16 Oct 2025
Backdoor Unlearning by Linear Task Decomposition
Amel Abdelraheem
Alessandro Favero
Gérôme Bovet
Pascal Frossard
AAML
MU
275
0
0
16 Oct 2025
Directional Reasoning Injection for Fine-Tuning MLLMs
Chao Huang
Zeliang Zhang
Jiang Liu
Ximeng Sun
Jialian Wu
X. Yu
Ze Wang
Chenliang Xu
Emad Barsoum
Zicheng Liu
MoMe
LRM
299
3
0
16 Oct 2025
Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation
Xujun Peng
Anoop Kumar
Jingyu Wu
Parker Glenn
Daben Liu
MoMe
208
0
0
16 Oct 2025
Purifying Task Vectors in Knowledge-Aware Subspace for Model Merging
Bang An
Yibo Yang
Philip Torr
Bernard Ghanem
MoMe
206
1
0
16 Oct 2025
Weight Weaving: Parameter Pooling for Data-Free Model Merging
Levy G. Chaves
Eduardo Valle
Sandra Avila
MoMe
300
1
0
15 Oct 2025
Towards Reversible Model Merging For Low-rank Weights
Mohammadsajad Alipour
Mohammad Mohammadi Amiri
MoMe
189
0
0
15 Oct 2025
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression
Mike Lasby
Ivan Lazarevich
Nish Sinnadurai
Sean Lie
Yani Andrew Ioannou
Vithursan Thangarasa
165
8
0
15 Oct 2025
Exploring and Leveraging Class Vectors for Classifier Editing
Jaeik Kim
Jaeyoung Do
VLM
226
0
0
13 Oct 2025
On-device System of Compositional Multi-tasking in Large Language Models
Ondrej Bohdal
Konstantinos Theodosiadis
Asterios Mpatziakas
Dimitris Filippidis
Iro Spyrou
...
Kyeng-Hun Lee
J. Moon
Hyeonmok Ko
Mete Ozay
Umberto Michieli
157
1
0
11 Oct 2025
Towards Efficient Multimodal Unified Reasoning Model via Model Merging
Qixiang Yin
Huanjin Yao
Jianghao Chen
Jiaxing Huang
Z. Zhao
Fei Su
LRM
MoMe
371
1
0
10 Oct 2025
Don't Throw Away Your Pretrained Model
Shangbin Feng
Wenhao Yu
Yike Wang
Hongming Zhang
Yulia Tsvetkov
Dong Yu
MoMe
285
4
0
10 Oct 2025
Diagnosing and Mitigating System Bias in Self-Rewarding RL
Chuyi Tan
Peiwen Yuan
Xinglin Wang
Yiwei Li
Shaoxiong Feng
...
Jiayi Shi
Ji Zhang
Boyuan Pan
Yao Hu
Kan Li
154
0
0
10 Oct 2025
Backdoor Vectors: a Task Arithmetic View on Backdoor Attacks and Defenses
Stanisław Pawlak
Jan Dubiński
Daniel Marczak
Bartłomiej Twardowski
AAML
MoMe
272
0
0
09 Oct 2025
Do We Really Need Permutations? Impact of Model Width on Linear Mode Connectivity
Akira Ito
Masanori Yamada
Daiki Chijiwa
Atsutoshi Kumagai
MoMe
261
0
0
09 Oct 2025
FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
Yunbo Li
Jiaping Gui
Zhihang Deng
Fanchao Meng
Yue Wu
FedML
410
12
0
09 Oct 2025
Gradient-Sign Masking for Task Vector Transport Across Pre-Trained Models
Filippo Rinaldi
Aniello Panariello
Giacomo Salici
Fengyuan Liu
Marco Ciccone
Angelo Porrello
Simone Calderara
231
1
0
07 Oct 2025
FedSRD: Sparsify-Reconstruct-Decompose for Communication-Efficient Federated Large Language Models Fine-Tuning
Guochen Yan
Luyuan Xie
Qingni Shen
Yuejian Fang
Zhonghai Wu
238
1
0
06 Oct 2025
BaldWhisper: Faster Whisper with Head Shearing and Layer Merging
Yaya Sy
Christophe Cerisara
Irina Illina
116
2
0
06 Oct 2025
Boomerang Distillation Enables Zero-Shot Model Size Interpolation
Sara Kangaslahti
Nihal V. Nayak
Jonathan Geuter
Marco Fumero
Francesco Locatello
David Alvarez-Melis
225
1
0
06 Oct 2025
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration
Yisu Wang
Ming Wang
Haoyuan Song
Wenjie Huang
Chaozheng Wang
Yi Xie
Xuming Ran
KELM
MoMe
CLL
176
1
0
02 Oct 2025
Expert Merging: Model Merging with Unsupervised Expert Alignment and Importance-Guided Layer Chunking
Dengming Zhang
Xiaowen Ma
Zhenliang Ni
Zhenkai Wu
Han Shu
Xin Jiang
Xinghao Chen
MoMe
195
3
0
30 Sep 2025