Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.12208
Cited By
Knowledge Fusion By Evolving Weights of Language Models
18 June 2024
Guodong DU
Yiyao Cao
Hanting Liu
Runhua Jiang
Shuyang Yu
Yifei Guo
Sim Kuan Goh
Jing Li
MoMe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Knowledge Fusion By Evolving Weights of Language Models"
15 / 15 papers shown
Title
Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling
Junlin Li
Guodong DU
Jing Li
Sim Kuan Goh
Wenya Wang
...
Fangming Liu
Jing Li
Saleh Alharbi
Daojing He
Min Zhang
MoMe
CLL
95
1
0
21 May 2025
Robust fine-tuning of zero-shot models
Mitchell Wortsman
Gabriel Ilharco
Jong Wook Kim
Mike Li
Simon Kornblith
...
Raphael Gontijo-Lopes
Hannaneh Hajishirzi
Ali Farhadi
Hongseok Namkoong
Ludwig Schmidt
VLM
117
721
0
04 Sep 2021
MergeDistill: Merging Pre-trained Language Models using Distillation
Simran Khanuja
Melvin Johnson
Partha P. Talukdar
51
16
0
05 Jun 2021
Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection
Ben Chen
Bin Chen
D. Gao
Qijin Chen
Chengfu Huo
Xiaonan Meng
Weijun Ren
Yang Zhou
62
40
0
14 Jan 2021
Differential Evolution for Neural Architecture Search
Noor H. Awad
Neeratyoy Mallik
Frank Hutter
34
26
0
11 Dec 2020
Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping
Jesse Dodge
Gabriel Ilharco
Roy Schwartz
Ali Farhadi
Hannaneh Hajishirzi
Noah A. Smith
93
595
0
15 Feb 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
381
20,053
0
23 Oct 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
536
24,351
0
26 Jul 2019
Neural Network Acceptability Judgments
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
224
1,407
0
31 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
1.0K
7,152
0
20 Apr 2018
WASSA-2017 Shared Task on Emotion Intensity
Saif M. Mohammad
Felipe Bravo-Marquez
67
257
0
11 Aug 2017
An Overview of Multi-Task Learning in Deep Neural Networks
Sebastian Ruder
CVBM
136
2,826
0
15 Jun 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
509
4,473
0
18 Apr 2017
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
246
8,124
0
16 Jun 2016
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. B. McMahan
Eider Moore
Daniel Ramage
S. Hampson
Blaise Agüera y Arcas
FedML
380
17,437
0
17 Feb 2016
1