
Language model compression with weighted low-rank factorization
Papers citing "Language model compression with weighted low-rank factorization"
50 / 78 papers shown
Title |
---|
![]() FactorLLM: Factorizing Knowledge via Mixture of Experts for Large
Language Models Zhongyu Zhao Menghang Dong Rongyu Zhang Wenzhao Zheng Yunpeng Zhang Huanrui Yang Dalong Du Kurt Keutzer Shanghang Zhang |