2412.18110
Cited By
SlimGPT: Layer-wise Structured Pruning for Large Language Models
24 December 2024
Gui Ling
Ziyang Wang
Yuliang Yan
Qingwen Liu
Papers citing "SlimGPT: Layer-wise Structured Pruning for Large Language Models"
7 / 7 papers shown
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He
Huazhen Lin
26
0
0
10 Jun 2025
SlimLLM: Accurate Structured Pruning for Large Language Models
Jialong Guo
Xinghao Chen
Yehui Tang
Yunhe Wang
32
0
0
28 May 2025
Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs
Hanting Chen
Jiarui Qin
Jialong Guo
Tao Yuan
Yichun Yin
...
Can Chen
Xinghao Chen
Fisher Yu
Ruiming Tang
Yunhe Wang
63
0
0
26 May 2025
SPAP: Structured Pruning via Alternating Optimization and Penalty Methods
Hanyu Hu
Xiaoming Yuan
92
0
0
06 May 2025
ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs
Fahmida Liza Piya
Rahmatollah Beheshti
293
0
0
23 Apr 2025
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Akhiad Bercovich
Tomer Ronen
Talor Abramovich
Nir Ailon
Nave Assaf
...
Ido Shahaf
Oren Tropp
Omer Ullman Argov
Ran Zilberstein
Ran El-Yaniv
213
4
0
28 Nov 2024
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELM
LRM
334
755
0
19 Sep 2023