Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.11233
Cited By
Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models
17 September 2024
Bishwash Khanal
Jeffery M. Capone
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models"
6 / 6 papers shown
Title
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Wei-Lin Chiang
Lianmin Zheng
Ying Sheng
Anastasios Nikolas Angelopoulos
Tianle Li
...
Hao Zhang
Banghua Zhu
Michael I. Jordan
Joseph E. Gonzalez
Ion Stoica
OSLM
80
536
0
07 Mar 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
54
51
0
15 Feb 2024
Compressing LLMs: The Truth is Rarely Pure and Never Simple
Ajay Jaiswal
Zhe Gan
Xianzhi Du
Bowen Zhang
Zhangyang Wang
Yinfei Yang
MQ
62
47
0
02 Oct 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
212
4,085
0
09 Jun 2023
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle
Michael Carbin
147
3,433
0
09 Mar 2018
Learning both Weights and Connections for Efficient Neural Networks
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
202
6,628
0
08 Jun 2015
1