2410.21352
Cited By
LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
28 October 2024
Ge Yang
Changyi He
J. Guo
Jianyu Wu
Yifu Ding
Aishan Liu
Haotong Qin
Pengliang Ji
Xianglong Liu
MQ
Papers citing "LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment"
4 / 4 papers shown
Title
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float
Tianyi Zhang
Yang Sui
Shaochen Zhong
V. Chaudhary
Xia Hu
Anshumali Shrivastava
MQ
32
1
0
15 Apr 2025
Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
Jun Zhao
Ming Wang
Miao Zhang
Yuzhang Shang
Xuebo Liu
Yaowei Wang
Min Zhang
Liqiang Nie
MQ
70
1
0
18 Feb 2025
First-place Solution for Streetscape Shop Sign Recognition Competition
Bin Wang
Li Jing
216
0
0
06 Jan 2025
PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models
Zining Wang
J. Guo
Ruihao Gong
Yang Yong
Aishan Liu
Yushi Huang
Jiaheng Liu
Xianglong Liu
76
1
0
10 Dec 2024