Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.01705
Cited By
Progressive Binarization with Semi-Structured Pruning for LLMs
3 February 2025
Xinyu Yan
Tianao Zhang
Zhiteng Li
Yulun Zhang
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Progressive Binarization with Semi-Structured Pruning for LLMs"
12 / 12 papers shown
Title
DVD-Quant: Data-free Video Diffusion Transformers Quantization
Zhiteng Li
Hanxuan Li
Junyi Wu
Kai Liu
Linghe Kong
Guihai Chen
Yulun Zhang
Xiaokang Yang
MQ
VGen
26
0
0
24 May 2025
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
Peijie Dong
Lujun Li
Dayou Du
Yuhan Chen
Zhenheng Tang
...
Wei Xue
Wenhan Luo
Qi-fei Liu
Yi-Ting Guo
Xiaowen Chu
MQ
61
6
0
03 Aug 2024
DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Hong Chen
Chengtao Lv
Liang Ding
Haotong Qin
Xiabin Zhou
...
Xuebo Liu
Min Zhang
Jinyang Guo
Xianglong Liu
Dacheng Tao
MQ
17
21
0
19 Feb 2024
Fluctuation-based Adaptive Structured Pruning for Large Language Models
Yongqi An
Xu Zhao
Tao Yu
Ming Tang
Jinqiao Wang
59
47
0
19 Dec 2023
ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models
Zhihang Yuan
Yuzhang Shang
Yue Song
Qiang Wu
Yan Yan
Guangyu Sun
MQ
58
54
0
10 Dec 2023
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar
Sidak Pal Singh
Dan Alistarh
MQ
56
226
0
24 Aug 2022
BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction
Yuhang Li
Ruihao Gong
Xu Tan
Yang Yang
Peng Hu
Qi Zhang
F. Yu
Wei Wang
Shi Gu
MQ
87
426
0
10 Feb 2021
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
126
42,038
0
03 Dec 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
133
1,475
0
24 May 2019
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
324
129,831
0
12 Jun 2017
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
122
4,342
0
16 Mar 2016
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
180
8,793
0
01 Oct 2015
1