Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.01729
Cited By
Softmax Bias Correction for Quantized Generative Models
4 September 2023
N. Pandey
Marios Fournarakis
Chirag I. Patel
Markus Nagel
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Softmax Bias Correction for Quantized Generative Models"
10 / 10 papers shown
Title
DPQ-HD: Post-Training Compression for Ultra-Low Power Hyperdimensional Computing
Nilesh Prasad Pandey
Shriniwas Kulkarni
David Wang
Onat Gungor
Flavio Ponzina
T. Rosing
41
0
0
08 May 2025
Diffusion Model Quantization: A Review
Qian Zeng
Chenggong Hu
Mingli Song
Jie Song
MQ
45
0
0
08 May 2025
SoftmAP: Software-Hardware Co-design for Integer-Only Softmax on Associative Processors
M. Rakka
Jiyang Li
Guohao Dai
A. Eltawil
M. Fouda
Fadi J. Kurdahi
72
1
0
26 Nov 2024
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang
Ruihao Gong
Xianglong Liu
Jing Liu
Yuhang Li
Jiwen Lu
Dacheng Tao
DiffM
MQ
49
0
0
28 Jul 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
56
4
0
17 Jun 2024
RefQSR: Reference-based Quantization for Image Super-Resolution Networks
H. Lee
Jun-Sang Yoo
Seung-Won Jung
SupR
18
2
0
02 Apr 2024
Clockwork Diffusion: Efficient Generation With Model-Step Distillation
A. Habibian
Amir Ghodrati
Noor Fathima
Guillaume Sautière
Risheek Garrepalli
Fatih Porikli
Jens Petersen
38
7
0
13 Dec 2023
Efficient Quantization Strategies for Latent Diffusion Models
Yuewei Yang
Xiaoliang Dai
Jialiang Wang
Peizhao Zhang
Hongbo Zhang
DiffM
MQ
24
13
0
09 Dec 2023
Overcoming Oscillations in Quantization-Aware Training
Markus Nagel
Marios Fournarakis
Yelysei Bondarenko
Tijmen Blankevoort
MQ
111
101
0
21 Mar 2022
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
236
576
0
12 Sep 2019
1