Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.10424
Cited By
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache
5 February 2025
Rishabh Tiwari
Haocheng Xi
Aditya Tomar
Coleman Hooper
Sehoon Kim
Maxwell Horton
Mahyar Najibi
Michael W. Mahoney
Kemal Kurniawan
Amir Gholami
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache"
1 / 1 papers shown
Title
ML-SpecQD: Multi-Level Speculative Decoding with Quantized Drafts
E. Georganas
Dhiraj D. Kalamkar
Alexander Kozlov
A. Heinecke
MQ
136
0
0
17 Mar 2025
1