
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Papers citing "Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding"
18 / 18 papers shown
Title |
---|
Qwen Technical Report. Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, ... Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu |