Lossless Acceleration of Large Language Model via Adaptive N-gram
  Parallel Decoding
v1v2 (latest)

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Papers citing "Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding"

18 / 18 papers shown
Title
Qwen Technical Report
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
262
1,827
0
28 Sep 2023

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.