Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.20776
Cited By
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences
27 May 2025
Jungyoub Cha
Hyunjong Kim
Sungzoon Cho
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences"
4 / 4 papers shown
Title
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
244
18
0
03 Mar 2025
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
Penghui Yang
Cunxiao Du
Fengzhuo Zhang
Haonan Wang
Tianyu Pang
Chao Du
Bo An
RALM
99
2
0
24 Feb 2025
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
Jikai Wang
Yi Su
Juntao Li
Qingrong Xia
Zi Ye
Xinyu Duan
Zhefeng Wang
Min Zhang
144
19
0
25 Jun 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
152
165
0
26 Jan 2024
1