Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.14160
Cited By
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement
21 February 2024
Wonseok Jeon
Mukul Gagrani
Raghavv Goel
Junyoung Park
Mingu Lee
Christopher Lott
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement"
2 / 2 papers shown
Title
Constrained Decoding with Speculative Lookaheads
Nishanth Nakshatri
Shamik Roy
Rajarshi Das
Suthee Chaidaroon
Leonid Boytsov
Rashmi Gangadharaiah
137
0
0
09 Dec 2024
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
Jikai Wang
Yi Su
Juntao Li
Qingrong Xia
Zi Ye
Xinyu Duan
Zhefeng Wang
Min Zhang
91
17
0
25 Jun 2024
1