Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.08294
Cited By
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models
16 January 2024
Shuming Shi
Enbo Zhao
Deng Cai
Leyang Cui
Xinting Huang
Huayang Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models"
2 / 2 papers shown
Title
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Jinho Choi
Tony Q. S. Quek
Seong-Lyun Kim
9
0
0
17 May 2025
Locally Typical Sampling
Clara Meister
Tiago Pimentel
Gian Wiher
Ryan Cotterell
143
86
0
01 Feb 2022
1