ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.08294
  4. Cited By
Inferflow: an Efficient and Highly Configurable Inference Engine for
  Large Language Models

Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

16 January 2024
Shuming Shi
Enbo Zhao
Deng Cai
Leyang Cui
Xinting Huang
Huayang Li
ArXivPDFHTML

Papers citing "Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models"

2 / 2 papers shown
Title
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Jinho Choi
Tony Q. S. Quek
Seong-Lyun Kim
9
0
0
17 May 2025
Locally Typical Sampling
Locally Typical Sampling
Clara Meister
Tiago Pimentel
Gian Wiher
Ryan Cotterell
143
86
0
01 Feb 2022
1