ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.19947
  4. Cited By
Inference acceleration for large language models using "stairs" assisted
  greedy generation

Inference acceleration for large language models using "stairs" assisted greedy generation

29 July 2024
Domas Grigaliunas
M. Lukoševičius
ArXivPDFHTML

Papers citing "Inference acceleration for large language models using "stairs" assisted greedy generation"

2 / 2 papers shown
Title
Fast Inference from Transformers via Speculative Decoding
Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan
Matan Kalman
Yossi Matias
LRM
79
663
0
30 Nov 2022
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional
  Neural Networks for Extreme Summarization
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
104
1,652
0
27 Aug 2018
1