Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.00853
Cited By
Accelerated AI Inference via Dynamic Execution Methods
30 October 2024
Haim Barad
Jascha Achterberg
Tien Pei Chou
Jean Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Accelerated AI Inference via Dynamic Execution Methods"
12 / 12 papers shown
Title
RouteLLM: Learning to Route LLMs with Preference Data
Isaac Ong
Amjad Almahairi
Vincent Wu
Wei-Lin Chiang
Tianhao Wu
Joseph E. Gonzalez
M. W. Kadous
Ion Stoica
130
104
0
26 Jun 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
142
165
0
26 Jan 2024
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Tianle Cai
Yuhong Li
Zhengyang Geng
Hongwu Peng
Jason D. Lee
De-huai Chen
Tri Dao
166
314
0
19 Jan 2024
Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO
Haim Barad
Ekaterina Aidova
Yury Gorbachev
SyDa
29
1
0
08 Nov 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
120
2,434
0
19 Dec 2022
Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan
Matan Kalman
Yossi Matias
LRM
151
736
0
30 Nov 2022
Diffusion Models for Video Prediction and Infilling
Tobias Höppe
Arash Mehrjou
Stefan Bauer
Didrik Nielsen
Andrea Dittadi
DiffM
VGen
95
138
0
15 Jun 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
211
1,642
0
07 Apr 2022
Patterns, predictions, and actions: A story about machine learning
Moritz Hardt
Benjamin Recht
SSL
AI4TS
AI4CE
84
32
0
10 Feb 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
229
700
0
24 Jan 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
639
4,921
0
23 Jan 2020
Green AI
Roy Schwartz
Jesse Dodge
Noah A. Smith
Oren Etzioni
121
1,149
0
22 Jul 2019
1