2505.17420
Cited By
DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies
23 May 2025
Ning Yang
Fangxin Liu
Junjie Wang
Tao Yang
Kan Liu
Haibing Guan
Li Jiang
AI4CE
Papers citing
"DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies"
6 / 6 papers shown
Title
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference
Zhuomin He
Yizhen Yao
Pengfei Zuo
Bin Gao
Qinya Li
Zhenzhe Zheng
Fan Wu
72
1
0
04 Jan 2025
Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy
Yijin Liu
Fandong Meng
Jie Zhou
AI4CE
50
9
0
10 Apr 2024
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Luciano Del Corro
Allison Del Giorno
Sahaj Agarwal
Ting Yu
Ahmed Hassan Awadallah
Subhabrata Mukherjee
93
59
0
05 Jul 2023
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
743
41,932
0
28 May 2020
Get To The Point: Summarization with Pointer-Generator Networks
A. See
Peter J. Liu
Christopher D. Manning
3DPC
293
4,019
0
14 Apr 2017
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
308
2,854
0
26 Sep 2016