Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.08892
Cited By
Characterizing Prompt Compression Methods for Long Context Inference
11 July 2024
Siddharth Jha
Lutfi Eren Erdogan
Sehoon Kim
Kurt Keutzer
A. Gholami
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Characterizing Prompt Compression Methods for Long Context Inference"
4 / 4 papers shown
Title
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
136
0
0
23 Apr 2025
SnapKV: LLM Knows What You are Looking for Before Generation
Yuhong Li
Yingbing Huang
Bowen Yang
Bharat Venkitesh
Acyr F. Locatelli
Hanchen Ye
Tianle Cai
Patrick Lewis
Deming Chen
VLM
77
153
0
22 Apr 2024
PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression
Muhammad Asif Ali
Zhengping Li
Shu Yang
Keyuan Cheng
Yang Cao
Tianhao Huang
Lijie Hu
Lu Yu
Di Wang
VLM
RALM
38
9
0
30 Mar 2024
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
Huiqiang Jiang
Qianhui Wu
Xufang Luo
Dongsheng Li
Chin-Yew Lin
Yuqing Yang
Lili Qiu
RALM
112
183
0
10 Oct 2023
1