Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.07365
Cited By
LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation
20 February 2025
Zican Dong
Junyi Li
Jinhao Jiang
Mingyu Xu
Wayne Xin Zhao
B. Wang
Weipeng Chen
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation"
2 / 2 papers shown
Title
CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Han Peng
Jinhao Jiang
Zican Dong
Wayne Xin Zhao
Lei Fang
RALM
23
0
0
15 May 2025
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Z. Qiu
Z. Wang
Bo Zheng
Zeyu Huang
Kaiyue Wen
...
Fei Huang
Suozhi Huang
Dayiheng Liu
Jingren Zhou
Junyang Lin
MoE
23
0
0
10 May 2025
1