Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.12637
Cited By
Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation
17 April 2025
Linda He
Jue Wang
Maurice Weber
Shang Zhu
Ben Athiwaratkun
Ce Zhang
SyDa
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation"
1 / 1 papers shown
Title
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction
Jeffrey Willette
Heejun Lee
Sung Ju Hwang
12
0
0
16 May 2025
1