2506.02572
Cited By
HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference
3 June 2025
Ping Gong
Jiawei Yi
Shengnan Wang
Juncheng Zhang
Zewen Jin
Ouxiang Zhou
Ruibo Liu
Guanbin Xu
Youhui Bai
Bowen Ye
Kun Yuan
Tong Yang
Gong Zhang
Renhai Chen
Feng Wu
Cheng Li
Papers citing "HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference"
No papers