Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.11594
Cited By
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
16 May 2025
Jintao Zhang
Jia Wei
Pengle Zhang
Xiaoming Xu
Haofeng Huang
Haoxu Wang
Kai Jiang
Jun Zhu
Jianfei Chen
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training"
2 / 2 papers shown
Title
SageAttention2++: A More Efficient Implementation of SageAttention2
Jintao Zhang
Xiaoming Xu
Jia Wei
Haofeng Huang
Pengle Zhang
Chendong Xiang
Jun Zhu
Jianfei Chen
MQ
VLM
79
6
0
27 May 2025
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Tianyu Fu
Yi Ge
Yichen You
Enshu Liu
Zhihang Yuan
Guohao Dai
Shengen Yan
Huazhong Yang
Yu Wang
MoE
LRM
62
0
0
27 May 2025
1