2503.10251
Cited By
Numerical Error Analysis of Large Language Models
13 March 2025
Stanislav Budzinskiy, Wenyi Fang, Longbin Zeng, Philipp Petersen
Papers citing
"Numerical Error Analysis of Large Language Models"
1 / 1 papers shown
Title
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Zihan Qiu, Zekun Wang, Bo Zheng, Zeyu Huang, Kaiyue Wen, ..., Fei Huang, Suozhi Huang, Dayiheng Liu, Jingren Zhou, Junyang Lin
MoE
10 May 2025