Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.14704
Cited By
Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data
17 June 2025
Anton Changalidis
Aki Härmä
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data"
3 / 3 papers shown
Title
A Study on ReLU and Softmax in Transformer
Kai Shen
Junliang Guo
Xuejiao Tan
Siliang Tang
Rui Wang
Jiang Bian
89
56
0
13 Feb 2023
Deep Learning using Rectified Linear Units (ReLU)
Abien Fred Agarap
74
3,229
0
22 Mar 2018
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,260
0
22 Dec 2014
1