ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.14704
  4. Cited By
Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data

Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data

17 June 2025
Anton Changalidis
Aki Härmä
ArXiv (abs)PDFHTML

Papers citing "Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data"

3 / 3 papers shown
Title
A Study on ReLU and Softmax in Transformer
A Study on ReLU and Softmax in Transformer
Kai Shen
Junliang Guo
Xuejiao Tan
Siliang Tang
Rui Wang
Jiang Bian
89
56
0
13 Feb 2023
Deep Learning using Rectified Linear Units (ReLU)
Deep Learning using Rectified Linear Units (ReLU)
Abien Fred Agarap
74
3,229
0
22 Mar 2018
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,260
0
22 Dec 2014
1