ResearchTrend.AI
arXiv:2404.12362
Transformer tricks: Removing weights for skipless transformers

18 April 2024
Nils Graef

Papers citing "Transformer tricks: Removing weights for skipless transformers"

2 / 2 papers shown
Flash normalization: fast normalization for LLMs
Nils Graef, Matthew Clapp, Andrew Wasielewski
12 Jul 2024
Transformer tricks: Precomputing the first layer
Nils Graef
MoE
20 Feb 2024