ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.08739
  4. Cited By
The Garden of Forking Paths: Observing Dynamic Parameters Distribution
  in Large Language Models

The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models

13 March 2024
Carlo Nicolini
Jacopo Staiano
Bruno Lepri
Raffaele Marino
    MoE
ArXivPDFHTML

Papers citing "The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models"

4 / 4 papers shown
Title
Enlightenment Period Improving DNN Performance
Enlightenment Period Improving DNN Performance
Tiantian Liu
Weishi Xu
Meng Wan
Jue Wang
33
0
0
02 Apr 2025
Omnigrok: Grokking Beyond Algorithmic Data
Omnigrok: Grokking Beyond Algorithmic Data
Ziming Liu
Eric J. Michaud
Max Tegmark
56
76
0
03 Oct 2022
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
256
1,996
0
31 Dec 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
1