Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.08739
Cited By
The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models
13 March 2024
Carlo Nicolini
Jacopo Staiano
Bruno Lepri
Raffaele Marino
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models"
4 / 4 papers shown
Title
Enlightenment Period Improving DNN Performance
Tiantian Liu
Weishi Xu
Meng Wan
Jue Wang
33
0
0
02 Apr 2025
Omnigrok: Grokking Beyond Algorithmic Data
Ziming Liu
Eric J. Michaud
Max Tegmark
56
76
0
03 Oct 2022
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
267
1,996
0
31 Dec 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
1