Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.15961
Cited By
Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation
24 October 2023
Szymon Antoniak
Sebastian Jaszczur
Michal Krutul
Maciej Pióro
Jakub Krajewski
Jan Ludziejewski
Tomasz Odrzygó'zd'z
Marek Cygan
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation"
3 / 3 papers shown
Title
Mixture of Parrots: Experts improve memorization more than reasoning
Samy Jelassi
Clara Mohri
David Brandfonbrener
Alex Gu
Nikhil Vyas
Nikhil Anand
David Alvarez-Melis
Yuanzhi Li
Sham Kakade
Eran Malach
MoE
41
4
0
24 Oct 2024
Mixture-of-Experts with Expert Choice Routing
Yan-Quan Zhou
Tao Lei
Han-Chu Liu
Nan Du
Yanping Huang
Vincent Zhao
Andrew M. Dai
Zhifeng Chen
Quoc V. Le
James Laudon
MoE
160
333
0
18 Feb 2022
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
266
4,532
0
23 Jan 2020
1