Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.11446
Cited By
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
8 December 2021
Jack W. Rae
Sebastian Borgeaud
Trevor Cai
Katie Millican
Jordan Hoffmann
Francis Song
John Aslanides
Sarah Henderson
Roman Ring
Susannah Young
Eliza Rutherford
Tom Hennigan
Jacob Menick
Albin Cassirer
Richard Powell
George van den Driessche
Lisa Anne Hendricks
Maribeth Rauh
Po-Sen Huang
Amelia Glaese
Johannes Welbl
Sumanth Dathathri
Saffron Huang
J. Uesato
John F. J. Mellor
I. Higgins
Antonia Creswell
Nat McAleese
Amy Wu
Erich Elsen
Siddhant M. Jayakumar
Elena Buchatskaya
David Budden
Esme Sutherland
Karen Simonyan
Michela Paganini
Laurent Sifre
Lena Martens
Xiang Lorraine Li
A. Kuncoro
Aida Nematzadeh
E. Gribovskaya
Domenic Donato
Angeliki Lazaridou
A. Mensch
Jean-Baptiste Lespiau
Maria Tsimpoukelli
N. Grigorev
Doug Fritz
Thibault Sottiaux
Mantas Pajarskas
Tobias Pohlen
Z. Gong
Daniel Toyama
Cyprien de Masson dÁutume
Yujia Li
Tayfun Terzi
Vladimir Mikulik
Igor Babuschkin
Aidan Clark
Diego de Las Casas
Aurelia Guy
Chris Jones
James Bradbury
Matthew J. Johnson
Blake A. Hechtman
Laura Weidinger
Iason Gabriel
William S. Isaac
Edward Lockhart
Simon Osindero
Laura Rimell
Chris Dyer
Oriol Vinyals
Kareem W. Ayoub
Jeff Stanway
L. Bennett
Demis Hassabis
Koray Kavukcuoglu
G. Irving
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling Language Models: Methods, Analysis & Insights from Training Gopher"
4 / 54 papers shown
Title
FLM-101B: An Open LLM and How to Train It with
100
K
B
u
d
g
e
t
100K Budget
100
K
B
u
d
g
e
t
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
75
22
0
07 Sep 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRM
OSLM
139
439
0
18 Aug 2023
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo
Can Xu
Pu Zhao
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELM
SyDa
ALM
68
665
0
14 Jun 2023
On the Creativity of Large Language Models
Giorgio Franceschelli
Mirco Musolesi
128
55
0
27 Mar 2023
Previous
1
2