Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.09650
Cited By
Scaling Laws for Multilingual Neural Machine Translation
19 February 2023
Patrick Fernandes
Behrooz Ghorbani
Xavier Garcia
Markus Freitag
Orhan Firat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling Laws for Multilingual Neural Machine Translation"
25 / 25 papers shown
Title
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMe
ALM
19
0
0
16 May 2025
Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models
Julian Spravil
Sebastian Houben
Sven Behnke
VLM
75
0
0
12 Mar 2025
(Mis)Fitting: A Survey of Scaling Laws
Margaret Li
Sneha Kudugunta
Luke Zettlemoyer
69
2
0
26 Feb 2025
Scaling Laws for Downstream Task Performance in Machine Translation
Berivan Isik
Natalia Ponomareva
Hussein Hazimeh
Dimitris Paparas
Sergei Vassilvitskii
Sanmi Koyejo
108
3
0
24 Feb 2025
Scaling Laws for Multilingual Language Models
Yifei He
Alon Benhaim
Barun Patra
Praneetha Vaddamanu
Sanchit Ahuja
Parul Chopra
Vishrav Chaudhary
Han Zhao
Xia Song
30
4
0
15 Oct 2024
Scaling Optimal LR Across Token Horizons
Johan Bjorck
Alon Benhaim
Vishrav Chaudhary
Furu Wei
Xia Song
54
4
0
30 Sep 2024
EuroLLM: Multilingual Language Models for Europe
Pedro Henrique Martins
Patrick Fernandes
Joao Alves
Nuno M. Guerreiro
Ricardo Rei
...
Pierre Colombo
Barry Haddow
José G. C. de Souza
Alexandra Birch
André F. T. Martins
29
16
0
24 Sep 2024
Do Neural Scaling Laws Exist on Graph Self-Supervised Learning?
Qian Ma
Haitao Mao
Jingzhe Liu
Zhehua Zhang
Chunlin Feng
Yu Song
Yihan Shao
Yao Ma
40
3
0
20 Aug 2024
Reconciling Kaplan and Chinchilla Scaling Laws
Tim Pearce
Jinyeop Song
34
8
0
12 Jun 2024
LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation
Yongjing Yin
Jiali Zeng
Yafu Li
Fandong Meng
Yue Zhang
41
1
0
03 Jun 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
52
124
0
27 Feb 2024
Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Haowei Lin
Baizhou Huang
Haotian Ye
Qinyu Chen
Zihao Wang
Sujian Li
Jianzhu Ma
Xiaojun Wan
James Zou
Yitao Liang
84
20
0
04 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
C´eline Hudelot
Pierre Colombo
52
32
0
01 Feb 2024
The Universal Statistical Structure and Scaling Laws of Chaos and Turbulence
Noam Levi
Yaron Oz
AI4CE
26
1
0
02 Nov 2023
A Benchmark for Learning to Translate a New Language from One Grammar Book
Garrett Tanzer
Mirac Suzgun
Chenguang Xi
Dan Jurafsky
Luke Melas-Kyriazi
24
51
0
28 Sep 2023
The Underlying Scaling Laws and Universal Statistical Structure of Complex Datasets
Noam Levi
Yaron Oz
27
4
0
26 Jun 2023
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Ruochen Zhang
Samuel Cahyawijaya
Jan Christian Blaise Cruz
Genta Indra Winata
Alham Fikri Aji
LRM
33
50
0
23 May 2023
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale
Christos Baziotis
Biao Zhang
Alexandra Birch
Barry Haddow
30
2
0
23 May 2023
On the Pareto Front of Multilingual Neural Machine Translation
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
MoE
23
5
0
06 Apr 2023
Causes and Cures for Interference in Multilingual Translation
Uri Shaham
Maha Elbayad
Vedanuj Goswami
Omer Levy
Shruti Bhosale
23
26
0
14 Dec 2022
Do Current Multi-Task Optimization Methods in Deep Learning Even Help?
Derrick Xin
Behrooz Ghorbani
Ankush Garg
Orhan Firat
Justin Gilmer
MoMe
73
63
0
23 Sep 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
213
1,657
0
15 Oct 2021
Learning Curve Theory
Marcus Hutter
137
58
0
08 Feb 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
240
4,469
0
23 Jan 2020
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAML
AIMat
221
1,208
0
12 Jun 2017
1