Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.04672
Cited By
v1
v2
v3 (latest)
No Language Left Behind: Scaling Human-Centered Machine Translation
11 July 2022
Nllb team
Marta R. Costa-jussá
James Cross
Onur cCelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
MoE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (31473★)
Papers citing
"No Language Left Behind: Scaling Human-Centered Machine Translation"
50 / 801 papers shown
Title
A Systematic Analysis of Subwords and Cross-Lingual Transfer in Multilingual Translation
Francois Meyer
Jan Buys
90
1
0
29 Mar 2024
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Nihar Ranjan Sahoo
Pranamya Prashant Kulkarni
Narjis Asad
Arif Ahmad
Tanu Goyal
Aparna Garimella
Pushpak Bhattacharyya
107
12
0
29 Mar 2024
KazParC: Kazakh Parallel Corpus for Machine Translation
Rustem Yeshpanov
Alina Polonskaya
H. A. Varol
104
2
0
28 Mar 2024
EthioMT: Parallel Corpus for Low-resource Ethiopian Languages
A. Tonja
Olga Kolesnikova
Alexander Gelbukh
Jugal Kalita
26
1
0
28 Mar 2024
Going Beyond Word Matching: Syntax Improves In-context Example Selection for Machine Translation
Chenming Tang
Zhixiang Wang
Hao Sun
51
1
0
28 Mar 2024
A Tulu Resource for Machine Translation
Manu Narayanan
Noemi Aepli
59
4
0
28 Mar 2024
The Role of
n
n
n
-gram Smoothing in the Age of Neural Networks
Luca Malagutti
Andrius Buinovskij
Anej Svete
Clara Meister
Afra Amini
Ryan Cotterell
84
7
0
25 Mar 2024
Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
Shannon Wotherspoon
William Hartmann
M. Snover
16
1
0
25 Mar 2024
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
Shaoxiong Ji
Timothee Mickus
Vincent Segonne
Jörg Tiedemann
CLL
85
4
0
25 Mar 2024
LLMs Are Few-Shot In-Context Low-Resource Language Learners
Samuel Cahyawijaya
Holy Lovenia
Pascale Fung
103
49
0
25 Mar 2024
Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning
Changtong Zan
Liang Ding
Li Shen
Yibing Zhen
Weifeng Liu
Dacheng Tao
79
9
0
21 Mar 2024
A New Massive Multilingual Dataset for High-Performance Language Technologies
Ona de Gibert
Graeme Nail
Nikolay Arefyev
Marta Bañón
Jelmer van der Linde
...
Gema Ramírez-Sánchez
Andrey Kutuzov
S. Pyysalo
Stephan Oepen
Jörg Tiedemann
VLM
102
25
0
20 Mar 2024
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation
A. Tonja
Israel Abebe Azime
Tadesse Destaw Belay
M. Yigezu
Moges Ahmed Mehamed
...
Olga Kolesnikova
Philipp Slusallek
Dietrich Klakow
Shengwu Xiong
Seid Muhie Yimam
130
8
0
20 Mar 2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
Shivam Mhaskar
Nirmesh J. Shah
Mohammadi Zaki
Ashish Gudmalwar
Pankaj Wasnik
R. Shah
64
2
0
20 Mar 2024
Pretraining Language Models Using Translationese
Meet Doshi
Raj Dabre
Pushpak Bhattacharyya
SyDa
90
2
0
20 Mar 2024
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
Jiaxin Guo
Hao Yang
Zongyao Li
Daimeng Wei
Hengchao Shang
Xiaoyu Chen
98
7
0
18 Mar 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Fahim Faisal
Orevaoghene Ahia
Aarohi Srivastava
Kabir Ahuja
David Chiang
Yulia Tsvetkov
Antonios Anastasopoulos
93
32
0
16 Mar 2024
Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That!
Niyati Bafna
Philipp Koehn
David Yarowsky
109
1
0
16 Mar 2024
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
Tomasz Limisiewicz
Terra Blevins
Hila Gonen
Orevaoghene Ahia
Luke Zettlemoyer
90
17
0
15 Mar 2024
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models
Chaoqun Liu
Wenxuan Zhang
Yiran Zhao
Anh Tuan Luu
Lidong Bing
LRM
123
14
0
15 Mar 2024
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Jiahuan Li
Shanbo Cheng
Shujian Huang
Jiajun Chen
73
7
0
14 Mar 2024
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs
Ben Athiwaratkun
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Haifeng Qian
Hantian Ding
...
Liangfu Chen
Parminder Bhatia
Ramesh Nallapati
Sudipta Sengupta
Bing Xiang
88
4
0
13 Mar 2024
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
Timothee Mickus
Elaine Zosa
Raúl Vázquez
Teemu Vahtola
Jörg Tiedemann
Vincent Segonne
Alessandro Raganato
Marianna Apidianaki
HILM
LRM
73
22
0
12 Mar 2024
ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation
Shaojie Dai
Xin Liu
Ping Luo
Yue Yu
LRM
64
1
0
11 Mar 2024
To Err Is Human, but Llamas Can Learn It Too
Agnes Luhtaru
Taido Purason
Martin Vainikko
Maksym Del
Mark Fishel
SyDa
ALM
98
2
0
08 Mar 2024
A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
Qusai Abo Obaidah
Muhy Eddin Za'ter
Adnan Jaljuli
Ali Mahboub
Asma Hakouz
Bashar Alfrou
Yazan Estaitia
56
1
0
07 Mar 2024
Did Translation Models Get More Robust Without Anyone Even Noticing?
Ben Peters
André F. T. Martins
66
3
0
06 Mar 2024
From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models
Luiza Amador Pozzobon
Patrick Lewis
Sara Hooker
Beyza Ermis
97
12
0
06 Mar 2024
General2Specialized LLMs Translation for E-commerce
Kaidi Chen
Ben Chen
Dehong Gao
Huangyu Dai
Wen Jiang
Wei Ning
Shanqing Yu
Libin Yang
Xiaoyan Cai
31
8
0
06 Mar 2024
Design of an Open-Source Architecture for Neural Machine Translation
Séamus Lankford
Haithem Afli
Andy Way
90
0
0
06 Mar 2024
adaptMLLM: Fine-Tuning Multilingual Language Models on Low-Resource Languages with Integrated LLM Playgrounds
Séamus Lankford
Haithem Afli
Andy Way
70
31
0
04 Mar 2024
Language and Speech Technology for Central Kurdish Varieties
Sina Ahmadi
Daban Q. Jaff
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
99
2
0
04 Mar 2024
adaptNMT: an open-source, language-agnostic development environment for Neural Machine Translation
Séamus Lankford
Haithem Afli
Andy Way
73
3
0
04 Mar 2024
NusaBERT: Teaching IndoBERT to be Multilingual and Multicultural
Wilson Wongso
David Samuel Setiawan
Steven Limcorn
Ananto Joyoadikusumo
58
1
0
04 Mar 2024
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks
Fakhraddin Alwajih
El Moatez Billah Nagoudi
Gagan Bhatia
Abdelrahman Mohamed
Muhammad Abdul-Mageed
VLM
LRM
83
16
0
01 Mar 2024
A Bit of a Problem: Measurement Disparities in Dataset Sizes Across Languages
Catherine Arnett
Tyler A. Chang
Benjamin Bergen
67
5
0
01 Mar 2024
Robust Guidance for Unsupervised Data Selection: Capturing Perplexing Named Entities for Domain-Specific Machine Translation
Seunghyun Ji
H. R. Sinulingga
Darongsae Kwon
102
1
0
29 Feb 2024
Teaching Large Language Models an Unseen Language on the Fly
Chen Zhang
Xiao Liu
Jiuheng Lin
Yansong Feng
96
21
0
29 Feb 2024
Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains
Vilém Zouhar
Shuoyang Ding
Anna Currey
Tatyana Badeka
Jenyuan Wang
Brian Thompson
74
17
0
28 Feb 2024
Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions
Kexun Zhang
Yee Man Choi
Zhenqiao Song
Taiqi He
Wenjie Wang
Lei Li
77
17
0
28 Feb 2024
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
Giuseppe Attanasio
Beatrice Savoldi
Dennis Fucci
Dirk Hovy
92
9
0
28 Feb 2024
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Duarte M. Alves
José P. Pombal
Nuno M. Guerreiro
Pedro H. Martins
Joao Alves
...
Patrick Fernandes
Sweta Agrawal
Pierre Colombo
José G. C. de Souza
André F.T. Martins
LRM
127
157
0
27 Feb 2024
Information Flow Routes: Automatically Interpreting Language Models at Scale
Javier Ferrando
Elena Voita
119
41
0
27 Feb 2024
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Mikayel Samvelyan
Sharath Chandra Raparthy
Andrei Lupu
Eric Hambro
Aram H. Markosyan
...
Minqi Jiang
Jack Parker-Holder
Jakob Foerster
Tim Rocktaschel
Roberta Raileanu
SyDa
117
89
0
26 Feb 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
Renren Jin
Jiangcun Du
Wuwei Huang
Wei Liu
Jian Luan
Bin Wang
Deyi Xiong
MQ
109
37
0
26 Feb 2024
TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement
Zhaopeng Feng
Yan Zhang
Hao Li
Bei Wu
Jiayu Liao
Wenqiang Liu
Jun Lang
Yang Feng
Jian Wu
Zuozhu Liu
LRM
138
15
0
26 Feb 2024
Direct Punjabi to English speech translation using discrete units
Prabhjot Kaur
L. A. M. Bush
Weisong Shi
60
0
0
25 Feb 2024
Fine-tuning Large Language Models for Domain-specific Machine Translation
Jiawei Zheng
Hanghai Hong
Xiaoli Wang
Jingsong Su
Yonggui Liang
Shikai Wu
ALM
69
41
0
23 Feb 2024
Unintended Impacts of LLM Alignment on Global Representation
Michael Joseph Ryan
William B. Held
Diyi Yang
116
42
0
22 Feb 2024
PALO: A Polyglot Large Multimodal Model for 5B People
Muhammad Maaz
H. Rasheed
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakal
Rao M. Anwer
Timothy Baldwin
Michael Felsberg
Fahad S. Khan
VLM
LRM
142
15
0
22 Feb 2024
Previous
1
2
3
...
8
9
10
...
15
16
17
Next