2207.04672
Cited By
v1
v2
v3 (latest)
No Language Left Behind: Scaling Human-Centered Machine Translation
11 July 2022
NLLB Team
Marta R. Costa-jussá
James Cross
Onur Çelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
MoE
ArXiv (abs)
PDF
HTML
Github (31473★)
Papers citing
"No Language Left Behind: Scaling Human-Centered Machine Translation"
50 / 801 papers shown
Title
Improving Multi-lingual Alignment Through Soft Contrastive Learning
Minsu Park
Seyeon Choi
Chanyeol Choi
Junseong Kim
Jy-yong Sohn
111
3
0
25 May 2024
Aya 23: Open Weight Releases to Further Multilingual Progress
Viraat Aryabumi
John Dang
Dwarak Talupuru
Saurabh Dash
David Cairuz
...
Aidan Gomez
Phil Blunsom
Marzieh Fadaee
Ahmet Üstün
Sara Hooker
OSLM
107
86
0
23 May 2024
Why Not Transform Chat Large Language Models to Non-English?
Xiang Geng
Ming Zhu
Jiahuan Li
Zhejian Lai
Wei Zou
...
Xinglin Lyu
Min Zhang
Jiajun Chen
Hao Yang
Shujian Huang
68
2
0
22 May 2024
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Corinne Aars
Lauren Adams
Xiaokan Tian
Zhaoyu Wang
Colton Wismer
Jason Wu
Pablo Rivas
Korn Sooksatra
Matthew Fendt
43
0
0
22 May 2024
OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models
Zhaojian Yu
Yinghao Wu
Zhuotao Deng
Yansong Tang
Xiao-Ping Zhang
82
2
0
21 May 2024
FAME-MT Dataset: Formality Awareness Made Easy for Machine Translation Purposes
Dawid Wiśniewski
Zofia Rostek
Artur Nowakowski
115
0
0
20 May 2024
Chasing COMET: Leveraging Minimum Bayes Risk Decoding for Self-Improving Machine Translation
Kamil Guttmann
Mikołaj Pokrywka
Adrian Charkiewicz
Artur Nowakowski
109
5
0
20 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
184
27
0
20 May 2024
LexGen: Domain-aware Multilingual Lexicon Generation
Ayush Maheshwari
A. Singh
Krishnakant Bhatt
Preethi Jyothi
Ganesh Ramakrishnan
67
1
0
18 May 2024
MarkLLM: An Open-Source Toolkit for LLM Watermarking
Leyi Pan
Aiwei Liu
Zhiwei He
Zitian Gao
Xuandong Zhao
...
Shuliang Liu
Xuming Hu
Lijie Wen
Irwin King
Philip S. Yu
136
37
0
16 May 2024
Facilitating Opinion Diversity through Hybrid NLP Approaches
Michiel van der Meer
87
2
0
15 May 2024
Word Alignment as Preference for Machine Translation
Qiyu Wu
Masaaki Nagata
Zhongtao Miao
Yoshimasa Tsuruoka
92
6
0
15 May 2024
A Japanese-Chinese Parallel Corpus Using Crowdsourcing for Web Mining
Masaaki Nagata
Makoto Morishita
Katsuki Chousa
Norihito Yasuda
39
2
0
15 May 2024
CANTONMT: Investigating Back-Translation and Model-Switch Mechanisms for Cantonese-English Neural Machine Translation
Kung Yin Hong
Lifeng Han
Riza Batista-Navarro
Goran Nenadic
78
0
0
13 May 2024
Control Token with Dense Passage Retrieval
Juhwan Lee
Jisu Kim
52
0
0
13 May 2024
Using Machine Translation to Augment Multilingual Classification
Adam King
83
0
0
09 May 2024
Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
Nathaniel R. Robinson
Raj Dabre
Ammon Shurtz
Rasul Dent
Onenamiyi Onesi
...
Matthew Dean Stutzman
Bismarck Odoom
Sanjeev Khudanpur
Stephen D. Richardson
Kenton Murray
MoE
101
8
0
08 May 2024
XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples
Peiqin Lin
André F. T. Martins
Hinrich Schütze
RALM
144
4
0
08 May 2024
SUTRA: Scalable Multilingual Language Model Architecture
Abhijit Bendale
Michael Sapienza
Steven Ripplinger
Simon Gibbs
Jaewon Lee
Pranav Mistry
LRM
ELM
71
5
0
07 May 2024
Quantifying the Capabilities of LLMs across Scale and Precision
Sher Badshah
Hassan Sajjad
74
14
0
06 May 2024
Toxicity Classification in Ukrainian
Daryna Dementieva
Valeriia Khylenko
N. Babakov
Georg Groh
60
5
0
27 Apr 2024
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages
Harman Singh
Nitish Gupta
Shikhar Bharadwaj
Dinesh Tewari
Partha P. Talukdar
ELM
84
28
0
25 Apr 2024
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan
Xinyi Yang
Derek F. Wong
Lidia S. Chao
Yue Zhang
128
10
0
25 Apr 2024
Translation of Multifaceted Data without Re-Training of Machine Translation Systems
Hyeonseok Moon
Seungyoon Lee
Seongtae Hong
Seungjun Lee
Chanjun Park
Heu-Jeoung Lim
59
0
0
25 Apr 2024
Setting up the Data Printer with Improved English to Ukrainian Machine Translation
Yurii Paniv
Dmytro Chaplynskyi
Nikita Trynus
Volodymyr Kyrylov
AI4CE
92
2
0
23 Apr 2024
RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
Adrian de Wynter
Ishaan Watts
Nektar Ege Altıntoprak
Tua Wongsangaroonsri
Minghui Zhang
...
Anna Vickers
Stéphanie Visser
Herdyan Widarmanto
A. Zaikin
Si-Qing Chen
LM&MA
92
21
0
22 Apr 2024
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
Maxim Enis
Mark Hopkins
93
44
0
22 Apr 2024
Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration
Yi-Chong Huang
Xiaocheng Feng
Baohang Li
Yang Xiang
Hui Wang
Bing Qin
Ting Liu
FedML
97
30
0
19 Apr 2024
A Preference-driven Paradigm for Enhanced Translation with Large Language Models
D. Zhu
Sony Trenous
Xiaoyu Shen
Dietrich Klakow
Bill Byrne
Eva Hasler
105
3
0
17 Apr 2024
Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Shaomu Tan
Di Wu
Christof Monz
MoMe
113
9
0
17 Apr 2024
Many-Shot In-Context Learning
Rishabh Agarwal
Avi Singh
Lei M. Zhang
Bernd Bohnet
Luis Rosias
...
John D. Co-Reyes
Eric Chu
Feryal M. P. Behbahani
Aleksandra Faust
Hugo Larochelle
ReLM
OffRL
BDL
137
121
0
17 Apr 2024
Data-Augmentation-Based Dialectal Adaptation for LLMs
Fahim Faisal
Antonios Anastasopoulos
83
3
0
11 Apr 2024
Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars
Andrés Lou
Juan Antonio Pérez-Ortiz
Felipe Sánchez-Martínez
Víctor M. Sánchez-Cartagena
31
1
0
11 Apr 2024
Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
Iker García-Ferrero
Rodrigo Agerri
Aitziber Atutxa Salazar
Elena Cabrio
Iker de la Iglesia
...
Johana Ramirez-Romero
German Rigau
J. M. Villa-Gonzalez
S. Villata
Andrea Zaninello
122
21
0
11 Apr 2024
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
206
6
0
11 Apr 2024
Language-Independent Representations Improve Zero-Shot Summarization
V. Solovyev
Danni Liu
Jan Niehues
78
0
0
08 Apr 2024
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
Ahmad Idrissi-Yaghir
Amin Dada
Henning Schäfer
Kamyar Arzideh
Giulia Baldini
...
Peter A. Horn
Christin Seifert
F. Nensa
Jens Kleesiek
Christoph M. Friedrich
AI4MH
73
3
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
169
38
0
07 Apr 2024
Low-Resource Machine Translation through Retrieval-Augmented LLM Prompting: A Study on the Mambai Language
Raphael Merx
Aso Mahmudi
Katrina Langford
Leo Alberto de Araujo
Ekaterina Vylomova
68
9
0
07 Apr 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
75
2
0
06 Apr 2024
Language Models as Critical Thinking Tools: A Case Study of Philosophers
Andre Ye
Jared Moore
Rose Novick
Amy X. Zhang
KELM
ELM
LRM
LLMAG
56
10
0
06 Apr 2024
Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation
Tong Su
Xin Peng
Sarubi Thillainathan
David Guzmán
Surangika Ranathunga
En-Shiun Annie Lee
65
3
0
05 Apr 2024
Sailor: Open Language Models for South-East Asia
Longxu Dou
Qian Liu
Guangtao Zeng
Jia Guo
Jiahui Zhou
Wei Lu
Min Lin
LRM
106
9
0
04 Apr 2024
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
Shijia Zhou
Huangyan Shan
Barbara Plank
Robert Litschko
73
2
0
03 Apr 2024
ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model
Osvaldo Luamba Quinjica
David Ifeoluwa Adelani
65
1
0
03 Apr 2024
CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models
Zaid A. W. Sheikh
Antonios Anastasopoulos
Shruti Rijhwani
Lindia Tjuatja
Robbie Jimerson
Graham Neubig
52
1
0
03 Apr 2024
Backdoor Attack on Multilingual Machine Translation
Jun Wang
Xingliang Yuan
Xuanli He
Benjamin I. P. Rubinstein
Trevor Cohn
66
6
0
03 Apr 2024
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
Gaurish Thakkar
Sherzod Hakimov
Marko Tadić
27
4
0
02 Apr 2024
Poro 34B and the Blessing of Multilinguality
Risto Luukkonen
Jonathan Burdge
Elaine Zosa
Aarne Talman
Ville Komulainen
Väinö Hatanpää
Peter Sarlin
S. Pyysalo
AI4CE
96
14
0
02 Apr 2024
AAdaM at SemEval-2024 Task 1: Augmentation and Adaptation for Multilingual Semantic Textual Relatedness
Miaoran Zhang
Mingyang Wang
Jesujoba Oluwadara Alabi
Dietrich Klakow
VLM
86
6
0
01 Apr 2024