No Language Left Behind: Scaling Human-Centered Machine Translation

11 July 2022
NLLB Team
Marta R. Costa-jussá
James Cross
Onur Çelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
    MoE

Papers citing "No Language Left Behind: Scaling Human-Centered Machine Translation"

50 / 801 papers shown
T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks
Iker García-Ferrero
Rodrigo Agerri
German Rigau
102
16
0
20 Dec 2022
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Ananya B. Sai
Vignesh Nagarajan
Tanay Dixit
Raj Dabre
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
133
24
0
20 Dec 2022
Synthetic Pre-Training Tasks for Neural Machine Translation
Zexue He
Graeme W. Blackwood
Yikang Shen
Julian McAuley
Rogerio Feris
54
4
0
19 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
84
31
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
127
50
0
19 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
Siqi Ouyang
Rong Ye
Lei Li
104
28
0
19 Dec 2022
Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data
Maxime Darrin
Pablo Piantanida
Pierre Colombo
OODD
222
15
0
18 Dec 2022
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Mingda Chen
Paul-Ambroise Duquenne
Pierre Yves Andrews
Justine T. Kao
Alexandre Mourachko
Holger Schwenk
Marta R. Costa-jussá
65
18
0
16 Dec 2022
Multi-VALUE: A Framework for Cross-Dialectal English NLP
Caleb Ziems
William B. Held
Jingfeng Yang
Jwala Dhamala
Rahul Gupta
Diyi Yang
131
44
0
15 Dec 2022
Causes and Cures for Interference in Multilingual Translation
Uri Shaham
Maha Elbayad
Vedanuj Goswami
Omer Levy
Shruti Bhosale
97
26
0
14 Dec 2022
The Massively Multilingual Natural Language Understanding 2022 (MMNLU-22) Workshop and Competition
C. Hench
Charith Peris
Jack G. M. FitzGerald
Kay Rottmann
76
3
0
13 Dec 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
Sumanth Doddapaneni
Rahul Aralikatte
Gowtham Ramesh
Shreyansh Goyal
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
ELM
107
86
0
11 Dec 2022
Demystifying Prompts in Language Models via Perplexity Estimation
Hila Gonen
Srini Iyer
Terra Blevins
Noah A. Smith
Luke Zettlemoyer
LRM
156
214
0
08 Dec 2022
Frustratingly Easy Label Projection for Cross-lingual Transfer
Yang Chen
Chao Jiang
Alan Ritter
Wei Xu
97
32
0
28 Nov 2022
Artificial Interrogation for Attributing Language Models
Farhan Dhanani
Muhammad Rafi
34
1
0
20 Nov 2022
Towards Building Text-To-Speech Systems for the Next Billion Users
Gokul Karthik Kumar
Praveen S. V.
Pratyush Kumar
Mitesh M. Khapra
Karthik Nandakumar
92
22
0
17 Nov 2022
Hierarchical Phrase-based Sequence-to-Sequence Learning
Bailin Wang
Ivan Titov
Jacob Andreas
Yoon Kim
68
7
0
15 Nov 2022
High-Resource Methodological Bias in Low-Resource Investigations
Maartje ter Hoeve
David Grangier
Natalie Schluter
78
2
0
14 Nov 2022
Speech-to-Speech Translation For A Real-world Unwritten Language
Peng-Jen Chen
Ke M. Tran
Yilin Yang
Jingfei Du
Justine T. Kao
...
Sravya Popuri
Changhan Wang
J. Pino
Wei-Ning Hsu
Ann Lee
93
26
0
11 Nov 2022
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Paul-Ambroise Duquenne
Hongyu Gong
Ning Dong
Jingfei Du
Ann Lee
Vedanuj Goswami
Changhan Wang
J. Pino
Benoît Sagot
Holger Schwenk
102
38
0
08 Nov 2022
Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation
Danni Liu
Jan Niehues
37
5
0
02 Nov 2022
ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics
Chantal Amrhein
Nikita Moghe
Liane Guillou
ELM
106
23
0
27 Oct 2022
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine Translation Models
Mathieu Grosso
Pirashanth Ratnamogan
Alexis Mathey
William Vanhuffel
Michael Fotso Fotso
54
3
0
26 Oct 2022
Contrastive Search Is What You Need For Neural Text Generation
Yixuan Su
Nigel Collier
91
53
0
25 Oct 2022
Joint Speech Translation and Named Entity Recognition
Marco Gaido
Sara Papi
Matteo Negri
Marco Turchi
89
3
0
21 Oct 2022
University of Cape Town's WMT22 System: Multilingual Machine Translation for Southern African Languages
Khalid N. Elmadani
Francois Meyer
Jan Buys
52
2
0
21 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
138
2
0
21 Oct 2022
The VolcTrans System for WMT22 Multilingual Machine Translation Task
Xian Qian
Kai Hu
Jiaqiang Wang
Yifeng Liu
Xingyuan Pan
Jun Cao
Mingxuan Wang
89
1
0
20 Oct 2022
Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages
Idris Abdulmumin
Michael Beukman
Jesujoba Oluwadara Alabi
Chris C. Emezue
Everlyn Asiko
...
Shamsuddeen Hassan Muhammad
Mofetoluwa Adeyemi
Oreen Yousuf
Sahib Singh
T. Gwadabe
105
9
0
19 Oct 2022
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages
Wenxiang Jiao
Zhaopeng Tu
Jiarui Li
Wenxuan Wang
Jen-tse Huang
Shuming Shi
93
15
0
18 Oct 2022
SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment
Abdullatif Köksal
Silvia Severini
Hinrich Schütze
73
0
0
12 Oct 2022
Investigating Massive Multilingual Pre-Trained Machine Translation Models for Clinical Domain via Transfer Learning
Lifeng Han
G. Erofeev
Irina Sorokina
Serge Gladkoff
Goran Nenadic
66
8
0
12 Oct 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
Long Phan
Tai Dang
H. Tran
Trieu H. Trinh
Vy Phan
Lam D. Chau
Minh-Thang Luong
56
8
0
11 Oct 2022
Multilingual Representation Distillation with Contrastive Learning
Weiting Tan
Kevin Heffernan
Holger Schwenk
Philipp Koehn
77
16
0
10 Oct 2022
Meta-Principled Family of Hyperparameter Scaling Strategies
Sho Yaida
111
16
0
10 Oct 2022
Toxicity in Multilingual Machine Translation at Scale
Marta R. Costa-jussá
Eric Michael Smith
C. Ropers
Daniel Licht
Jean Maillard
Javier Ferrando
Carlos Escolano
96
27
0
06 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
270
99
0
06 Oct 2022
Text Characterization Toolkit
Daniel Simig
Tianlu Wang
Verna Dankers
Peter Henderson
Khuyagbaatar Batsuren
Dieuwke Hupkes
Mona T. Diab
64
0
0
04 Oct 2022
Revamping Multilingual Agreement Bidirectionally via Switched Back-translation for Multilingual Neural Machine Translation
Hongyuan Lu
Haoyang Huang
Shuming Ma
Dongdong Zhang
Furu Wei
Wai Lam
52
0
0
28 Sep 2022
Language Varieties of Italy: Technology Challenges and Opportunities
Alan Ramponi
85
7
0
20 Sep 2022
The first neural machine translation system for the Erzya language
David Dale
115
7
0
19 Sep 2022
Examining Large Pre-Trained Language Models for Machine Translation: What You Don't Know About It
Lifeng Han
G. Erofeev
Irina Sorokina
Serge Gladkoff
Goran Nenadic
LM&MA
64
7
0
15 Sep 2022
Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks
B. Wanjawa
Lilian D. A. Wanzare
F. Indede
Owen McOnyango
Edward Ombui
Lawrence Muchemi
54
18
0
25 Aug 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
Tahir Javed
Kaushal Bhogale
A. Raman
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
ELM
90
26
0
24 Aug 2022
Text to Image Generation: Leaving no Language Behind
Pedro Reviriego
Elena Merino-Gómez
VLM
49
13
0
19 Aug 2022
Silo NLP's Participation at WAT2022
Shantipriya Parida
Subhadarshi Panda
Stig-Arne Gronroos
Mark Granroth-Wilding
Mika Koistinen
58
3
0
02 Aug 2022
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann
Abhik Bhattacharjee
Abinaya Mahendiran
Alex Jinpeng Wang
Alexandros Papangelis
...
Yacine Jernite
Yi Xu
Yisi Sang
Yixin Liu
Yufang Hou
118
39
0
22 Jun 2022
Bitext Mining Using Distilled Sentence Representations for Low-Resource Languages
Kevin Heffernan
Onur Çelebi
Holger Schwenk
149
55
0
25 May 2022
Multilingual Machine Translation with Hyper-Adapters
Christos Baziotis
Mikel Artetxe
James Cross
Shruti Bhosale
124
23
0
22 May 2022
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users
Yash Madhani
Sushane Parthan
Priyanka A. Bedekar
N. Gokul
Ruchi Khapra
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
73
26
0
06 May 2022