ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.04672
  4. Cited By
No Language Left Behind: Scaling Human-Centered Machine Translation
v1v2v3 (latest)

No Language Left Behind: Scaling Human-Centered Machine Translation

11 July 2022
Nllb team
Marta R. Costa-jussá
James Cross
Onur cCelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
    MoE
ArXiv (abs)PDFHTMLGithub (31473★)

Papers citing "No Language Left Behind: Scaling Human-Centered Machine Translation"

50 / 801 papers shown
Title
Introduction to Latent Variable Energy-Based Models: A Path Towards
  Autonomous Machine Intelligence
Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence
Anna Dawid
Yann LeCun
DRL
104
31
0
05 Jun 2023
BigTranslate: Augmenting Large Language Models with Multilingual
  Translation Capability over 100 Languages
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
Wen Yang
Chong Li
Jiajun Zhang
Chengqing Zong
LRM
95
54
0
29 May 2023
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine
  Translation
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Md Mahfuz Ibn Alam
Sina Ahmadi
Antonios Anastasopoulos
93
8
0
26 May 2023
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba
Claytone Sikasote
Eunice Mukonde
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
58
8
0
26 May 2023
Script Normalization for Unconventional Writing of Under-Resourced
  Languages in Bilingual Communities
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities
Sina Ahmadi
Antonios Anastasopoulos
61
2
0
25 May 2023
Bhasha-Abhijnaanam: Native-script and romanized Language Identification
  for 22 Indic languages
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Yash Madhani
Mitesh M. Khapra
Anoop Kunchukuttan
26
11
0
25 May 2023
Towards Higher Pareto Frontier in Multilingual Machine Translation
Towards Higher Pareto Frontier in Multilingual Machine Translation
Yi-Chong Huang
Xiaocheng Feng
Xinwei Geng
Baohang Li
Bing Qin
82
11
0
25 May 2023
Eliciting the Translation Ability of Large Language Models via
  Multilingual Finetuning with Translation Instructions
Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions
Jiahuan Li
Hao Zhou
Shujian Huang
Shan Chen
Jiajun Chen
LRM
122
57
0
24 May 2023
Bactrian-X: Multilingual Replicable Instruction-Following Models with
  Low-Rank Adaptation
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
Haonan Li
Fajri Koto
Minghao Wu
Alham Fikri Aji
Timothy Baldwin
ALM
72
76
0
24 May 2023
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
Md. Tawkat Islam Khondaker
Abdul Waheed
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELMLM&MA
83
70
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
120
63
0
24 May 2023
Cascaded Beam Search: Plug-and-Play Terminology-Forcing For Neural
  Machine Translation
Cascaded Beam Search: Plug-and-Play Terminology-Forcing For Neural Machine Translation
Frédéric Odermatt
Béni Egressy
Roger Wattenhofer
38
0
0
23 May 2023
WebIE: Faithful and Robust Information Extraction on the Web
WebIE: Faithful and Robust Information Extraction on the Web
Chenxi Whitehouse
Clara Vania
Alham Fikri Aji
Christos Christodoulopoulos
Andrea Pierleoni
SyDa
88
12
0
23 May 2023
Multilingual Pixel Representations for Translation and Effective
  Cross-lingual Transfer
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer
Elizabeth Salesky
Neha Verma
Philipp Koehn
Matt Post
93
16
0
23 May 2023
LIMIT: Language Identification, Misidentification, and Translation using
  Hierarchical Models in 350+ Languages
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
M. Agarwal
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
104
7
0
23 May 2023
Linear Cross-Lingual Mapping of Sentence Embeddings
Linear Cross-Lingual Mapping of Sentence Embeddings
Oleg V. Vasilyev
Fumika Isono
John Bohannon
LLMSV
50
1
0
23 May 2023
Revisiting Machine Translation for Cross-lingual Classification
Revisiting Machine Translation for Cross-lingual Classification
Mikel Artetxe
Vedanuj Goswami
Shruti Bhosale
Angela Fan
Luke Zettlemoyer
LRM
102
39
0
23 May 2023
Exploring Representational Disparities Between Multilingual and
  Bilingual Translation Models
Exploring Representational Disparities Between Multilingual and Bilingual Translation Models
Neha Verma
Kenton W. Murray
Kevin Duh
79
0
0
23 May 2023
Beyond Shared Vocabulary: Increasing Representational Word Similarities
  across Languages for Multilingual Machine Translation
Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation
Di Wu
Christof Monz
118
9
0
23 May 2023
When Does Monolingual Data Help Multilingual Translation: The Role of
  Domain and Model Scale
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale
Christos Baziotis
Biao Zhang
Alexandra Birch
Barry Haddow
136
2
0
23 May 2023
CTQScorer: Combining Multiple Features for In-context Example Selection
  for Machine Translation
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth Kumar
Ratish Puduppully
Raj Dabre
Anoop Kunchukuttan
103
13
0
23 May 2023
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in
  Multilingual Machine Translation
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation
Minwoo Lee
Hyukhun Koh
Kang-il Lee
Dongdong Zhang
Minsu Kim
Kyomin Jung
105
12
0
23 May 2023
Condensing Multilingual Knowledge with Lightweight Language-Specific
  Modules
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules
Haoran Xu
Weiting Tan
Shuyue Stella Li
Yunmo Chen
Benjamin Van Durme
Philipp Koehn
Kenton W. Murray
85
7
0
23 May 2023
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African
  Languages
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
Cheikh M. Bamba Dione
David Ifeoluwa Adelani
Peter Nabende
Jesujoba Oluwadara Alabi
Thapelo Sindane
...
Seydou T. Traoré
C. Uchechukwu
Aliyu Yusuf
M. Abdullahi
Dietrich Klakow
71
13
0
23 May 2023
An Open Dataset and Model for Language Identification
An Open Dataset and Model for Language Identification
Laurie Burchell
Alexandra Birch
Nikolay Bogoychev
Kenneth Heafield
70
36
0
23 May 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial
  Language Models
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models
Orevaoghene Ahia
Sachin Kumar
Hila Gonen
Jungo Kasai
David R. Mortensen
Noah A. Smith
Yulia Tsvetkov
122
98
0
23 May 2023
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual
  Pretrained Language Models
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
Peiqin Lin
Chengzhi Hu
Zheyu Zhang
André F. T. Martins
Hinrich Schütze
68
1
0
23 May 2023
InstructAlign: High-and-Low Resource Language Alignment via Continual
  Crosslingual Instruction Tuning
InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning
Samuel Cahyawijaya
Holy Lovenia
Tiezheng Yu
Willy Chung
Pascale Fung
ALM
91
15
0
23 May 2023
Translation and Fusion Improves Zero-shot Cross-lingual Information
  Extraction
Translation and Fusion Improves Zero-shot Cross-lingual Information Extraction
Yang Chen
Vedaant Shah
Alan Ritter
108
4
0
23 May 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil
  Demographic Biases in Languages at Scale
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
Marta R. Costa-jussá
Pierre Yves Andrews
Eric Michael Smith
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Daniel Licht
Carleigh Wood
57
18
0
22 May 2023
Decomposed Prompting for Machine Translation Between Related Languages
  using Large Language Models
Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models
Ratish Puduppully
Anoop Kunchukuttan
Raj Dabre
Ai Ti Aw
Nancy F. Chen
VLM
71
0
0
22 May 2023
Making Language Models Better Tool Learners with Execution Feedback
Making Language Models Better Tool Learners with Execution Feedback
Shuofei Qiao
Honghao Gui
Chengfei Lv
Qianghuai Jia
Huajun Chen
Ningyu Zhang
LLMAG
150
53
0
22 May 2023
Mitigating Data Imbalance and Representation Degeneration in
  Multilingual Machine Translation
Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation
Wen Lai
Alexandra Chronopoulou
Alexander Fraser
79
6
0
22 May 2023
Automatic Spell Checker and Correction for Under-represented Spoken
  Languages: Case Study on Wolof
Automatic Spell Checker and Correction for Under-represented Spoken Languages: Case Study on Wolof
Thierno Ibrahima Cissé
F. Sadat
30
4
0
22 May 2023
Understanding the Effect of Data Augmentation on Knowledge Distillation
Understanding the Effect of Data Augmentation on Knowledge Distillation
Ziqi Wang
Chi Han
Wenxuan Bao
Heng Ji
30
2
0
21 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
139
6
0
21 May 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500
  Languages
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALMLRM
136
107
0
20 May 2023
Accurate Knowledge Distillation with n-best Reranking
Accurate Knowledge Distillation with n-best Reranking
Hendra Setiawan
57
2
0
20 May 2023
ReSeTOX: Re-learning attention weights for toxicity mitigation in
  machine translation
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation
Javier García Gilabert
Carlos Escolano
Marta R. Costa-jussá
CLLMU
97
2
0
19 May 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination
  and Omission Detection in Machine Translation
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
David Dale
Elena Voita
Janice Lam
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Loïc Barrault
Marta R. Costa-jussá
HILM
191
30
0
19 May 2023
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide
  for Simultaneous Speech Translation
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation
Sara Papi
Marco Turchi
Matteo Negri
67
22
0
19 May 2023
NollySenti: Leveraging Transfer Learning and Machine Translation for
  Nigerian Movie Sentiment Classification
NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification
Iyanuoluwa Shode
David Ifeoluwa Adelani
J. Peng
Anna Feldman
79
12
0
18 May 2023
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine
  Translation
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
77
5
0
18 May 2023
Multilingual Event Extraction from Historical Newspaper Adverts
Multilingual Event Extraction from Historical Newspaper Adverts
Nadav Borenstein
N. Perez
Isabelle Augenstein
73
4
0
18 May 2023
ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores
  Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource
  Languages
ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource Languages
Sourojit Ghosh
Aylin Caliskan
86
81
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
126
113
0
17 May 2023
Beqi: Revitalize the Senegalese Wolof Language with a Robust Spelling
  Corrector
Beqi: Revitalize the Senegalese Wolof Language with a Robust Spelling Corrector
Derguene Mbaye
Moussa Diallo
53
3
0
15 May 2023
Not All Languages Are Created Equal in LLMs: Improving Multilingual
  Capability by Cross-Lingual-Thought Prompting
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Haoyang Huang
Tianyi Tang
Dongdong Zhang
Wayne Xin Zhao
Ting Song
Yan Xia
Furu Wei
LRM
114
179
0
11 May 2023
AfriQA: Cross-lingual Open-Retrieval Question Answering for African
  Languages
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Odunayo Ogundepo
T. Gwadabe
Clara E. Rivera
J. Clark
Sebastian Ruder
...
Neo Putini
Ndumiso Mngoma
Priscilla Amuok
R. Iro
Sonia Adhiambo34
91
16
0
11 May 2023
Chain-of-Dictionary Prompting Elicits Translation in Large Language
  Models
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
Hongyuan Lu
Haoran Yang
Haoyang Huang
Dongdong Zhang
Wai Lam
Furu Wei
LRMAI4CE
106
18
0
11 May 2023
Previous
123...1314151617
Next