ResearchTrend.AI
No Language Left Behind: Scaling Human-Centered Machine Translation

11 July 2022
NLLB Team
Marta R. Costa-jussá
James Cross
Onur Çelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
    MoE

Papers citing "No Language Left Behind: Scaling Human-Centered Machine Translation"

50 / 801 papers shown
Title
Zero-shot cross-lingual transfer in instruction tuning of large language models
Nadezhda Chirkova
Vassilina Nikoulina
LRM
82
4
0
22 Feb 2024
GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages
Spencer Rarrick
Ranjita Naik
Sundar Poudel
Vishal Chowdhary
95
2
0
22 Feb 2024
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
83
3
0
21 Feb 2024
Multilingual Coreference Resolution in Low-resource South Asian Languages
Ritwik Mishra
Pooja Desur
R. Shah
Ponnurangam Kumaraguru
77
4
0
21 Feb 2024
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
Anas Himmi
Guillaume Staerman
Marine Picot
Pierre Colombo
Nuno M. Guerreiro
417
7
0
20 Feb 2024
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
Adrian Cosma
Ioan-Bogdan Iordache
Paolo Rosso
OffRL
45
3
0
20 Feb 2024
UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation
Ali Naseh
Sai Vallurupalli
LRM
63
2
0
20 Feb 2024
Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation
Zheng Wei Lim
Ekaterina Vylomova
Trevor Cohn
Charles Kemp
49
6
0
20 Feb 2024
High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Michela Lorandi
Anya Belz
58
5
0
19 Feb 2024
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages
Yuan Zhang
Yile Wang
Zijun Liu
Shuo Wang
Xiaolong Wang
Peng Li
Maosong Sun
Yang Liu
LRM
119
15
0
19 Feb 2024
Enabling Weak LLMs to Judge Response Reliability via Meta Ranking
Zijun Liu
Boqun Kou
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Yang Liu
106
3
0
19 Feb 2024
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
Seunghyeok Hong
Sangwon Baek
Sangdae Nam
Guijin Son
Seungone Kim
ELM
LRM
119
17
0
18 Feb 2024
Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution
Nuo Xu
Jun Zhao
Can Zu
Sixian Li
Lu Chen
...
Shihan Dou
Wenjuan Qin
Tao Gui
Qi Zhang
Xuanjing Huang
88
7
0
18 Feb 2024
Pushing the Limits of Zero-shot End-to-End Speech Translation
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
97
10
0
16 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
116
58
0
15 Feb 2024
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
Israel Abebe Azime
A. Tonja
Tadesse Destaw Belay
Mitiku Yohannes Fuge
A. Wassie
Eyasu Shiferaw Jada
Yonas Chanie
W. Sewunetie
Seid Muhie Yimam
42
3
0
12 Feb 2024
Text Detoxification as Style Transfer in English and Hindi
Sourabrata Mukherjee
Akanksha Bansal
Atul Kr. Ojha
John P. McCrae
Ondrej Dusek
61
9
0
12 Feb 2024
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models
Prachi Jain
Ashutosh Sathe
Varun Gumma
Kabir Ahuja
Sunayana Sitaram
110
1
0
12 Feb 2024
Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora
Surangika Ranathunga
Nisansa de Silva
Menan Velayuthan
Aloka Fernando
Charitha Rathnayake
105
13
0
12 Feb 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Dong Zhang
Zhehuai Chen
Eng Siong Chng
91
21
0
10 Feb 2024
Multilingual E5 Text Embeddings: A Technical Report
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
92
137
0
08 Feb 2024
GPTs Are Multilingual Annotators for Sequence Generation Tasks
Juhwan Choi
Eunju Lee
Kyohoon Jin
Youngbin Kim
66
11
0
08 Feb 2024
Soft Prompt Tuning for Cross-Lingual Transfer: When Less is More
Fred Philippy
Siwen Guo
Shohreh Haddadan
Cedric Lothritz
Jacques Klein
Tegawende F. Bissyande
AAML
VLM
42
2
0
06 Feb 2024
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Jianlv Chen
Shitao Xiao
Peitian Zhang
Kun Luo
Defu Lian
Zheng Liu
713
448
0
05 Feb 2024
CIDAR: Culturally Relevant Instruction Dataset For Arabic
Zaid Alyafeai
Khalid Almubarak
Ahmed Ashraf
Deema Alnuhait
Saied Alshahrani
...
Qais Gawah
Zead Saleh
Mustafa Ghaleb
Yousef Ali
Maged S. Al-Shaibani
77
11
0
05 Feb 2024
Constrained Decoding for Cross-lingual Label Projection
Duong Minh Le
Yang Chen
Alan Ritter
Wei Xu
65
7
0
05 Feb 2024
Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning
Ashish Agrawal
Barah Fazili
Preethi Jyothi
70
3
0
03 Feb 2024
A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages
Md Mahfuz Ibn Alam
Sina Ahmadi
Antonios Anastasopoulos
132
0
0
02 Feb 2024
InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
Reyna Abhyankar
Zijian He
Vikranth Srivatsa
Hao Zhang
Yiying Zhang
RALM
80
15
0
02 Feb 2024
Code-Switched Language Identification is Harder Than You Think
Laurie Burchell
Alexandra Birch
Robert P. Thompson
Kenneth Heafield
57
0
0
02 Feb 2024
Sequence Shortening for Context-Aware Machine Translation
Paweł Mąka
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
49
2
0
02 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
Céline Hudelot
Pierre Colombo
163
37
0
01 Feb 2024
MultiMUC: Multilingual Template Filling on MUC-4
William Gantt
Shabnam Behzad
Hannah YoungEun An
Yunmo Chen
Aaron Steven White
Benjamin Van Durme
M. Yarmohammadi
56
4
0
29 Jan 2024
Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation
Víctor M. Sánchez-Cartagena
Miquel Espla-Gomis
J. A. Pérez-Ortiz
F. Sánchez-Martínez
68
4
0
29 Jan 2024
Airavata: Introducing Hindi Instruction-tuned LLM
Jay Gala
Thanmay Jayakumar
Jaavid Aktar Husain
M. AswanthKumar
Mohammed Safi Ur Rahman Khan
...
Ratish Puduppully
Mitesh M. Khapra
Raj Dabre
Rudra Murthy
Anoop Kunchukuttan
91
27
0
26 Jan 2024
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization
Jaavid Aktar Husain
Raj Dabre
Aswanth Kumar
Jay Gala
Thanmay Jayakumar
Ratish Puduppully
Anoop Kunchukuttan
130
15
0
25 Jan 2024
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache
Leyang Xue
Yao Fu
Zhan Lu
Luo Mai
Mahesh K. Marina
MoE
85
4
0
25 Jan 2024
Misgendering and Assuming Gender in Machine Translation when Working with Low-Resource Languages
Sourojit Ghosh
Srishti Chatterjee
60
0
0
24 Jan 2024
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
Lingfeng Shen
Weiting Tan
Sihao Chen
Yunmo Chen
Jingyu Zhang
Haoran Xu
Boyuan Zheng
Philipp Koehn
Daniel Khashabi
92
49
0
23 Jan 2024
Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
Zhiwei He
Xing Wang
Wenxiang Jiao
Zhuosheng Zhang
Rui Wang
Shuming Shi
Zhaopeng Tu
ALM
96
27
0
23 Jan 2024
What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition
Carolin Holtermann
Markus Frohmann
Navid Rekabsaz
Anne Lauscher
MoMe
64
5
0
23 Jan 2024
How Far Can 100 Samples Go? Unlocking Overall Zero-Shot Multilingual Translation via Tiny Multi-Parallel Data
Di Wu
Shaomu Tan
Yan Meng
David Stap
Christof Monz
59
0
0
22 Jan 2024
An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal A. Chitale
Jay Gala
Raj Dabre
LRM
98
7
0
22 Jan 2024
End-to-End Argument Mining over Varying Rhetorical Structures
Elena Chistova
28
4
0
20 Jan 2024
LangBridge: Multilingual Reasoning Without Multilingual Supervision
Dongkeun Yoon
Joel Jang
Sungdong Kim
Seungone Kim
Sheikh Shafayat
Minjoon Seo
LRM
56
15
0
19 Jan 2024
Bridging Cultural Nuances in Dialogue Agents through Cultural Value Surveys
Yong Cao
Min Chen
Daniel Hershcovich
117
7
0
18 Jan 2024
Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions
Nooshin Pourkamali
Shler Ebrahim Sharifi
LRM
66
9
0
16 Jan 2024
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Haoran Xu
Amr Sharaf
Yunmo Chen
Weiting Tan
Lingfeng Shen
Benjamin Van Durme
Kenton W. Murray
Young Jin Kim
ALM
120
266
0
16 Jan 2024
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Dominik Macko
Robert Moro
Adaku Uchendu
Ivan Srba
Jason Samuel Lucas
Michiharu Yamashita
Nafis Irtiza Tripto
Dongwon Lee
Jakub Simko
Maria Bielikova
DeLMO
91
21
0
15 Jan 2024
MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models
Divyanshu Aggarwal
Ashutosh Sathe
Ishaan Watts
Sunayana Sitaram
64
2
0
15 Jan 2024