ResearchTrend.AI
No Language Left Behind: Scaling Human-Centered Machine Translation

11 July 2022
NLLB Team
Marta R. Costa-jussá
James Cross
Onur Çelebi
Maha Elbayad
Kenneth Heafield
Kevin Heffernan
Elahe Kalbassi
Janice Lam
Daniel Licht
Jean Maillard
Anna Y. Sun
Skyler Wang
Guillaume Wenzek
Alison Youngblood
Bapi Akula
Loïc Barrault
Gabriel Mejia Gonzalez
Prangthip Hansanti
John Hoffman
Semarley Jarrett
Kaushik Ram Sadagopan
Dirk Rowe
Shannon L. Spruit
C. Tran
Pierre Yves Andrews
Necip Fazil Ayan
Shruti Bhosale
Sergey Edunov
Angela Fan
Cynthia Gao
Vedanuj Goswami
Francisco Guzmán
Philipp Koehn
Alexandre Mourachko
C. Ropers
Safiyyah Saleem
Holger Schwenk
Jeff Wang
    MoE

Papers citing "No Language Left Behind: Scaling Human-Centered Machine Translation"

50 / 801 papers shown
Title
Don't Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation
Giorgos Vernikos
Andrei Popescu-Belis
85
15
0
12 Jan 2024
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
Yihong Liu
Chunlan Ma
Haotian Ye
Hinrich Schütze
55
2
0
12 Jan 2024
Adapting Large Language Models for Document-Level Machine Translation
Minghao Wu
Thuy-Trang Vu
Zhuang Li
George F. Foster
Gholamreza Haffari
158
45
0
12 Jan 2024
PersianMind: A Cross-Lingual Persian-English Large Language Model
Pedram Rostami
Ali Salemi
M. Dousti
CLL, LRM
56
5
0
12 Jan 2024
An approach for mistranslation removal from popular dataset for Indic MT Task
Sudhansu Bala Das
Leo Raphael Rodrigues
Tapas Kumar Mishra
Bidyut Kr. Patra
37
1
0
12 Jan 2024
Machine Translation Models are Zero-Shot Detectors of Translation Direction
Michelle Wastl
Jannis Vamvas
Rico Sennrich
VLM
124
0
0
12 Jan 2024
Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models
Pengzhi Gao
Zhongjun He
Hua Wu
Haifeng Wang
AI4CE
68
3
0
11 Jan 2024
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Zhuoyuan Mao
Yen Yu
ALM
58
2
0
11 Jan 2024
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
Brian Thompson
Mehak Preet Dhaliwal
Peter Frisch
Tobias Domhan
Marcello Federico
88
17
0
11 Jan 2024
POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation
Shilong Pan
Zhiliang Tian
Liang Ding
Zhen Huang
Zhihua Wen
Dongsheng Li
107
2
0
11 Jan 2024
Aligning Translation-Specific Understanding to General Understanding in Large Language Models
Yi-Chong Huang
Xiaocheng Feng
Baohang Li
Chengpeng Fu
Wenshuai Huo
Ting Liu
Bing Qin
49
0
0
10 Jan 2024
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Marta R. Costa-jussá
Mariano Coria Meglioli
Pierre Yves Andrews
David Dale
Prangthip Hansanti
Elahe Kalbassi
Alex Mourachko
C. Ropers
Carleigh Wood
76
15
0
10 Jan 2024
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal
Bidisha Samanta
Siddharth Dalmia
Nitish Gupta
Shikhar Vashishth
Sriram Ganapathy
Abhishek Bapna
Prateek Jain
Partha P. Talukdar
CLL
83
38
0
04 Jan 2024
Cheetah: Natural Language Generation for 517 African Languages
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
67
6
0
02 Jan 2024
Typhoon: Thai Large Language Models
Kunat Pipatanakul
Phatrasek Jirabovonvisut
Potsawee Manakul
Sittipong Sripaisarnmongkol
Ruangsak Patomwong
Pathomporn Chokchainant
Kasima Tharnpipitchai
102
17
0
21 Dec 2023
Fine-tuning Large Language Models for Adaptive Machine Translation
Yasmin Moslem
Rejwanul Haque
Andy Way
59
29
0
20 Dec 2023
Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?
Tannon Kew
Florian Schottmann
Rico Sennrich
LRM
98
40
0
20 Dec 2023
Predicting Human Translation Difficulty with Neural Machine Translation
Zheng Wei Lim
Ekaterina Vylomova
Charles Kemp
Trevor Cohn
102
0
0
19 Dec 2023
An In-depth Look at Gemini's Language Abilities
Syeda Nahida Akter
Zichun Yu
Aashiq Muhamed
Tianyue Ou
Alex Bäuerle
Ángel Alexander Cabrera
Krish Dholakia
Chenyan Xiong
Graham Neubig
LRM, ELM
98
36
0
18 Dec 2023
Split and Rephrase with Large Language Models
David Ponce
Thierry Etchegoyhen
Jesús Calleja-Perez
Harritxu Gete
ReLM, LRM
88
2
0
18 Dec 2023
IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages
Saiful Haq
Ashutosh Sharma
Pushpak Bhattacharyya
59
3
0
15 Dec 2023
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Jack Urbanek
Florian Bordes
Pietro Astolfi
Mary Williamson
Vasu Sharma
Adriana Romero Soriano
CLIP, 3DV
93
48
0
14 Dec 2023
A Survey of Text Watermarking in the Era of Large Language Models
Aiwei Liu
Leyi Pan
Yijian Lu
Jingjing Li
Xuming Hu
Xi Zhang
Lijie Wen
Irwin King
Hui Xiong
Philip S. Yu
WaLM
117
66
0
13 Dec 2023
Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction
S. Kwon
Gagan Bhatia
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
103
18
0
13 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
60
1
0
12 Dec 2023
Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning
Lifeng Han
Serge Gladkoff
G. Erofeev
Irina Sorokina
Betty Galiano
Goran Nenadic
LM&MA
97
10
0
12 Dec 2023
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Dami Choi
Derrick Xin
Hamid Dadkhahi
Justin Gilmer
Ankush Garg
Orhan Firat
Chih-Kuan Yeh
Andrew M. Dai
Behrooz Ghorbani
111
3
0
11 Dec 2023
First Attempt at Building Parallel Corpora for Machine Translation of Northeast India's Very Low-Resource Languages
A. Tonja
Melkamu Mersha
Ananya Kalita
Olga Kolesnikova
Jugal Kalita
53
2
0
08 Dec 2023
Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix
Xinyu Ma
Xuebo Liu
Min Zhang
90
1
0
05 Dec 2023
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
J. Choi
Se Jin Park
Minsu Kim
Y. Ro
114
14
0
05 Dec 2023
SeaLLMs -- Large Language Models for Southeast Asia
Xuan-Phi Nguyen
Wenxuan Zhang
Xin Li
Mahani Aljunied
Zhiqiang Hu
...
Yue Deng
Sen Yang
Chaoqun Liu
Hang Zhang
Li Bing
LRM
112
85
0
01 Dec 2023
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs
Simone Conia
Min Li
Daniel Lee
U. F. Minhas
Ihab F. Ilyas
Yunyao Li
115
9
0
27 Nov 2023
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
Daryna Dementieva
Daniil Moskovskiy
David Dale
Alexander Panchenko
113
16
0
23 Nov 2023
AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages
Jiayi Wang
David Ifeoluwa Adelani
Sweta Agrawal
Marek Masiak
Ricardo Rei
...
V. Otiende
C. Mbonu
Sakayo Toadoum Sari
Yao Lu
Pontus Stenetorp
79
10
0
16 Nov 2023
Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking
Nan Xu
Fei Wang
Ben Zhou
Bangzheng Li
Chaowei Xiao
Muhao Chen
111
60
0
16 Nov 2023
Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability
Wei-Rui Chen
Ife Adebara
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
74
7
0
16 Nov 2023
To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
Benedikt Ebing
Goran Glavaš
71
4
0
15 Nov 2023
Divergences between Language Models and Human Brains
Yuchen Zhou
Emmy Liu
Graham Neubig
Michael J. Tarr
Leila Wehbe
135
3
0
15 Nov 2023
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
142
8
0
15 Nov 2023
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
J. Michaelov
Catherine Arnett
Tyler A. Chang
Benjamin Bergen
66
14
0
15 Nov 2023
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Abdelrahman Mohamed
Fakhraddin Alwajih
El Moatez Billah Nagoudi
Alcides Alcoba Inciarte
Muhammad Abdul-Mageed
VLM, MLLM
65
7
0
15 Nov 2023
Evaluating Gender Bias in the Translation of Gender-Neutral Languages into English
Spencer Rarrick
Ranjita Naik
Sundar Poudel
Vishal Chowdhary
62
0
0
15 Nov 2023
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning
Zhihan Zhang
Dong-Ho Lee
Yuwei Fang
Wenhao Yu
Mengzhao Jia
Meng Jiang
Francesco Barbieri
ALM
116
30
0
15 Nov 2023
Extending Multilingual Machine Translation through Imitation Learning
Wen Lai
Viktor Hangya
Alexander Fraser
LRM, CLL
69
4
0
14 Nov 2023
Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding
Guangyu Yang
Jinghong Chen
Weizhe Lin
Bill Byrne
88
22
0
14 Nov 2023
MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China
Chen Zhang
Mingxu Tao
Quzhe Huang
Jiuheng Lin
Zhibin Chen
Yansong Feng
56
3
0
14 Nov 2023
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Sanchit Ahuja
Divyanshu Aggarwal
Varun Gumma
Ishaan Watts
Ashutosh Sathe
...
Rishav Hada
Prachi Jain
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
ELM
113
46
0
13 Nov 2023
Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models
Alireza Mohammadshahi
Jannis Vamvas
Rico Sennrich
LRM
51
0
0
13 Nov 2023
Simple and Effective Input Reformulations for Translation
Brian Yu
Hansen Lillemark
Kurt Keutzer
69
0
0
12 Nov 2023
Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study
Maarten De Raedt
Semere Kiros Bitew
Fréderic Godin
Thomas Demeester
Chris Develder
87
4
0
11 Nov 2023