2010.06354
Cited By
The Tatoeba Translation Challenge -- Realistic Data Sets for Low Resource and Multilingual MT
13 October 2020
Jörg Tiedemann
Papers citing
"The Tatoeba Translation Challenge -- Realistic Data Sets for Low Resource and Multilingual MT"
29 / 29 papers shown
Title
Bemba Speech Translation: Exploring a Low-Resource African Language
Muhammad Hazim Al Farouq
Aman Kassahun Wassie
Yasmin Moslem
41
0
0
05 May 2025
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Naome A. Etori
Kevin Lu
Randu Karisa
Arturs Kanepajs
LRM
ELM
160
0
0
14 Mar 2025
UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings
Layba Fiaz
Munief Hassan Tahir
Sana Shams
Sarmad Hussain
49
0
0
24 Feb 2025
Merging Feed-Forward Sublayers for Compressed Transformers
Neha Verma
Kenton W. Murray
Kevin Duh
AI4CE
50
0
0
10 Jan 2025
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
Ming Cheng
Jiaying Gong
Chenhan Yuan
William A. Ingram
Edward A. Fox
Hoda Eldardiry
42
0
0
07 Nov 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
37
3
0
26 Sep 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
42
1
0
14 Jun 2024
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
38
7
0
15 Nov 2023
Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings
Timothee Mickus
Raúl Vázquez
20
2
0
10 Oct 2023
Larth: Dataset and Machine Translation for Etruscan
Gianluca Vico
Gerasimos Spanakis
6
1
0
09 Oct 2023
Sheffield's Submission to the AmericasNLP Shared Task on Machine Translation into Indigenous Languages
Edward Gow-Smith
Danae Sánchez Villegas
23
9
0
16 Jun 2023
Angler: Helping Machine Translation Practitioners Prioritize Model Improvements
Samantha Robertson
Zijie J. Wang
Dominik Moritz
Mary Beth Kery
Fred Hohman
32
15
0
12 Apr 2023
Bilex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation
Alex Jones
Isaac Caswell
Ishan Saxena
Orhan Firat
21
8
0
27 Mar 2023
Adaptive Machine Translation with Large Language Models
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
AI4CE
25
75
0
30 Jan 2023
Improving Machine Translation with Phrase Pair Injection and Corpus Filtering
Akshay Batheja
P. Bhattacharyya
30
15
0
19 Jan 2023
Check-worthy Claim Detection across Topics for Automated Fact-checking
Amani S. Abumansour
A. Zubiaga
16
5
0
16 Dec 2022
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin
Abhinav Rao
Ho Thi-Nga
Chng Eng Siong
19
3
0
10 Dec 2022
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
29
3
0
17 Nov 2022
Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models
Harshita Diddee
Sandipan Dandapat
Monojit Choudhury
T. Ganu
Kalika Bali
29
5
0
27 Oct 2022
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages
Alireza Mohammadshahi
Vassilina Nikoulina
Alexandre Berard
Caroline Brun
James Henderson
Laurent Besacier
VLM
MoE
LRM
29
20
0
20 Oct 2022
Checks and Strategies for Enabling Code-Switched Machine Translation
Thamme Gowda
Mozhdeh Gheini
Jonathan May
30
3
0
11 Oct 2022
From Zero to Production: Baltic-Ukrainian Machine Translation Systems to Aid Refugees
Toms Bergmanis
Marcis Pinnis
17
1
0
28 Sep 2022
PreQuEL: Quality Estimation of Machine Translation Outputs in Advance
Shachar Don-Yehiya
Leshem Choshen
Omri Abend
27
10
0
18 May 2022
Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Andreas Grivas
Nikolay Bogoychev
Adam Lopez
11
9
0
12 Mar 2022
A Large-Scale Study of Machine Translation in the Turkic Languages
Jamshidbek Mirzakhalov
A. Babu
Duygu Ataman
S. Kariev
Francis M. Tyers
...
Esra Onal
Shaxnoza Pulatova
Ahsan Wahab
Orhan Firat
Sriram Chellappan
19
28
0
09 Sep 2021
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindřich Helcl
Alexandra Birch
AIMat
31
147
0
01 Sep 2021
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Naman Goyal
Cynthia Gao
Vishrav Chaudhary
Peng-Jen Chen
Guillaume Wenzek
Da Ju
Sanjana Krishnan
Marc'Aurelio Ranzato
Francisco Guzmán
Angela Fan
15
553
0
06 Jun 2021
Robust Experimentation in the Continuous Time Bandit Problem
Pasquale Antonante
25
0
0
31 Mar 2021
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat
Kyunghyun Cho
Yoshua Bengio
LRM
AIMat
214
623
0
06 Jan 2016