Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.11934
Cited By
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 475 papers shown
Title
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
30
28
0
23 Oct 2022
Graphemic Normalization of the Perso-Arabic Script
R. Doctor
Alexander Gutkin
Cibu Johny
Brian Roark
R. Sproat
44
4
0
21 Oct 2022
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data
Akshat Gahoi
Jayant Duneja
Anshul Padhi
Shivam Mangale
Saransh Rajput
Tanvi Kamble
D. Sharma
Vasudeva Varma
30
3
0
21 Oct 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
24
8
0
21 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
81
2
0
21 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
41
17
0
18 Oct 2022
Tone prediction and orthographic conversion for Basaa
I. Nikitin
Brian O'Connor
Anastasia N. Safonova
23
1
0
13 Oct 2022
Decoupled Context Processing for Context Augmented Language Modeling
Zonglin Li
Ruiqi Guo
Surinder Kumar
RALM
KELM
21
23
0
11 Oct 2022
Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks
Charith Peris
Lizhen Tan
Thomas Gueudré
Turan Gojayev
Vivi Wei
Gokmen Oz
30
4
0
10 Oct 2022
Comparing Computational Architectures for Automated Journalism
Yan Sym
João Gabriel Moura Campos
M. M. José
Fabio Gagliardi Cozman
26
0
0
08 Oct 2022
Generative Language Models for Paragraph-Level Question Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
ELM
13
45
0
08 Oct 2022
Event Extraction: A Survey
Viet Dac Lai
23
9
0
07 Oct 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
22
45
0
06 Oct 2022
Language Models are Multilingual Chain-of-Thought Reasoners
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLM
LRM
174
326
0
06 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
50
13
0
06 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model
Jacob Eisenstein
D. Andor
Bernd Bohnet
Michael Collins
David M. Mimno
LRM
194
24
0
05 Oct 2022
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
Kazuma Hashimoto
K. Raman
VLM
19
1
0
29 Sep 2022
COMPILING: A Benchmark Dataset for Chinese Complexity Controllable Definition Generation
Jiaxin Yuan
Cunliang Kong
Chenhui Xie
Liner Yang
Erhong Yang
27
4
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
70
45
0
29 Sep 2022
An Empirical Study on Cross-X Transfer for Legal Judgment Prediction
Joel Niklaus
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
55
19
0
25 Sep 2022
MonoByte: A Pool of Monolingual Byte-level Language Models
Hugo Queiroz Abonizio
Leandro Rodrigues de Souza
R. Lotufo
Rodrigo Nogueira
40
1
0
22 Sep 2022
A Benchmark for Understanding and Generating Dialogue between Characters in Stories
Jianzhu Yao
Ziqi Liu
Jian Guan
Minlie Huang
29
1
0
18 Sep 2022
Parameter-Efficient Finetuning for Robust Continual Multilingual Learning
Kartikeya Badola
Shachi Dave
Partha P. Talukdar
CLL
KELM
39
7
0
14 Sep 2022
A New Aligned Simple German Corpus
Vanessa Toborek
M. Busch
M. Bossert
Christian Bauckhage
Pascal Welke
18
12
0
02 Sep 2022
MATra: A Multilingual Attentive Transliteration System for Indian Scripts
Yash Raj
Bhavesh Laddagiri
21
4
0
23 Aug 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
27
14
0
16 Aug 2022
A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception
Keenan I. Jones
Enes ALTUNCU
V. N. Franqueira
Yi-Chia Wang
Shujun Li
DeLMO
42
3
0
11 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gokhan Tur
Premkumar Natarajan
56
82
0
02 Aug 2022
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
38
46
0
14 Jul 2022
BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling
Javier de la Rosa
E. G. Ponferrada
Paulo Villegas
Pablo González de Prado Salas
Manu Romero
María Grandury
35
95
0
14 Jul 2022
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation
Jinming Zhao
Haomiao Yang
Ehsan Shareghi
Gholamreza Haffari
48
19
0
03 Jul 2022
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño
David Pérez-Fernández
Jordi Armengol-Estapé
D. Griol
Z. Callejas
46
2
0
30 Jun 2022
Link the World: Improving Open-domain Conversation with Dynamic Spatiotemporal-aware Knowledge
Han Zhou
Xinchao Xu
Wenquan Wu
Zheng-Yu Niu
Hua Wu
Siqi Bao
Fan Wang
Haifeng Wang
KELM
35
7
0
28 Jun 2022
Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification
Yuya Yamamoto
Juhan Nam
Hiroko Terasawa
28
8
0
24 Jun 2022
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann
Abhik Bhattacharjee
Abinaya Mahendiran
Alex Jinpeng Wang
Alexandros Papangelis
...
Yacine Jernite
Yi Xu
Yisi Sang
Yixin Liu
Yufang Hou
47
38
0
22 Jun 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
40
65
0
21 Jun 2022
Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods
Valeriya Goloviznina
Evgeny Kotelnikov
11
3
0
18 Jun 2022
A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX
Fabian Peller-Konrad
Rainer Kartmann
Christian R. G. Dreher
Andre Meixner
Fabian Reister
Markus Grotz
Tamim Asfour
24
17
0
05 Jun 2022
Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian
T. Shamardina
Vladislav Mikhailov
Daniil Chernianskii
Alena Fenogenova
Marat Saidov
A. Valeeva
Tatiana Shavrina
I. Smurov
E. Tutubalina
Ekaterina Artemova
DeLMO
16
30
0
03 Jun 2022
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter
Luisa März
Katharina Schmid
Erion cCano
41
18
0
31 May 2022
Language Anisotropic Cross-Lingual Model Editing
Yang Xu
Yutai Hou
Wanxiang Che
Min Zhang
KELM
101
24
0
25 May 2022
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
Ashish V. Thapliyal
Jordi Pont-Tuset
Xi Chen
Radu Soricut
VGen
90
72
0
25 May 2022
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models
Da Yin
Hritik Bansal
Masoud Monajatipoor
Liunian Harold Li
Kai-Wei Chang
49
28
0
24 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Ahmet Üstün
Asa Cooper Stickland
37
7
0
23 May 2022
The Geometry of Multilingual Language Model Representations
Tyler A. Chang
Zhuowen Tu
Benjamin Bergen
23
56
0
22 May 2022
Descartes: Generating Short Descriptions of Wikipedia Articles
Marija Sakota
Maxime Peyrard
Robert West
VLM
20
2
0
20 May 2022
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
Kabir Ahuja
Monojit Choudhury
Sandipan Dandapat
29
3
0
12 May 2022
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
62
297
0
10 May 2022
Enhancing Cross-lingual Transfer by Manifold Mixup
Huiyun Yang
Huadong Chen
Hao Zhou
Lei Li
AAML
31
44
0
09 May 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Z. Chen
Yonghui Wu
Macduff Hughes
56
98
0
09 May 2022
Previous
1
2
3
...
10
6
7
8
9
Next