ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.11934
  4. Cited By
mT5: A massively multilingual pre-trained text-to-text transformer

mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
ArXivPDFHTML

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 469 papers shown
Title
A Systematic Study of Performance Disparities in Multilingual
  Task-Oriented Dialogue Systems
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems
Songbo Hu
Han Zhou
Moy Yuan
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Anna Korhonen
Ivan Vulić
35
3
0
19 Oct 2023
Optimized Tokenization for Transcribed Error Correction
Optimized Tokenization for Transcribed Error Correction
Tomer Wullach
Shlomo E. Chazan
32
0
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
27
60
0
16 Oct 2023
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
Karel DÓosterlinck
Semere Kiros Bitew
Brandon Papineau
Christopher Potts
Thomas Demeester
Chris Develder
32
8
0
09 Oct 2023
A Benchmark for Learning to Translate a New Language from One Grammar
  Book
A Benchmark for Learning to Translate a New Language from One Grammar Book
Garrett Tanzer
Mirac Suzgun
Chenguang Xi
Dan Jurafsky
Luke Melas-Kyriazi
24
51
0
28 Sep 2023
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Atakan Kara
Farrin Marouf Sofian
Andrew Bond
Gözde Gül Sahin
26
4
0
20 Sep 2023
Benchmarking Procedural Language Understanding for Low-Resource
  Languages: A Case Study on Turkish
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish
Arda Uzunouglu
Gözde Gül Sahin
34
4
0
13 Sep 2023
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
Shaltiel Shmidman
Avi Shmidman
Moshe Koppel
22
7
0
31 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
54
14
0
23 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder
  Language Models
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
M. Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
35
33
0
15 Aug 2023
NewsDialogues: Towards Proactive News Grounded Conversation
NewsDialogues: Towards Proactive News Grounded Conversation
Siheng Li
Yichun Yin
Cheng Yang
Wangjie Jiang
Yiwei Li
Ze-Long Cheng
Lifeng Shang
Xin Jiang
Qun Liu
Yujiu Yang
29
5
0
12 Aug 2023
Extrapolating Large Language Models to Non-English by Aligning Languages
Extrapolating Large Language Models to Non-English by Aligning Languages
Wenhao Zhu
Yunzhe Lv
Qingxiu Dong
Fei Yuan
Jingjing Xu
Shujian Huang
Lingpeng Kong
Jiajun Chen
Lei Li
45
66
0
09 Aug 2023
Okapi: Instruction-tuned Large Language Models in Multiple Languages
  with Reinforcement Learning from Human Feedback
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Viet Dac Lai
Chien Van Nguyen
Nghia Trung Ngo
Thuat Nguyen
Franck Dernoncourt
Ryan A. Rossi
Thien Huu Nguyen
ALM
42
131
0
29 Jul 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for
  Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Songbo Hu
Han Zhou
Mete Hergul
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Ivan Vulić
Anna Korhonen
36
10
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
38
118
0
25 Jul 2023
Empowering Cross-lingual Behavioral Testing of NLP Models with
  Typological Features
Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features
Ester Hlavnova
Sebastian Ruder
35
5
0
11 Jul 2023
Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin
Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin
Pin-Jie Lin
Muhammed Saeed
Ernie Chang
Merel C. J. Scholman
42
5
0
01 Jul 2023
On Evaluating Multilingual Compositional Generalization with Translated
  Datasets
On Evaluating Multilingual Compositional Generalization with Translated Datasets
Zi Wang
Daniel Hershcovich
28
7
0
20 Jun 2023
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Hengli Li
Songchun Zhu
Zilong Zheng
11
8
0
15 Jun 2023
Large-scale Language Model Rescoring on Long-form Data
Large-scale Language Model Rescoring on Long-form Data
Tongzhou Chen
Cyril Allauzen
Yinghui Huang
Daniel S. Park
David Rybach
...
Rodrigo Cabrera
Kartik Audhkhasi
Bhuvana Ramabhadran
Pedro J. Moreno
Michael Riley
35
14
0
13 Jun 2023
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted
  Sentiment Classification Benchmark
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark
Lukasz Augustyniak
Szymon Wo'zniak
Marcin Gruza
Piotr Gramacki
Krzysztof Rajda
M. Morzy
Tomasz Kajdanowicz
33
5
0
13 Jun 2023
Can current NLI systems handle German word order? Investigating language
  model performance on a new German challenge set of minimal pairs
Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs
Ines Reinig
K. Markert
27
0
0
07 Jun 2023
Cross-Lingual Transfer Learning for Phrase Break Prediction with
  Multilingual Language Model
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model
Hoyeon Lee
Hyun-Wook Yoon
Jong-Hwan Kim
Jae-Min Kim
VLM
32
0
0
05 Jun 2023
Unsupervised Paraphrasing of Multiword Expressions
Unsupervised Paraphrasing of Multiword Expressions
Takashi Wada
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
32
0
0
02 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
31
49
0
30 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
68
189
0
29 May 2023
Byte-Level Grammatical Error Correction Using Synthetic and Curated
  Corpora
Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora
Svanhvít Lilja Ingólfsdóttir
Pétur Orri Ragnarsson
H. Jónsson
Haukur Barri Símonarson
Vilhjálmur Þorsteinsson
Vésteinn Snæbjarnarson
SyDa
38
9
0
29 May 2023
A Practical Toolkit for Multilingual Question and Answer Generation
A Practical Toolkit for Multilingual Question and Answer Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
SyDa
31
13
0
27 May 2023
Revisiting non-English Text Simplification: A Unified Multilingual
  Benchmark
Revisiting non-English Text Simplification: A Unified Multilingual Benchmark
Michael Joseph Ryan
Tarek Naous
Wei-ping Xu
31
25
0
25 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal
  Image Generation
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
45
16
0
24 May 2023
An Efficient Multilingual Language Model Compression through Vocabulary
  Trimming
An Efficient Multilingual Language Model Compression through Vocabulary Trimming
Asahi Ushio
Yi Zhou
Jose Camacho-Collados
46
7
0
24 May 2023
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Ahmed Oumar El-Shangiti
Muhammad Abdul-Mageed
LM&MA
38
17
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
44
54
0
24 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language
  Models
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei-ping Xu
37
85
0
23 May 2023
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
Chenxi Whitehouse
Monojit Choudhury
Alham Fikri Aji
SyDa
LRM
32
68
0
23 May 2023
Multi-Granularity Prompts for Topic Shift Detection in Dialogue
Multi-Granularity Prompts for Topic Shift Detection in Dialogue
Jian-Dong Lin
Yaxin Fan
Xiaomin Chu
Peifeng Li
Qiaoming Zhu
22
5
0
23 May 2023
Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation
Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation
Ayush Maheshwari
Ashim Gupta
Amrith Krishna
Atul Kumar Singh
Ganesh Ramakrishnan
G. Anil Kumar
Jitin Singla
32
1
0
23 May 2023
Concept-aware Training Improves In-context Learning Ability of Language
  Models
Concept-aware Training Improves In-context Learning Ability of Language Models
Michal Štefánik
Marek Kadlcík
KELM
LRM
41
0
0
23 May 2023
Exploring Large Language Models for Classical Philology
Exploring Large Language Models for Classical Philology
Frederick Riemenschneider
Anette Frank
19
15
0
23 May 2023
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual
  Pretrained Language Models
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
Peiqin Lin
Chengzhi Hu
Zheyu Zhang
André F. T. Martins
Hinrich Schütze
35
1
0
23 May 2023
Extrapolating Multilingual Understanding Models as Multilingual
  Generators
Extrapolating Multilingual Understanding Models as Multilingual Generators
Bohong Wu
Fei Yuan
Hai Zhao
Lei Li
Jingjing Xu
AI4CE
25
2
0
22 May 2023
Rethinking Semi-supervised Learning with Language Models
Rethinking Semi-supervised Learning with Language Models
Zhengxiang Shi
Francesco Tonolini
Nikolaos Aletras
Emine Yilmaz
G. Kazai
Yunlong Jiao
32
18
0
22 May 2023
Bidirectional Transformer Reranker for Grammatical Error Correction
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang
Hidetaka Kamigaito
Manabu Okumura
24
2
0
22 May 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
29
13
0
22 May 2023
Mitigating Data Imbalance and Representation Degeneration in
  Multilingual Machine Translation
Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation
Wen Lai
Alexandra Chronopoulou
Alexander Fraser
37
5
0
22 May 2023
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods
  by Language Models
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
43
1
0
22 May 2023
Multilingual Simplification of Medical Texts
Multilingual Simplification of Medical Texts
Sebastian Antony Joseph
Kathryn Kazanas
Keziah Reina
Vishnesh J. Ramanathan
Wei-ping Xu
Byron C. Wallace
Junyi Jessy Li
38
13
0
21 May 2023
SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot
  Cross-lingual Information Extraction
SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction
Jun-Yu Ma
Jia-Chen Gu
Zhen-Hua Ling
Quan Liu
Cong Liu
Guoping Hu
59
1
0
21 May 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500
  Languages
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALM
LRM
42
95
0
20 May 2023
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented
  Languages
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
J. Clark
Alexander Gutkin
Mihir Kale
Min Ma
...
Dan Garrette
R. Ingle
Melvin Johnson
Dmitry Panteleev
Partha P. Talukdar
ELM
26
38
0
19 May 2023
Previous
12345...8910
Next