ResearchTrend.AI
mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 475 papers shown
Title
OpineSum: Entailment-based self-training for abstractive opinion summarization
Annie Louis
Joshua Maynez
43
7
0
21 Dec 2022
Uncontrolled Lexical Exposure Leads to Overestimation of Compositional Generalization in Pretrained Models
Najoung Kim
Tal Linzen
P. Smolensky
34
30
0
21 Dec 2022
How Does Beam Search improve Span-Level Confidence Estimation in Generative Sequence Labeling?
Kazuma Hashimoto
Iftekhar Naim
K. Raman
UQLM
29
2
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
17
40
0
21 Dec 2022
Character-Aware Models Improve Visual Text Rendering
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
31
71
0
20 Dec 2022
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
54
24
0
20 Dec 2022
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
Nikita Moghe
E. Razumovskaia
Liane Guillou
Ivan Vulić
Anna Korhonen
Alexandra Birch
40
13
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
32
25
0
20 Dec 2022
LR-Sum: Summarization for Less-Resourced Languages
Chester Palen-Michel
Constantine Lignos
17
4
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
39
48
0
19 Dec 2022
Latent Diffusion for Language Generation
Justin Lovelace
Varsha Kishore
Chao-gang Wan
Eliot Shekhtman
Kilian Q. Weinberger
DiffM
24
71
0
19 Dec 2022
Towards leveraging latent knowledge and Dialogue context for real-world conversational question answering
Shaomu Tan
Denis Paperno
RALM
23
0
0
17 Dec 2022
Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
Jifan Chen
Yuhao Zhang
Lan Liu
Rui Dong
Xinchi Chen
Patrick Ng
William Yang Wang
Zhiheng Huang
AI4CE
32
4
0
17 Dec 2022
RISE: Leveraging Retrieval Techniques for Summarization Evaluation
David C. Uthus
Jianmo Ni
RALM
19
0
0
17 Dec 2022
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
William B. Held
Christopher Hidey
Fei Liu
Eric Zhu
Rahul Goel
Diyi Yang
Rushin Shah
34
0
0
15 Dec 2022
Multi-VALUE: A Framework for Cross-Dialectal English NLP
Caleb Ziems
William B. Held
Jingfeng Yang
Jwala Dhamala
Rahul Gupta
Diyi Yang
46
40
0
15 Dec 2022
Advancing Multilingual Pre-training: TRIP Triangular Document-level Pre-training for Multilingual Language Models
Hongyuan Lu
Haoyang Huang
Shuming Ma
Dongdong Zhang
W. Lam
Furu Wei
27
4
0
15 Dec 2022
Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization
Yunlong Liang
Fandong Meng
Jinan Xu
Jiaan Wang
Jinan Xu
Jie Zhou
33
20
0
15 Dec 2022
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation
Maha Elbayad
Anna Y. Sun
Shruti Bhosale
MoE
54
9
0
15 Dec 2022
DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog
Xin Zheng
Tianyu Liu
H. Meng
Xu Wang
Yu Jiang
Meng-Liang Rao
Binghuai Lin
Zhifang Sui
Yunbo Cao
35
2
0
14 Dec 2022
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua Wu
38
35
0
13 Dec 2022
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
25
2
0
12 Dec 2022
Implementing Deep Learning-Based Approaches for Article Summarization in Indian Languages
Rahul Tangsali
Aabha Pingle
Aditya Vyawahare
Isha Joshi
Raviraj Joshi
43
7
0
12 Dec 2022
MiLMo:Minority Multilingual Pre-trained Language Model
Sisi Liu
Hanru Shi
Xinhe Yu
Wugedele Bao
Yuan Sun
Xiaobing Zhao
31
0
0
04 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
41
5
0
04 Dec 2022
Nonparametric Masked Language Modeling
Sewon Min
Weijia Shi
M. Lewis
Xilun Chen
Wen-tau Yih
Hannaneh Hajishirzi
Luke Zettlemoyer
RALM
50
48
0
02 Dec 2022
Extending the Subwording Model of Multilingual Pretrained Models for New Languages
K. Imamura
Eiichiro Sumita
VLM
29
3
0
29 Nov 2022
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary Resources
Xinyan Velocity Yu
Akari Asai
Trina Chatterjee
Junjie Hu
Eunsol Choi
29
21
0
28 Nov 2022
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling
Zhijun Wang
Xuebo Liu
Min Zhang
27
11
0
23 Nov 2022
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
23
1
0
23 Nov 2022
Coreference Resolution through a seq2seq Transition-Based System
Bernd Bohnet
Chris Alberti
Michael Collins
28
39
0
22 Nov 2022
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
32
3
0
17 Nov 2022
Unified Question Answering in Slovene
Katja Logar
Marko Robnik-Šikonja
24
0
0
16 Nov 2022
Prompting PaLM for Translation: Assessing Strategies and Performance
David Vilar
Markus Freitag
Colin Cherry
Jiaming Luo
Viresh Ratnakar
George F. Foster
LRM
27
154
0
16 Nov 2022
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari
Nikhil Singh
Amrith Krishna
Ganesh Ramakrishnan
31
12
0
15 Nov 2022
mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
Shubham Mittal
Keshav Kolluru
Soumen Chakrabarti
Mausam
33
4
0
13 Nov 2022
Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness
C. Hazirbas
Yejin Bang
Tiezheng Yu
Parisa Assar
Bilal Porgali
...
Jacqueline Pan
Emily McReynolds
Miranda Bogen
Pascale Fung
Cristian Canton Ferrer
32
8
0
10 Nov 2022
DiaASQ : A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis
Bobo Li
Hao Fei
Fei Li
Yu-hao Wu
Jinsong Zhang
...
Jingye Li
Yijiang Liu
Lizi Liao
Tat-Seng Chua
Donghong Ji
34
41
0
10 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
118
2,315
0
09 Nov 2022
Local Structure Matters Most in Most Languages
Louis Clouâtre
Prasanna Parthasarathi
Amal Zouaq
Sarath Chandar
39
1
0
09 Nov 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
Songbo Hu
Ivan Vulić
Fangyu Liu
Anna Korhonen
41
0
0
07 Nov 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
Mingqi Li
Fei Ding
Dan Zhang
Long Cheng
Hongxin Hu
Feng Luo
41
6
0
02 Nov 2022
Dialect-robust Evaluation of Generated Text
Jiao Sun
Thibault Sellam
Elizabeth Clark
Tu Vu
Timothy Dozat
Dan Garrette
Aditya Siddhant
Jacob Eisenstein
Sebastian Gehrmann
23
19
0
02 Nov 2022
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
Yihan Wang
Si Si
Daliang Li
Michal Lukasik
Felix X. Yu
Cho-Jui Hsieh
Inderjit S Dhillon
Sanjiv Kumar
46
29
0
01 Nov 2022
TaTa: A Multilingual Table-to-Text Dataset for African Languages
Sebastian Gehrmann
Sebastian Ruder
Vitaly Nikolaev
Jan A. Botha
Michael Chavinda
Ankur P. Parikh
Clara E. Rivera
LMTD
27
10
0
31 Oct 2022
Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models
Harshita Diddee
Sandipan Dandapat
Monojit Choudhury
T. Ganu
Kalika Bali
31
5
0
27 Oct 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
39
32
0
25 Oct 2022
IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension
Rifki Afina Putri
Alice Oh
33
9
0
25 Oct 2022
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing
Peng Shi
Rui Zhang
Richard He Bai
Jimmy J. Lin
RALM
41
42
0
25 Oct 2022
EUR-Lex-Sum: A Multi- and Cross-lingual Dataset for Long-form Summarization in the Legal Domain
Dennis Aumiller
Ashish Chouhan
Michael Gertz
ELM
AILaw
52
35
0
24 Oct 2022