Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.11934
Cited By
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 475 papers shown
Title
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
43
1
0
22 May 2023
Multilingual Simplification of Medical Texts
Sebastian Antony Joseph
Kathryn Kazanas
Keziah Reina
Vishnesh J. Ramanathan
Wei-ping Xu
Byron C. Wallace
Junyi Jessy Li
38
13
0
21 May 2023
SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction
Jun-Yu Ma
Jia-Chen Gu
Zhen-Hua Ling
Quan Liu
Cong Liu
Guoping Hu
59
1
0
21 May 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALM
LRM
42
95
0
20 May 2023
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
J. Clark
Alexander Gutkin
Mihir Kale
Min Ma
...
Dan Garrette
R. Ingle
Melvin Johnson
Dmitry Panteleev
Partha P. Talukdar
ELM
26
38
0
19 May 2023
mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
David C. Uthus
Santiago Ontañón
Joshua Ainslie
Mandy Guo
VLM
28
10
0
18 May 2023
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonccalves de Aguiar Neto
C. F. G. Santos
30
23
0
18 May 2023
Generalized Multiple Intent Conditioned Slot Filling
Harshil Shah
Arthur Wilcke
Marius Cobzarenco
Cristian C Cobzarenco
Edward Challis
David Barber
16
0
0
18 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
128
1,152
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
42
97
0
17 May 2023
Unsupervised Sentence Representation Learning with Frequency-induced Adversarial Tuning and Incomplete Sentence Filtering
Bing Wang
Ximing Li
Zhiyao Yang
Yuanyuan Guan
Jiayin Li
Sheng-sheng Wang
35
6
0
15 May 2023
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
Lei Liu
J. Huang
CLL
29
2
0
12 May 2023
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Rahul Aralikatte
Ziling Cheng
Sumanth Doddapaneni
Jackie C.K. Cheung
48
8
0
10 May 2023
An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text
Yova Kementchedjhieva
Ilias Chalkidis
34
21
0
09 May 2023
CSED: A Chinese Semantic Error Diagnosis Corpus
Bo Sun
Baoxin Wang
Yixuan Wang
Wanxiang Che
Dayong Wu
Shijin Wang
Ting Liu
37
4
0
09 May 2023
NorBench -- A Benchmark for Norwegian Language Models
David Samuel
Andrey Kutuzov
Samia Touileb
Erik Velldal
Lilja Ovrelid
Egil Rønningstad
Elina Sigdel
Anna Palatkina
26
23
0
06 May 2023
SLTUNET: A Simple Unified Model for Sign Language Translation
Biao Zhang
Mathias Müller
Rico Sennrich
SLR
43
33
0
02 May 2023
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
28
24
0
19 Apr 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
29
50
0
18 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavavs
Ivan Vulić
39
19
0
18 Apr 2023
Masakhane-Afrisenti at SemEval-2023 Task 12: Sentiment Analysis using Afro-centric Language Models and Adapters for Low-resource African Languages
Israel Abebe Azime
Sana Al-Azzawi
A. Tonja
Iyanuoluwa Shode
Jesujoba Oluwadara Alabi
...
Mardiyyah Oduwole
Tosin P. Adewumi
Samuel Fanijo
Oyinkansola F. Awosan
Oreen Yousuf
25
1
0
13 Apr 2023
Computational modeling of semantic change
Nina Tahmasebi
Haim Dubossarsky
34
6
0
13 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Ovrelid
Erik Velldal
27
4
0
12 Apr 2023
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages
Gabriela Pałka
Artur Nowakowski
32
2
0
11 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
Document-Level Machine Translation with Large Language Models
Longyue Wang
Chenyang Lyu
Tianbo Ji
Zhirui Zhang
Dian Yu
Shuming Shi
Zhaopeng Tu
ELM
28
116
0
05 Apr 2023
Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
Jinhyuk Lee
Zhuyun Dai
Sai Meher Karthik Duddu
Tao Lei
Iftekhar Naim
Ming-Wei Chang
Vincent Zhao
24
15
0
04 Apr 2023
Resources and Few-shot Learners for In-context Learning in Slavic Languages
Michal vStefánik
Marek Kadlcík
Piotr Gramacki
Petr Sojka
29
3
0
04 Apr 2023
SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism
Mehwish Fatima
Tim Kolber
K. Markert
Michael Strube
21
0
0
04 Apr 2023
PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation
Alireza Salemi
Amirhossein Abaskohi
Sara Tavakoli
Yadollah Yaghoobzadeh
A. Shakery
AIMat
27
0
0
03 Apr 2023
Summarizing Indian Languages using Multilingual Transformers based Models
Dhaval Taunk
Vasudeva Varma
VLM
27
9
0
29 Mar 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
36
960
0
27 Mar 2023
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization
Bashar Al-Rfooh
Gheith A. Abandah
Rami Al-Rfou
26
4
0
25 Mar 2023
XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages
Dhaval Taunk
Shivprasad Sagare
Anupam Patil
Shivansh Subramanian
Manish Gupta
Vasudeva Varma
25
3
0
22 Mar 2023
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar
Abbaraju Soujanya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
VLM
33
1
0
04 Mar 2023
Cross-Lingual Question Answering over Knowledge Base as Reading Comprehension
Chen Zhang
Yuxuan Lai
Yansong Feng
Xingyu Shen
Haowei Du
Dongyan Zhao
21
3
0
26 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
48
80
0
23 Feb 2023
Connecting Vision and Language with Video Localized Narratives
P. Voigtlaender
Soravit Changpinyo
Jordi Pont-Tuset
Radu Soricut
V. Ferrari
VGen
52
21
0
22 Feb 2023
Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation
M. Moradshahi
Sina J. Semnani
M. Lam
30
7
0
18 Feb 2023
Distillation of encoder-decoder transformers for sequence labelling
M. Farina
D. Pappadopulo
Anant Gupta
Leslie Huang
Ozan Irsoy
Thamar Solorio
VLM
105
3
0
10 Feb 2023
The unreasonable effectiveness of few-shot learning for machine translation
Xavier Garcia
Yamini Bansal
Colin Cherry
George F. Foster
M. Krikun
Fan Feng
Melvin Johnson
Orhan Firat
38
102
0
02 Feb 2023
idT5: Indonesian Version of Multilingual T5 Transformer
Mukhlish Fuadi
A. Wibawa
S. Sumpeno
19
6
0
02 Feb 2023
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets
Tosin P. Adewumi
Isabella Sodergren
Lama Alkhaled
Sana Sabah Sabry
F. Liwicki
Marcus Liwicki
41
4
0
28 Jan 2023
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER
Xiang Chen
Lei Li
Q. Fei
Ningyu Zhang
Chuanqi Tan
Yong-jia Jiang
Fei Huang
Huajun Chen
26
23
0
25 Jan 2023
Truveta Mapper: A Zero-shot Ontology Alignment Framework
Mariyam Amir
Murchana Baruah
Mahsa Eslamialishah
Sina Ehsani
Alireza Bahramali
Sadra Naddaf-sh
Saman Zarandioon
32
7
0
24 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
36
9
0
19 Jan 2023
Curriculum Script Distillation for Multilingual Visual Question Answering
Khyathi Raghavi Chandu
A. Geramifard
30
0
0
17 Jan 2023
On the State of German (Abstractive) Text Summarization
Dennis Aumiller
Jing Fan
Michael Gertz
26
1
0
17 Jan 2023
Prompting Large Language Model for Machine Translation: A Case Study
Biao Zhang
Barry Haddow
Alexandra Birch
LRM
27
275
0
17 Jan 2023
GAE-ISumm: Unsupervised Graph-Based Summarization of Indian Languages
Lakshmi Sireesha Vakada
Anudeep Ch
Mounika Marreddy
S. Oota
R. Mamidi
22
1
0
25 Dec 2022
Previous
1
2
3
4
5
6
...
8
9
10
Next