2010.11934
Cited By
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 469 papers shown
Title
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
34
1
0
06 Apr 2024
ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model
Osvaldo Luamba Quinjica
David Ifeoluwa Adelani
46
0
0
03 Apr 2024
METAL: Towards Multilingual Meta-Evaluation
Rishav Hada
Varun Gumma
Mohamed Ahmed
Kalika Bali
Sunayana Sitaram
ELM
43
2
0
02 Apr 2024
A Controlled Reevaluation of Coreference Resolution Models
Ian Porada
Xiyuan Zou
Jackie Chi Kit Cheung
35
1
0
31 Mar 2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee
Zhuyun Dai
Xiaoqi Ren
Blair Chen
Daniel Cer
...
Aditya Kusupati
Prateek Jain
Siddhartha Reddy Jonnalagadda
Ming-Wei Chang
Iftekhar Naim
RALM
VLM
SyDa
48
41
0
29 Mar 2024
ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
Paramita Mirza
Viju Sudhi
S. Sahoo
Sinchana Ramakanth Bhat
25
4
0
26 Mar 2024
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation
Nikola Ljubesic
Taja Kuzman
38
3
0
19 Mar 2024
Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean
Dojun Park
Sebastian Padó
37
1
0
19 Mar 2024
Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao
Naoki Yoshinaga
Daisuke Oba
KELM
HILM
39
10
0
08 Mar 2024
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
Laura Mascarell
Ribin Chalumattu
Annette Rios
HILM
46
0
0
06 Mar 2024
PeLLE: Encoder-based language models for Brazilian Portuguese based on open data
Guilherme Lamartine de Mello
Marcelo Finger
F. Serras
M. Carpi
Marcos Menon Jose
Pedro Henrique Domingues
Paulo Cavalim
39
0
0
29 Feb 2024
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
Takaaki Saeki
Gary Wang
Nobuyuki Morioka
Isaac Elias
Kyle Kastner
...
Andrew Rosenberg
Bhuvana Ramabhadran
Heiga Zen
Francoise Beaufays
Hadar Shemtov
38
13
0
29 Feb 2024
Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*
Rodrigo Santos
Joao Silva
Luís Gomes
João Rodrigues
António Branco
46
10
0
29 Feb 2024
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
Yuting Yang
Andrea Merlina
Weijia Song
Tiancheng Yuan
Ken Birman
Roman Vitenberg
49
0
0
27 Feb 2024
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages
Yuan Zhang
Yile Wang
Zijun Liu
Shuo Wang
Xiaolong Wang
Peng Li
Maosong Sun
Yang Liu
LRM
35
11
0
19 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write
B. Mitrevski
Arina Rak
Julian Schnitzler
Chengkun Li
Andrii Maksai
Jesse Berent
C. Musat
DiffM
34
0
0
08 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
Céline Hudelot
Pierre Colombo
55
32
0
01 Feb 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
30
9
0
30 Jan 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
35
1
0
25 Jan 2024
In-context Learning with Retrieved Demonstrations for Language Models: A Survey
Man Luo
Xin Xu
Yue Liu
Panupong Pasupat
Mehran Kazemi
RALM
34
55
0
21 Jan 2024
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
Aiqi Jiang
A. Zubiaga
AAML
31
3
0
17 Jan 2024
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Haoran Xu
Amr Sharaf
Yunmo Chen
Weiting Tan
Lingfeng Shen
Benjamin Van Durme
Kenton W. Murray
Young Jin Kim
ALM
64
211
0
16 Jan 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai
Xiyang Liao
Alessandro Suglia
Antonio Vergari
MLLM
26
7
0
06 Jan 2024
Automatic Data Retrieval for Cross Lingual Summarization
Nikhilesh Bhatnagar
Ashok Urlana
Vandan Mujadia
Pruthwik Mishra
D. Sharma
32
0
0
22 Dec 2023
WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data
Maurice Weber
Carlo Siebenschuh
Rory Butler
Anton Alexandrov
Valdemar Thanner
...
Haris Jabbar
Ian Foster
Bo-wen Li
Rick L. Stevens
Ce Zhang
21
4
0
15 Dec 2023
Multilingual large language models leak human stereotypes across language boundaries
Yang Trista Cao
Anna Sotnikova
Jieyu Zhao
Linda X. Zou
Rachel Rudinger
Hal Daumé
PILM
33
10
0
12 Dec 2023
Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning
Yougang Lyu
Jitai Hao
Zihan Wang
Kai Zhao
Shen Gao
Pengjie Ren
Zhumin Chen
Fang Wang
Zhaochun Ren
AILaw
21
9
0
10 Dec 2023
Leveraging Domain Adaptation and Data Augmentation to Improve Qur'anic IR in English and Arabic
Vera Pavlova
23
2
0
05 Dec 2023
Towards A Foundation Model For Trajectory Intelligence
Alameen Najjar
15
2
0
30 Nov 2023
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea W Wen-Yi
David Mimno
33
14
0
29 Nov 2023
RETSim: Resilient and Efficient Text Similarity
Marina Zhang
Owen Vallis
Aysegul Bumin
Tanay Vakharia
Elie Bursztein
36
1
0
28 Nov 2023
DP-NMT: Scalable Differentially-Private Machine Translation
Timour Igamberdiev
Doan Nam Long Vu
Felix Künnecke
Zhuo Yu
Jannik Holmer
Ivan Habernal
37
7
0
24 Nov 2023
Dialogue Quality and Emotion Annotations for Customer Support Conversations
John Mendonça
Patrícia Pereira
Miguel Menezes
Vera Cabarrão
Ana C. Farinha
Helena Moniz
Joao Paulo Carvalho
A. Lavie
Isabel Trancoso
18
3
0
23 Nov 2023
Vashantor: A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Fatema Tuj Johora Faria
Mukaffi Bin Moin
Ahmed Al Wase
Mehidi Ahmmed
Md. Rabius Sani
Tashreef Muhammad
24
5
0
18 Nov 2023
Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking
Hong Liu
Yucheng Cai
Yuan Zhou
Zhijian Ou
Yi Huang
Junlan Feng
CLL
24
2
0
17 Nov 2023
Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning
Kazuma Hashimoto
K. Raman
Michael Bendersky
39
2
0
16 Nov 2023
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
Alexandra Chronopoulou
Jonas Pfeiffer
Joshua Maynez
Xinyi Wang
Sebastian Ruder
Priyanka Agrawal
MoMe
26
16
0
15 Nov 2023
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
J. Michaelov
Catherine Arnett
Tyler A. Chang
Benjamin Bergen
36
12
0
15 Nov 2023
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan
Shuai Yuan
Zhiyong Wu
Lei Li
37
10
0
15 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
30
3
0
15 Nov 2023
On the Analysis of Cross-Lingual Prompt Tuning for Decoder-based Multilingual Model
Nohil Park
Joonsuk Park
Kang Min Yoo
Sungroh Yoon
36
3
0
14 Nov 2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
Nandan Thakur
Jianmo Ni
Gustavo Hernández Ábrego
John Wieting
Jimmy J. Lin
Daniel Cer
RALM
31
12
0
10 Nov 2023
There's no Data Like Better Data: Using QE Metrics for MT Data Filtering
Jan-Thorsten Peter
David Vilar
Daniel Deutsch
Mara Finkelstein
Juraj Juraska
Markus Freitag
19
17
0
09 Nov 2023
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Fei Wu
Jiwei Li
Tianwei Zhang
Guoyin Wang
35
16
0
03 Nov 2023
Explicit Morphological Knowledge Improves Pre-training of Language Models for Hebrew
Eylon Gueta
Omer Goldman
Reut Tsarfaty
11
1
0
01 Nov 2023
Generating Medical Prescriptions with Conditional Transformer
Samuel Belkadi
Nicolo Micheletti
Lifeng Han
Warren Del-Pinto
Goran Nenadic
MedIm
29
5
0
30 Oct 2023
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
18
5
0
26 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
36
6
0
23 Oct 2023
Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering
Ofir Arviv
Dmitry Nikolaev
Taelin Karidi
Omri Abend
LRM
38
3
0
20 Oct 2023