ResearchTrend.AI

arXiv: 2204.07580
mGPT: Few-Shot Learners Go Multilingual

15 April 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina

Papers citing "mGPT: Few-Shot Learners Go Multilingual"

50 / 104 papers shown
Title
Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?
E. Razumovskaia
Ivan Vulić
Anna Korhonen
87
7
0
04 Mar 2024
On the Scaling Laws of Geographical Representation in Language Models
Nathan Godey
Eric Villemonte de la Clergerie
Benoît Sagot
106
8
0
29 Feb 2024
Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*
Rodrigo Santos
Joao Silva
Luís Gomes
João Rodrigues
António Branco
92
10
0
29 Feb 2024
Spot the bot: Coarse-Grained Partition of Semantic Paths for Bots and Humans
Vasilii A. Gromov
A. S. Kogan
86
1
0
27 Feb 2024
GlórIA -- A Generative and Open Large Language Model for Portuguese
Ricardo Lopes
João Magalhães
David Semedo
61
8
0
20 Feb 2024
Do Llamas Work in English? On the Latent Language of Multilingual Transformers
Chris Wendler
V. Veselovsky
Giovanni Monea
Robert West
144
132
0
16 Feb 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Ahmet Üstün
Viraat Aryabumi
Zheng-Xin Yong
Wei-Yin Ko
Daniel D'souza
...
Shayne Longpre
Niklas Muennighoff
Marzieh Fadaee
Julia Kreutzer
Sara Hooker
ALM, ELM, SyDa, LRM
98
230
0
12 Feb 2024
Text Detoxification as Style Transfer in English and Hindi
Sourabrata Mukherjee
Akanksha Bansal
Atul Kr. Ojha
John P. Mccrae
Ondrej Dusek
61
9
0
12 Feb 2024
From Partial to Strictly Incremental Constituent Parsing
Ana Ezquerro
Carlos Gómez-Rodríguez
David Vilares
40
0
0
05 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
Céline Hudelot
Pierre Colombo
161
37
0
01 Feb 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
89
9
0
30 Jan 2024
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Bolei Ma
Ercong Nie
Shuzhou Yuan
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
VLM
147
6
0
29 Jan 2024
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
Gökçe Uludoğan
Zeynep Yirmibeşoğlu Balal
Furkan Akkurt
Melikşah Türker
Onur Gungor
S. Uskudarli
67
12
0
25 Jan 2024
MaLA-500: Massive Language Adaptation of Large Language Models
Peiqin Lin
Shaoxiong Ji
Jörg Tiedemann
André F. T. Martins
Hinrich Schütze
ELM
114
18
0
24 Jan 2024
Milestones in Bengali Sentiment Analysis leveraging Transformer-models: Fundamentals, Challenges and Future Directions
Saptarshi Sengupta
Shreya Ghosh
Prasenjit Mitra
Tarikul Islam Tamiti
66
2
0
15 Jan 2024
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Zhuoyuan Mao
Yen Yu
ALM
58
2
0
11 Jan 2024
MERA: A Comprehensive LLM Evaluation in Russian
Alena Fenogenova
Artem Chervyakov
Nikita Martynov
Anastasia Kozlova
Maria Tikhonova
...
Nikita Savushkin
Polina Mikhailova
Denis Dimitrov
Alexander Panchenko
Sergey Markov
ELM
97
12
0
09 Jan 2024
PersianLLaMA: Towards Building First Persian Large Language Model
Mohammad Amin Abbasi
A. Ghafouri
Mahdi Firouzmandi
Hassan Naderi
B. Minaei-Bidgoli
90
9
0
25 Dec 2023
Predicting Human Translation Difficulty with Neural Machine Translation
Zheng Wei Lim
Ekaterina Vylomova
Charles Kemp
Trevor Cohn
102
0
0
19 Dec 2023
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu
Peiqin Lin
Mingyang Wang
Hinrich Schütze
71
29
0
15 Nov 2023
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structure
David Arps
Laura Kallmeyer
Younes Samih
Hassan Sajjad
66
2
0
13 Nov 2023
Efficiently Adapting Pretrained Language Models To New Languages
Zoltan Csaki
Pian Pawakapan
Urmish Thakker
Qiantong Xu
CLL
99
18
0
09 Nov 2023
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection
Gretel Liz De la Pena Sarracén
Paolo Rosso
Robert Litschko
Goran Glavaš
Simone Paolo Ponzetto
52
1
0
03 Nov 2023
Do large language models solve verbal analogies like children do?
Claire E. Stevenson
Mathilde ter Veen
Rochelle Choenni
Han L. J. van der Maas
Ekaterina Shutova
LRM
30
8
0
31 Oct 2023
Domain Terminology Integration into Machine Translation: Leveraging Large Language Models
Yasmin Moslem
Gianfranco Romani
Mahdi Molaei
Rejwanul Haque
John D. Kelleher
Andy Way
71
22
0
22 Oct 2023
On Bilingual Lexicon Induction with Large Language Models
Yaoyiran Li
Anna Korhonen
Ivan Vulić
74
3
0
21 Oct 2023
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Richard Rutmann
Max Lübbering
...
Malte Ostendorff
Samuel Weinbach
R. Sifa
Stefan Kesselheim
Nicolas Flores-Herr
114
61
0
12 Oct 2023
Exploring the Maze of Multilingual Modeling
Sina Bagheri Nezhad
Ameeta Agrawal
90
1
0
09 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
92
153
0
08 Oct 2023
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Atakan Kara
Farrin Marouf Sofian
Andrew Bond
Gözde Gül Sahin
53
4
0
20 Sep 2023
A Family of Pretrained Transformer Language Models for Russian
Dmitry Zmitrovich
Alexander Abramov
Andrey Kalmykov
Maria Tikhonova
Ekaterina Taktasheva
...
Vitalii Kadulin
Sergey Markov
Tatiana Shavrina
Vladislav Mikhailov
Alena Fenogenova
99
26
0
19 Sep 2023
Multilingual Text Representation
Fahim Faisal
48
0
0
02 Sep 2023
Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons
Yuheng Chen
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
KELM
95
49
0
25 Aug 2023
Testing the Predictions of Surprisal Theory in 11 Languages
Ethan Gotlieb Wilcox
Tiago Pimentel
Clara Meister
Ryan Cotterell
R. Levy
LRM
165
70
0
07 Jul 2023
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Jiacheng Ye
Xijia Tao
Lingpeng Kong
LRM
75
27
0
11 Jun 2023
ModuleFormer: Modularity Emerges from Mixture-of-Experts
Songlin Yang
Zheyu Zhang
Tianyou Cao
Shawn Tan
Zhenfang Chen
Chuang Gan
KELM, MoE
54
10
0
07 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM, FedML
77
4
0
04 Jun 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei Xu
108
95
0
23 May 2023
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
Peiqin Lin
Chengzhi Hu
Zheyu Zhang
André F. T. Martins
Hinrich Schütze
68
1
0
23 May 2023
Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Miriam Anschütz
Joshua Oehms
Thomas Wimmer
Bartlomiej Jezierski
Georg Groh
54
22
0
22 May 2023
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
66
1
0
22 May 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP, VLM, CLL
120
28
0
23 Jan 2023
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
AbdelRahim Elmadany
Alcides Alcoba Inciarte
Md. Tawkat Islam Khondaker
77
8
0
21 Dec 2022
Geographic and Geopolitical Biases of Language Models
Fahim Faisal
Antonios Anastasopoulos
94
21
0
20 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM, RALM, LRM
108
22
0
19 Dec 2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Zheng-Xin Yong
Hailey Schoelkopf
Niklas Muennighoff
Alham Fikri Aji
David Ifeoluwa Adelani
...
Genta Indra Winata
Stella Biderman
Edward Raff
Dragomir R. Radev
Vassilina Nikoulina
CLL, VLM, AI4CE, LRM
147
89
0
19 Dec 2022
In-context Examples Selection for Machine Translation
Sweta Agrawal
Chunting Zhou
M. Lewis
Luke Zettlemoyer
Marjan Ghazvininejad
LRM
120
198
0
05 Dec 2022
Legal Prompt Engineering for Multilingual Legal Judgement Prediction
Dietrich Trautmann
Alina Petrova
Frank Schilder
ELM, AILaw
99
80
0
05 Dec 2022
Prompting Language Models for Linguistic Structure
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
125
44
0
15 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
468
2,398
0
09 Nov 2022