ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.07580
  4. Cited By
mGPT: Few-Shot Learners Go Multilingual
v1v2 (latest)

mGPT: Few-Shot Learners Go Multilingual

15 April 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
ArXiv (abs)PDFHTML

Papers citing "mGPT: Few-Shot Learners Go Multilingual"

50 / 104 papers shown
Title
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
J. Michaelov
Reeka Estacio
Zhien Zhang
Benjamin Bergen
ReLMLRM
26
0
0
07 Jun 2025
Emergent Abilities of Large Language Models under Continued Pretraining for Language Adaptation
Emergent Abilities of Large Language Models under Continued Pretraining for Language Adaptation
Ahmed Elhady
Eneko Agirre
Mikel Artetxe
CLLKELMELM
35
0
0
30 May 2025
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
56
0
0
14 May 2025
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation
Baban Gain
Dibyanayan Bandyopadhyay
Asif Ekbal
LM&MA
94
2
0
02 Apr 2025
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment
Jong Myoung Kim
Young-Jun_Lee
Ho-Jin Choi
Sangkeun Jung
102
0
0
24 Mar 2025
Strategic resource allocation in memory encoding: An efficiency principle shaping language processing
Strategic resource allocation in memory encoding: An efficiency principle shaping language processing
Weijie Xu
Richard Futrell
103
1
0
18 Mar 2025
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Yanjie Wang
Zhuohan Xie
...
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
Preslav Nakov
ALMKELM
130
0
0
03 Mar 2025
Multilingual Language Model Pretraining using Machine-translated Data
Multilingual Language Model Pretraining using Machine-translated Data
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
David Ifeoluwa Adelani
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
130
5
0
20 Feb 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
Zhiwen Ruan
Yixia Li
He Zhu
Longyue Wang
Weihua Luo
Kaifu Zhang
Yuxiao Chen
Guanhua Chen
111
1
0
17 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
193
6
0
10 Feb 2025
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge
  Neurons in Large Language Models
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao
Yuheng Chen
Zhuoran Jin
Yubo Chen
Kang Liu
Jun Zhao
KELM
115
0
0
26 Nov 2024
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
Zhaopeng Tu
VLM
262
0
0
21 Nov 2024
Multilingual Pretraining Using a Large Corpus Machine-Translated from a
  Single Source Language
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
104
1
0
31 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
174
2
0
22 Oct 2024
Linguistically-Informed Multilingual Instruction Tuning: Is There an
  Optimal Set of Languages to Tune?
Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?
Gürkan Soykan
Gözde Gül Şahin
51
1
0
10 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
107
3
0
06 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode
  Linguistic Properties for Indic Languages?
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
Subba Reddy Oota
R. Mamidi
Manish Gupta
89
0
0
03 Oct 2024
LangSAMP: Language-Script Aware Multilingual Pretraining
LangSAMP: Language-Script Aware Multilingual Pretraining
Yihong Liu
Haotian Ye
Chunlan Ma
Mingyang Wang
Hinrich Schütze
VLM
246
0
0
26 Sep 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
120
7
0
26 Sep 2024
How Transliterations Improve Crosslingual Alignment
How Transliterations Improve Crosslingual Alignment
Yihong Liu
Mingyang Wang
Amir Hossein Kargaran
Ayyoob Imani
Orgest Xhelili
Haotian Ye
Chunlan Ma
François Yvon
Hinrich Schütze
89
4
0
25 Sep 2024
Pruning Multilingual Large Language Models for Multilingual Inference
Pruning Multilingual Large Language Models for Multilingual Inference
Hwichan Kim
Jun Suzuki
Tosho Hirasawa
Mamoru Komachi
76
0
0
25 Sep 2024
On the Role of Context in Reading Time Prediction
On the Role of Context in Reading Time Prediction
Andreas Opedal
Eleanor Chodroff
Ryan Cotterell
Ethan Gotlieb Wilcox
101
8
0
12 Sep 2024
PsychoLex: Unveiling the Psychological Mind of Large Language Models
PsychoLex: Unveiling the Psychological Mind of Large Language Models
Mohammad Amin Abbasi
Farnaz Sadat Mirnezami
Hassan Naderi
LM&MA
68
2
0
16 Aug 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
122
10
0
08 Jul 2024
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs
Ekaterina Taktasheva
Maxim Bazhukov
Kirill Koncha
Alena Fenogenova
Ekaterina Artemova
Vladislav Mikhailov
101
13
0
27 Jun 2024
Preference Tuning For Toxicity Mitigation Generalizes Across Languages
Preference Tuning For Toxicity Mitigation Generalizes Across Languages
Xiaochen Li
Zheng-Xin Yong
Stephen H. Bach
CLL
96
18
0
23 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation
  Offer an Alternative to Human Translations?
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
78
2
0
20 Jun 2024
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+
  Languages
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
Fabian David Schmidt
Philipp Borchert
Ivan Vulić
Goran Glavaš
76
6
0
18 Jun 2024
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked
  Low-Rank Adaptation
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
Jiakuan Xie
Pengfei Cao
Yuheng Chen
Yubo Chen
Kang Liu
Jun Zhao
KELM
98
6
0
17 Jun 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for
  Low-Resource Languages
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
116
1
0
14 Jun 2024
MACT: Model-Agnostic Cross-Lingual Training for Discourse Representation
  Structure Parsing
MACT: Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing
Jiangming Liu
93
1
0
03 Jun 2024
Multilingual Text Style Transfer: Datasets & Models for Indian Languages
Multilingual Text Style Transfer: Datasets & Models for Indian Languages
Sourabrata Mukherjee
Atul Kr. Ojha
Akanksha Bansal
D. Alok
John P. Mccrae
Ondrej Dusek
VLM
57
7
0
31 May 2024
ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios
ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios
Markus Bayer
Justin Lutz
Christian A. Reuter
127
7
0
17 May 2024
LlamaTurk: Adapting Open-Source Generative Large Language Models for
  Low-Resource Language
LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language
Cagri Toraman
VLM
112
5
0
13 May 2024
Bridging the Bosphorus: Advancing Turkish Large Language Models through
  Strategies for Low-Resource Language Adaptation and Benchmarking
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Emre Can Acikgoz
Mete Erdogan
Deniz Yuret
80
8
0
07 May 2024
What Drives Performance in Multilingual Language Models?
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad
Ameeta Agrawal
LRM
101
10
0
29 Apr 2024
Introducing cosmosGPT: Monolingual Training for Turkish Language Models
Introducing cosmosGPT: Monolingual Training for Turkish Language Models
Himmet Toprak Kesgin
M. K. Yuce
Eren Dogan
M. E. Uzun
Atahan Uz
H. E. Seyrek
Ahmed Zeer
M. Amasyalı
89
11
0
26 Apr 2024
Türkçe Dil Modellerinin Performans
  Karşılaştırması Performance Comparison of Turkish Language
  Models
Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models
Eren Dogan
M. E. Uzun
Atahan Uz
H. E. Seyrek
Ahmed Zeer
Ezgi Sevi
Himmet Toprak Kesgin
M. K. Yuce
M. Amasyalı
ELM
57
0
0
25 Apr 2024
Understanding Cross-Lingual Alignment -- A Survey
Understanding Cross-Lingual Alignment -- A Survey
Katharina Hämmerl
Jindvrich Libovický
Alexander Fraser
83
14
0
09 Apr 2024
SambaLingo: Teaching Large Language Models New Languages
SambaLingo: Teaching Large Language Models New Languages
Zoltan Csaki
Bo Li
Jonathan Li
Qiantong Xu
Pian Pawakapan
Leon Zhang
Yun Du
Hengyu Zhao
Changran Hu
Urmish Thakker
90
6
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and
  Frontiers
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
162
38
0
07 Apr 2024
MultiParaDetox: Extending Text Detoxification with Parallel Data to New
  Languages
MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages
Daryna Dementieva
N. Babakov
Alexander Panchenko
79
9
0
02 Apr 2024
Aurora-M: The First Open Source Multilingual Language Model Red-teamed
  according to the U.S. Executive Order
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Taishi Nakamura
Mayank Mishra
Simone Tedeschi
Yekun Chai
Jason T Stillerman
...
Virendra Mehta
Matthew Blumberg
Victor May
Huu Nguyen
S. Pyysalo
LRM
86
8
0
30 Mar 2024
Latxa: An Open Language Model and Evaluation Suite for Basque
Latxa: An Open Language Model and Evaluation Suite for Basque
Julen Etxaniz
Oscar Sainz
Naiara Pérez
Itziar Aldabe
German Rigau
Eneko Agirre
Aitor Ormazabal
Mikel Artetxe
A. Soroa
ELM
67
32
0
29 Mar 2024
IndiBias: A Benchmark Dataset to Measure Social Biases in Language
  Models for Indian Context
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Nihar Ranjan Sahoo
Pranamya Prashant Kulkarni
Narjis Asad
Arif Ahmad
Tanu Goyal
Aparna Garimella
Pushpak Bhattacharyya
107
12
0
29 Mar 2024
Attention-aware semantic relevance predicting Chinese sentence reading
Attention-aware semantic relevance predicting Chinese sentence reading
Kun Sun
95
1
0
27 Mar 2024
RuBia: A Russian Language Bias Detection Dataset
RuBia: A Russian Language Bias Detection Dataset
Veronika Grigoreva
Anastasiia Ivanova
I. Alimova
Ekaterina Artemova
100
1
0
26 Mar 2024
Computational Sentence-level Metrics Predicting Human Sentence
  Comprehension
Computational Sentence-level Metrics Predicting Human Sentence Comprehension
Kun Sun
Rong Wang
78
0
0
23 Mar 2024
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing
Michael Ginn
Lindia Tjuatja
Taiqi He
Enora Rice
Graham Neubig
Alexis Palmer
Lori Levin University of Colorado
96
4
0
11 Mar 2024
From One to Many: Expanding the Scope of Toxicity Mitigation in Language
  Models
From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models
Luiza Amador Pozzobon
Patrick Lewis
Sara Hooker
Beyza Ermis
97
12
0
06 Mar 2024
123
Next