Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.09093
Cited By
Are All Languages Created Equal in Multilingual BERT?
18 May 2020
Shijie Wu
Mark Dredze
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Are All Languages Created Equal in Multilingual BERT?"
50 / 174 papers shown
Title
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
31
0
0
21 Apr 2025
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
Ej Zhou
Weiming Lu
28
0
0
15 Apr 2025
Comparing Human Expertise and Large Language Models Embeddings in Content Validity Assessment of Personality Tests
Nicola Milano
Michela Ponticorvo
Davide Marocco
ALM
50
0
0
15 Mar 2025
Optimal word order for non-causal text generation with Large Language Models: the Spanish case
Andrea Busto-Castiñeira
Silvia García-Méndez
Francisco de Arriba-Pérez
Francisco J. González Castaño
41
0
0
21 Feb 2025
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Philipp Borchert
Ivan Vulić
Marie-Francine Moens
Jochen De Weerdt
41
0
0
12 Jan 2025
Prompting with Phonemes: Enhancing LLMs' Multilinguality for Non-Latin Script Languages
Hoang Nguyen
Khyati Mahajan
Vikas Yadav
Philip S. Yu
Masoud Hashemi
Rishabh Maheshwary
Rishabh Maheshwary
49
0
0
04 Nov 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
25
1
0
15 Oct 2024
Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Tomás Feith
Akhil Arora
Martin Gerlach
Debjit Paul
Robert West
KELM
28
0
0
05 Oct 2024
Predictability and Causality in Spanish and English Natural Language Generation
Andrea Busto-Castiñeira
Francisco J. González Castaño
Silvia García-Méndez
Francisco de Arriba-Pérez
CML
54
1
0
26 Aug 2024
MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Hao Zhou
Zhijun Wang
Shujian Huang
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Weihua Luo
Jiajun Chen
CLL
MoE
54
5
0
21 Aug 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
44
4
0
19 Aug 2024
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Orevaoghene Ahia
Sachin Kumar
Hila Gonen
Valentin Hoffman
Tomasz Limisiewicz
Yulia Tsvetkov
Noah A. Smith
51
4
0
11 Jul 2024
Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
Abeba Birhane
Marek McGann
20
7
0
11 Jul 2024
Soft Language Prompts for Language Transfer
Ivan Vykopal
Simon Ostermann
Marian Simko
AAML
42
1
0
02 Jul 2024
Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters
Daniil Gurgurov
Mareike Hartmann
Simon Ostermann
47
6
0
01 Jul 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
39
1
0
20 Jun 2024
Probing the Emergence of Cross-lingual Alignment during LLM Training
Hetong Wang
Pasquale Minervini
E. Ponti
39
8
0
19 Jun 2024
Synergizing Foundation Models and Federated Learning: A Survey
Shenghui Li
Fanghua Ye
Meng Fang
Jiaxu Zhao
Yun-Hin Chan
Edith C. -H. Ngai
Thiemo Voigt
AI4CE
57
5
0
18 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
86
4
0
15 Jun 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
42
1
0
14 Jun 2024
Decoding the Diversity: A Review of the Indic AI Research Landscape
Sankalp KJ
Vinija Jain
S. Bhaduri
Tamoghna Roy
Aman Chadha
55
5
0
13 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
34
0
0
11 Jun 2024
An Open Multilingual System for Scoring Readability of Wikipedia
Mykola Trokhymovych
Indira Sen
Martin Gerlach
40
3
0
03 Jun 2024
Targeted Multilingual Adaptation for Low-resource Language Families
C.M. Downey
Terra Blevins
Dhwani Serai
Dwija Parikh
Shane Steinert-Threlkeld
40
2
0
20 May 2024
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad
Ameeta Agrawal
LRM
42
9
0
29 Apr 2024
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Zhaofeng Wu
Ananth Balashankar
Yoon Kim
Jacob Eisenstein
Ahmad Beirami
46
13
0
18 Apr 2024
Investigating Gender Bias in Turkish Language Models
Orhun Caglidil
Malte Ostendorff
Georg Rehm
27
2
0
17 Apr 2024
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
Nikola Ljubesic
Vít Suchomel
Peter Rupnik
Taja Kuzman
Rik van Noord
CLL
29
5
0
08 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
27
36
0
01 Apr 2024
TAMS: Translation-Assisted Morphological Segmentation
Enora Rice
Ali Marashian
Luke Gessler
Alexis Palmer
K. Wense
27
0
0
21 Mar 2024
Prediction of Translation Techniques for the Translation Process
Fan Zhou
Vincent Vandeghinste
37
0
0
21 Mar 2024
Pre-Trained Language Models Represent Some Geographic Populations Better Than Others
Jonathan Dunn
Benjamin Adams
Harish Tayyar Madabushi
32
3
0
16 Mar 2024
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators
Indraneil Paul
Goran Glavas
Iryna Gurevych
42
13
0
06 Mar 2024
ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval
Antoine Louis
V. Saxena
Gijs van Dijck
Gerasimos Spanakis
42
5
0
23 Feb 2024
Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching
Piotr Rybak
29
1
0
22 Feb 2024
Mitigating the Linguistic Gap with Phonemic Representations for Robust Multilingual Language Understanding
Haeji Jung
Changdae Oh
Jooeon Kang
Jimin Sohn
Kyungwoo Song
Jinkyu Kim
David R. Mortensen
29
1
0
22 Feb 2024
Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models
Hariram Veeramani
Surendrabikram Thapa
Usman Naseem
16
5
0
16 Feb 2024
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching
Kurt Micallef
Nizar Habash
Claudia Borg
Fadhl Eryani
Houda Bouamor
29
2
0
30 Jan 2024
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
Lingfeng Shen
Weiting Tan
Sihao Chen
Yunmo Chen
Jingyu Zhang
Haoran Xu
Boyuan Zheng
Philipp Koehn
Daniel Khashabi
34
38
0
23 Jan 2024
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Terra Blevins
Tomasz Limisiewicz
Suchin Gururangan
Margaret Li
Hila Gonen
Noah A. Smith
Luke Zettlemoyer
50
22
0
19 Jan 2024
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
43
7
0
15 Nov 2023
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
Stephen Mayhew
Terra Blevins
Shuheng Liu
Marek vSuppa
Hila Gonen
...
Nikola Ljubevsić
Lester James V. Miranda
Barbara Plank
Arij Riabi
Yuval Pinter
24
9
0
15 Nov 2023
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan
Shuai Yuan
Zhiyong Wu
Lei Li
37
10
0
15 Nov 2023
On the Calibration of Multilingual Question Answering LLMs
Yahan Yang
Soham Dan
Dan Roth
Insup Lee
13
2
0
15 Nov 2023
Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?
Luke Gessler
Nathan Schneider
11
1
0
01 Nov 2023
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Haoran Sun
Xiaohu Zhao
Yikun Lei
Shaolin Zhu
Deyi Xiong
39
8
0
31 Oct 2023
Counterfactually Probing Language Identity in Multilingual Models
Anirudh Srinivasan
Venkata S Govindarajan
Kyle Mahowald
26
1
0
29 Oct 2023
Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques
Manon Reusens
Philipp Borchert
Margot Mieskes
Jochen De Weerdt
Bart Baesens
34
8
0
16 Oct 2023
Exploring the Maze of Multilingual Modeling
Sina Bagheri Nezhad
Ameeta Agrawal
18
1
0
09 Oct 2023
1
2
3
4
Next