Are All Languages Created Equal in Multilingual BERT?


18 May 2020
Shijie Wu
Mark Dredze

Papers citing "Are All Languages Created Equal in Multilingual BERT?"

50 / 174 papers shown
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
31
0
0
21 Apr 2025
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
Ej Zhou
Weiming Lu
28
0
0
15 Apr 2025
Comparing Human Expertise and Large Language Models Embeddings in Content Validity Assessment of Personality Tests
Nicola Milano
Michela Ponticorvo
Davide Marocco
ALM
50
0
0
15 Mar 2025
Optimal word order for non-causal text generation with Large Language Models: the Spanish case
Andrea Busto-Castiñeira
Silvia García-Méndez
Francisco de Arriba-Pérez
Francisco J. González Castaño
41
0
0
21 Feb 2025
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Philipp Borchert
Ivan Vulić
Marie-Francine Moens
Jochen De Weerdt
41
0
0
12 Jan 2025
Prompting with Phonemes: Enhancing LLMs' Multilinguality for Non-Latin Script Languages
Hoang Nguyen
Khyati Mahajan
Vikas Yadav
Philip S. Yu
Masoud Hashemi
Rishabh Maheshwary
49
0
0
04 Nov 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
25
1
0
15 Oct 2024
Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Tomás Feith
Akhil Arora
Martin Gerlach
Debjit Paul
Robert West
KELM
28
0
0
05 Oct 2024
Predictability and Causality in Spanish and English Natural Language Generation
Andrea Busto-Castiñeira
Francisco J. González Castaño
Silvia García-Méndez
Francisco de Arriba-Pérez
CML
54
1
0
26 Aug 2024
MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Hao Zhou
Zhijun Wang
Shujian Huang
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Weihua Luo
Jiajun Chen
CLL
MoE
54
5
0
21 Aug 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
44
4
0
19 Aug 2024
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Orevaoghene Ahia
Sachin Kumar
Hila Gonen
Valentin Hofmann
Tomasz Limisiewicz
Yulia Tsvetkov
Noah A. Smith
51
4
0
11 Jul 2024
Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
Abeba Birhane
Marek McGann
20
7
0
11 Jul 2024
Soft Language Prompts for Language Transfer
Ivan Vykopal
Simon Ostermann
Marian Simko
AAML
42
1
0
02 Jul 2024
Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters
Daniil Gurgurov
Mareike Hartmann
Simon Ostermann
47
6
0
01 Jul 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
39
1
0
20 Jun 2024
Probing the Emergence of Cross-lingual Alignment during LLM Training
Hetong Wang
Pasquale Minervini
E. Ponti
39
8
0
19 Jun 2024
Synergizing Foundation Models and Federated Learning: A Survey
Shenghui Li
Fanghua Ye
Meng Fang
Jiaxu Zhao
Yun-Hin Chan
Edith C. -H. Ngai
Thiemo Voigt
AI4CE
57
5
0
18 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
86
4
0
15 Jun 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
42
1
0
14 Jun 2024
Decoding the Diversity: A Review of the Indic AI Research Landscape
Sankalp KJ
Vinija Jain
S. Bhaduri
Tamoghna Roy
Aman Chadha
55
5
0
13 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
34
0
0
11 Jun 2024
An Open Multilingual System for Scoring Readability of Wikipedia
Mykola Trokhymovych
Indira Sen
Martin Gerlach
40
3
0
03 Jun 2024
Targeted Multilingual Adaptation for Low-resource Language Families
C.M. Downey
Terra Blevins
Dhwani Serai
Dwija Parikh
Shane Steinert-Threlkeld
40
2
0
20 May 2024
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad
Ameeta Agrawal
LRM
40
9
0
29 Apr 2024
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Zhaofeng Wu
Ananth Balashankar
Yoon Kim
Jacob Eisenstein
Ahmad Beirami
46
13
0
18 Apr 2024
Investigating Gender Bias in Turkish Language Models
Orhun Caglidil
Malte Ostendorff
Georg Rehm
27
2
0
17 Apr 2024
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
Nikola Ljubešić
Vít Suchomel
Peter Rupnik
Taja Kuzman
Rik van Noord
CLL
29
5
0
08 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
27
36
0
01 Apr 2024
TAMS: Translation-Assisted Morphological Segmentation
Enora Rice
Ali Marashian
Luke Gessler
Alexis Palmer
K. Wense
27
0
0
21 Mar 2024
Prediction of Translation Techniques for the Translation Process
Fan Zhou
Vincent Vandeghinste
37
0
0
21 Mar 2024
Pre-Trained Language Models Represent Some Geographic Populations Better Than Others
Jonathan Dunn
Benjamin Adams
Harish Tayyar Madabushi
32
3
0
16 Mar 2024
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators
Indraneil Paul
Goran Glavas
Iryna Gurevych
40
13
0
06 Mar 2024
ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval
Antoine Louis
V. Saxena
Gijs van Dijck
Gerasimos Spanakis
42
5
0
23 Feb 2024
Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching
Piotr Rybak
29
1
0
22 Feb 2024
Mitigating the Linguistic Gap with Phonemic Representations for Robust Multilingual Language Understanding
Haeji Jung
Changdae Oh
Jooeon Kang
Jimin Sohn
Kyungwoo Song
Jinkyu Kim
David R. Mortensen
29
0
0
22 Feb 2024
Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models
Hariram Veeramani
Surendrabikram Thapa
Usman Naseem
16
5
0
16 Feb 2024
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching
Kurt Micallef
Nizar Habash
Claudia Borg
Fadhl Eryani
Houda Bouamor
29
2
0
30 Jan 2024
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
Lingfeng Shen
Weiting Tan
Sihao Chen
Yunmo Chen
Jingyu Zhang
Haoran Xu
Boyuan Zheng
Philipp Koehn
Daniel Khashabi
34
38
0
23 Jan 2024
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Terra Blevins
Tomasz Limisiewicz
Suchin Gururangan
Margaret Li
Hila Gonen
Noah A. Smith
Luke Zettlemoyer
50
22
0
19 Jan 2024
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
43
7
0
15 Nov 2023
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
Stephen Mayhew
Terra Blevins
Shuheng Liu
Marek Šuppa
Hila Gonen
...
Nikola Ljubešić
Lester James V. Miranda
Barbara Plank
Arij Riabi
Yuval Pinter
24
9
0
15 Nov 2023
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan
Shuai Yuan
Zhiyong Wu
Lei Li
37
10
0
15 Nov 2023
On the Calibration of Multilingual Question Answering LLMs
Yahan Yang
Soham Dan
Dan Roth
Insup Lee
13
2
0
15 Nov 2023
Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?
Luke Gessler
Nathan Schneider
11
1
0
01 Nov 2023
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Haoran Sun
Xiaohu Zhao
Yikun Lei
Shaolin Zhu
Deyi Xiong
39
8
0
31 Oct 2023
Counterfactually Probing Language Identity in Multilingual Models
Anirudh Srinivasan
Venkata S Govindarajan
Kyle Mahowald
26
1
0
29 Oct 2023
Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques
Manon Reusens
Philipp Borchert
Margot Mieskes
Jochen De Weerdt
Bart Baesens
32
8
0
16 Oct 2023
Exploring the Maze of Multilingual Modeling
Sina Bagheri Nezhad
Ameeta Agrawal
16
1
0
09 Oct 2023