ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.09093
  4. Cited By
Are All Languages Created Equal in Multilingual BERT?

Are All Languages Created Equal in Multilingual BERT?

18 May 2020
Shijie Wu
Mark Dredze
ArXivPDFHTML

Papers citing "Are All Languages Created Equal in Multilingual BERT?"

50 / 174 papers shown
Title
Hate Speech Detection in Limited Data Contexts using Synthetic Data
  Generation
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
Assessment of Pre-Trained Models Across Languages and Grammars
Assessment of Pre-Trained Models Across Languages and Grammars
Alberto Muñoz-Ortiz
David Vilares
Carlos Gómez-Rodríguez
27
2
0
20 Sep 2023
Monolingual or Multilingual Instruction Tuning: Which Makes a Better
  Alpaca
Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Pinzhen Chen
Shaoxiong Ji
Nikolay Bogoychev
Andrey Kutuzov
Barry Haddow
Kenneth Heafield
28
45
0
16 Sep 2023
OYXOY: A Modern NLP Test Suite for Modern Greek
OYXOY: A Modern NLP Test Suite for Modern Greek
Konstantinos Kogkalidis
S. Chatzikyriakidis
Eirini Chrysovalantou Giannikouri
Vassiliki Katsouli
Christina Klironomou
...
Dimitris Papadakis
Thelka Pasparaki
Erofili Psaltaki
E. Sakellariou
Hara Soupiona
21
0
0
13 Sep 2023
Embedding structure matters: Comparing methods to adapt multilingual
  vocabularies to new languages
Embedding structure matters: Comparing methods to adapt multilingual vocabularies to new languages
C.M. Downey
Terra Blevins
Nora Goldfine
Shane Steinert-Threlkeld
33
8
0
09 Sep 2023
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning
  Sparse Contextualized Word Representations
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations
Gábor Berend
38
7
0
25 Jul 2023
Scaling Laws Do Not Scale
Scaling Laws Do Not Scale
Fernando Diaz
Michael A. Madaio
23
8
0
05 Jul 2023
Multilingual Multiword Expression Identification Using Lateral
  Inhibition and Domain Adaptation
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation
Andrei-Marius Avram
V. Mititelu
V. Pais
Dumitru-Clementin Cercel
Stefan Trausan-Matu
43
3
0
17 Jun 2023
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted
  Sentiment Classification Benchmark
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark
Lukasz Augustyniak
Szymon Wo'zniak
Marcin Gruza
Piotr Gramacki
Krzysztof Rajda
M. Morzy
Tomasz Kajdanowicz
33
5
0
13 Jun 2023
Lost in Translation: Large Language Models in Non-English Content
  Analysis
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
18
35
0
12 Jun 2023
Exploring the Relationship between Alignment and Cross-lingual Transfer
  in Multilingual Transformers
Exploring the Relationship between Alignment and Cross-lingual Transfer in Multilingual Transformers
Félix Gaschi
Patricio Cerda
Parisa Rastin
Y. Toussaint
22
9
0
05 Jun 2023
Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging
Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging
Fabian David Schmidt
Ivan Vulić
Goran Glavavs
24
8
0
26 May 2023
mmT5: Modular Multilingual Pre-Training Solves Source Language
  Hallucinations
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Jonas Pfeiffer
Francesco Piccinno
Massimo Nicosia
Xinyi Wang
Machel Reid
Sebastian Ruder
VLM
LRM
36
27
0
23 May 2023
When your Cousin has the Right Connections: Unsupervised Bilingual
  Lexicon Induction for Related Data-Imbalanced Languages
When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages
Niyati Bafna
C. España-Bonet
Josef van Genabith
Benoît Sagot
Rachel Bawden
16
3
0
23 May 2023
How do languages influence each other? Studying cross-lingual data
  sharing during LM fine-tuning
How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning
Rochelle Choenni
Dan Garrette
Ekaterina Shutova
40
16
0
22 May 2023
Multilingual Event Extraction from Historical Newspaper Adverts
Multilingual Event Extraction from Historical Newspaper Adverts
Nadav Borenstein
N. Perez
Isabelle Augenstein
25
4
0
18 May 2023
Soft Prompt Decoding for Multilingual Dense Retrieval
Soft Prompt Decoding for Multilingual Dense Retrieval
Zhiqi Huang
Hansi Zeng
Hamed Zamani
James Allan
RALM
63
13
0
15 May 2023
A Crosslingual Investigation of Conceptualization in 1335 Languages
A Crosslingual Investigation of Conceptualization in 1335 Languages
Yihong Liu
Haotian Ye
Leonie Weissweiler
Philipp Wicke
Renhao Pei
Robert Zangenfeind
Hinrich Schütze
34
12
0
15 May 2023
Evaluating Embedding APIs for Information Retrieval
Evaluating Embedding APIs for Information Retrieval
Ehsan Kamalloo
Xinyu Crystina Zhang
Odunayo Ogundepo
Nandan Thakur
David Alfonso-Hermelo
Mehdi Rezagholizadeh
Jimmy J. Lin
RALM
29
19
0
10 May 2023
MultiTACRED: A Multilingual Version of the TAC Relation Extraction
  Dataset
MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset
Leonhard Hennig
Philippe E. Thomas
Sebastian Möller
18
8
0
08 May 2023
Investigating Lexical Sharing in Multilingual Machine Translation for
  Indian Languages
Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages
Sonal Sannigrahi
Rachel Bawden
37
0
0
04 May 2023
L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence
  representations using multilingual BERT
L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence representations using multilingual BERT
Samruddhi Deode
Janhavi Gadre
Aditi Kajale
Ananya Joshi
Raviraj Joshi
25
20
0
22 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study
  on Faroese
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavavs
Ivan Vulić
37
19
0
18 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
  Language Models in Multilingual Learning
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM
LM&MA
30
268
0
12 Apr 2023
Model and Evaluation: Towards Fairness in Multilingual Text
  Classification
Model and Evaluation: Towards Fairness in Multilingual Text Classification
Nankai Lin
Junheng He
Zhenghang Tang
Dong-ping Zhou
Aimin Yang
23
1
0
28 Mar 2023
MEGA: Multilingual Evaluation of Generative AI
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MA
LRM
ELM
27
268
0
22 Mar 2023
Language Model Behavior: A Comprehensive Survey
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
27
103
0
20 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches
  for news genre, topic and persuasion technique classification
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
13
9
0
16 Mar 2023
Meeting the Needs of Low-Resource Languages: The Value of Automatic
  Alignments via Pretrained Models
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Abteen Ebrahimi
Arya D. McCarthy
Arturo Oncevay
Luis Chiruzzo
J. Ortega
Gustavo A. Giménez-Lugo
Rolando A. Coto Solano
Katharina Kann
28
6
0
15 Feb 2023
Improving Cross-lingual Information Retrieval on Low-Resource Languages
  via Optimal Transport Distillation
Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation
Zhiqi Huang
Puxuan Yu
James Allan
VLM
38
26
0
29 Jan 2023
Cross-lingual Argument Mining in the Medical Domain
Cross-lingual Argument Mining in the Medical Domain
Anar Yeginbergenova
Rodrigo Agerri
44
7
0
25 Jan 2023
MicroBERT: Effective Training of Low-resource Monolingual BERTs through
  Parameter Reduction and Multitask Learning
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Luke Gessler
Amir Zeldes
17
14
0
23 Dec 2022
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New
  Languages via Aligned Shallow Training
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training
Kelly Marchisio
Patrick Lewis
Yihong Chen
Mikel Artetxe
35
16
0
20 Dec 2022
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code
  Completion
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
14
8
0
19 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
27
22
0
19 Dec 2022
Lessons learned from the evaluation of Spanish Language Models
Lessons learned from the evaluation of Spanish Language Models
Rodrigo Agerri
Eneko Agirre
ELM
30
15
0
16 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language
  Characteristics on Multi-Lingual Text-to-Text Transfer
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
41
5
0
04 Dec 2022
GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language
  Models at Almost No Cost
GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language Models at Almost No Cost
Qingcheng Zeng
Lucas Garay
Peilin Zhou
Dading Chong
Yining Hua
Jiageng Wu
Yi-Cheng Pan
Han Zhou
Rob Voigt
Jie Yang
VLM
26
23
0
13 Nov 2022
Cross-lingual Transfer Learning for Check-worthy Claim Identification
  over Twitter
Cross-lingual Transfer Learning for Check-worthy Claim Identification over Twitter
Maram Hasanain
Tamer Elsayed
24
4
0
09 Nov 2022
Intriguing Properties of Compression on Multilingual Models
Intriguing Properties of Compression on Multilingual Models
Kelechi Ogueji
Orevaoghene Ahia
Gbemileke Onilude
Sebastian Gehrmann
Sara Hooker
Julia Kreutzer
21
12
0
04 Nov 2022
Model and Data Transfer for Cross-Lingual Sequence Labelling in
  Zero-Resource Settings
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings
Iker García-Ferrero
Rodrigo Agerri
German Rigau
66
21
0
23 Oct 2022
On the Calibration of Massively Multilingual Language Models
On the Calibration of Massively Multilingual Language Models
Kabir Ahuja
Sunayana Sitaram
Sandipan Dandapat
Monojit Choudhury
76
16
0
21 Oct 2022
Some Languages are More Equal than Others: Probing Deeper into the
  Linguistic Disparity in the NLP World
Some Languages are More Equal than Others: Probing Deeper into the Linguistic Disparity in the NLP World
Surangika Ranathunga
Nisansa de Silva
45
35
0
16 Oct 2022
You Can Have Your Data and Balance It Too: Towards Balanced and
  Efficient Multilingual Models
You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Tomasz Limisiewicz
Daniel Malkin
Gabriel Stanovsky
24
4
0
13 Oct 2022
Are Pretrained Multilingual Models Equally Fair Across Languages?
Are Pretrained Multilingual Models Equally Fair Across Languages?
Laura Cabello Piqueras
Anders Søgaard
12
9
0
11 Oct 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language
  Models for Sinhala Text Classification
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
27
14
0
16 Aug 2022
Learning to translate by learning to communicate
Learning to translate by learning to communicate
C.M. Downey
Xuhui Zhou
Leo Z. Liu
Shane Steinert-Threlkeld
31
5
0
14 Jul 2022
Language Modelling with Pixels
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
38
46
0
14 Jul 2022
Improving Low-Resource Speech Recognition with Pretrained Speech Models:
  Continued Pretraining vs. Semi-Supervised Training
Improving Low-Resource Speech Recognition with Pretrained Speech Models: Continued Pretraining vs. Semi-Supervised Training
Mitchell DeHaven
J. Billa
VLM
AI4TS
15
8
0
01 Jul 2022
Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of
  Multilingual Language Models
Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
59
26
0
24 May 2022
Previous
1234
Next