2005.09093
Cited By
Are All Languages Created Equal in Multilingual BERT?
18 May 2020
Shijie Wu
Mark Dredze
Papers citing
"Are All Languages Created Equal in Multilingual BERT?"
50 / 174 papers shown
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
Assessment of Pre-Trained Models Across Languages and Grammars
Alberto Muñoz-Ortiz
David Vilares
Carlos Gómez-Rodríguez
27
2
0
20 Sep 2023
Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Pinzhen Chen
Shaoxiong Ji
Nikolay Bogoychev
Andrey Kutuzov
Barry Haddow
Kenneth Heafield
28
45
0
16 Sep 2023
OYXOY: A Modern NLP Test Suite for Modern Greek
Konstantinos Kogkalidis
S. Chatzikyriakidis
Eirini Chrysovalantou Giannikouri
Vassiliki Katsouli
Christina Klironomou
...
Dimitris Papadakis
Thelka Pasparaki
Erofili Psaltaki
E. Sakellariou
Hara Soupiona
21
0
0
13 Sep 2023
Embedding structure matters: Comparing methods to adapt multilingual vocabularies to new languages
C.M. Downey
Terra Blevins
Nora Goldfine
Shane Steinert-Threlkeld
33
8
0
09 Sep 2023
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations
Gábor Berend
38
7
0
25 Jul 2023
Scaling Laws Do Not Scale
Fernando Diaz
Michael A. Madaio
23
8
0
05 Jul 2023
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation
Andrei-Marius Avram
V. Mititelu
V. Pais
Dumitru-Clementin Cercel
Stefan Trausan-Matu
43
3
0
17 Jun 2023
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark
Lukasz Augustyniak
Szymon Woźniak
Marcin Gruza
Piotr Gramacki
Krzysztof Rajda
M. Morzy
Tomasz Kajdanowicz
33
5
0
13 Jun 2023
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
18
35
0
12 Jun 2023
Exploring the Relationship between Alignment and Cross-lingual Transfer in Multilingual Transformers
Félix Gaschi
Patricio Cerda
Parisa Rastin
Y. Toussaint
22
9
0
05 Jun 2023
Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging
Fabian David Schmidt
Ivan Vulić
Goran Glavaš
24
8
0
26 May 2023
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Jonas Pfeiffer
Francesco Piccinno
Massimo Nicosia
Xinyi Wang
Machel Reid
Sebastian Ruder
VLM
LRM
36
27
0
23 May 2023
When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages
Niyati Bafna
C. España-Bonet
Josef van Genabith
Benoît Sagot
Rachel Bawden
16
3
0
23 May 2023
How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning
Rochelle Choenni
Dan Garrette
Ekaterina Shutova
40
16
0
22 May 2023
Multilingual Event Extraction from Historical Newspaper Adverts
Nadav Borenstein
N. Perez
Isabelle Augenstein
25
4
0
18 May 2023
Soft Prompt Decoding for Multilingual Dense Retrieval
Zhiqi Huang
Hansi Zeng
Hamed Zamani
James Allan
RALM
63
13
0
15 May 2023
A Crosslingual Investigation of Conceptualization in 1335 Languages
Yihong Liu
Haotian Ye
Leonie Weissweiler
Philipp Wicke
Renhao Pei
Robert Zangenfeind
Hinrich Schütze
34
12
0
15 May 2023
Evaluating Embedding APIs for Information Retrieval
Ehsan Kamalloo
Xinyu Crystina Zhang
Odunayo Ogundepo
Nandan Thakur
David Alfonso-Hermelo
Mehdi Rezagholizadeh
Jimmy J. Lin
RALM
29
19
0
10 May 2023
MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset
Leonhard Hennig
Philippe E. Thomas
Sebastian Möller
18
8
0
08 May 2023
Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages
Sonal Sannigrahi
Rachel Bawden
37
0
0
04 May 2023
L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence representations using multilingual BERT
Samruddhi Deode
Janhavi Gadre
Aditi Kajale
Ananya Joshi
Raviraj Joshi
25
20
0
22 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavaš
Ivan Vulić
37
19
0
18 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM
LM&MA
30
268
0
12 Apr 2023
Model and Evaluation: Towards Fairness in Multilingual Text Classification
Nankai Lin
Junheng He
Zhenghang Tang
Dong-ping Zhou
Aimin Yang
23
1
0
28 Mar 2023
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MA
LRM
ELM
27
268
0
22 Mar 2023
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
27
103
0
20 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
13
9
0
16 Mar 2023
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Abteen Ebrahimi
Arya D. McCarthy
Arturo Oncevay
Luis Chiruzzo
J. Ortega
Gustavo A. Giménez-Lugo
Rolando A. Coto Solano
Katharina Kann
28
6
0
15 Feb 2023
Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation
Zhiqi Huang
Puxuan Yu
James Allan
VLM
38
26
0
29 Jan 2023
Cross-lingual Argument Mining in the Medical Domain
Anar Yeginbergenova
Rodrigo Agerri
44
7
0
25 Jan 2023
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Luke Gessler
Amir Zeldes
17
14
0
23 Dec 2022
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training
Kelly Marchisio
Patrick Lewis
Yihong Chen
Mikel Artetxe
35
16
0
20 Dec 2022
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
14
8
0
19 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
27
22
0
19 Dec 2022
Lessons learned from the evaluation of Spanish Language Models
Rodrigo Agerri
Eneko Agirre
ELM
30
15
0
16 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
41
5
0
04 Dec 2022
GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language Models at Almost No Cost
Qingcheng Zeng
Lucas Garay
Peilin Zhou
Dading Chong
Yining Hua
Jiageng Wu
Yi-Cheng Pan
Han Zhou
Rob Voigt
Jie Yang
VLM
26
23
0
13 Nov 2022
Cross-lingual Transfer Learning for Check-worthy Claim Identification over Twitter
Maram Hasanain
Tamer Elsayed
24
4
0
09 Nov 2022
Intriguing Properties of Compression on Multilingual Models
Kelechi Ogueji
Orevaoghene Ahia
Gbemileke Onilude
Sebastian Gehrmann
Sara Hooker
Julia Kreutzer
21
12
0
04 Nov 2022
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings
Iker García-Ferrero
Rodrigo Agerri
German Rigau
66
21
0
23 Oct 2022
On the Calibration of Massively Multilingual Language Models
Kabir Ahuja
Sunayana Sitaram
Sandipan Dandapat
Monojit Choudhury
76
16
0
21 Oct 2022
Some Languages are More Equal than Others: Probing Deeper into the Linguistic Disparity in the NLP World
Surangika Ranathunga
Nisansa de Silva
45
35
0
16 Oct 2022
You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Tomasz Limisiewicz
Daniel Malkin
Gabriel Stanovsky
24
4
0
13 Oct 2022
Are Pretrained Multilingual Models Equally Fair Across Languages?
Laura Cabello Piqueras
Anders Søgaard
12
9
0
11 Oct 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
27
14
0
16 Aug 2022
Learning to translate by learning to communicate
C.M. Downey
Xuhui Zhou
Leo Z. Liu
Shane Steinert-Threlkeld
31
5
0
14 Jul 2022
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
38
46
0
14 Jul 2022
Improving Low-Resource Speech Recognition with Pretrained Speech Models: Continued Pretraining vs. Semi-Supervised Training
Mitchell DeHaven
J. Billa
VLM
AI4TS
15
8
0
01 Jul 2022
Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
59
26
0
24 May 2022