Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09077
Cited By
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
19 April 2019
Shijie Wu
Mark Dredze
VLM
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT"
50 / 185 papers shown
Title
MuMUR : Multilingual Multimodal Universal Retrieval
Avinash Madasu
Estelle Aflalo
Gabriela Ben-Melech Stan
Shachar Rosenman
Shao-Yen Tseng
Gedas Bertasius
Vasudev Lal
47
3
0
24 Aug 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
29
14
0
16 Aug 2022
Mismatching-Aware Unsupervised Translation Quality Estimation For Low-Resource Languages
Fatemeh Azadi
Heshaam Faili
M. Dousti
30
4
0
31 Jul 2022
Zero-shot Cross-lingual Transfer is Under-specified Optimization
Shijie Wu
Benjamin Van Durme
Mark Dredze
33
6
0
12 Jul 2022
"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer
Shanu Kumar
Sandipan Dandapat
Monojit Choudhury
29
6
0
30 Jun 2022
Transfer Language Selection for Zero-Shot Cross-Lingual Abusive Language Detection
J. Eronen
M. Ptaszynski
Fumito Masui
Masaki Arata
Gniewosz Leliwa
Michal Wroczynski
19
31
0
02 Jun 2022
Persian Natural Language Inference: A Meta-learning approach
Heydar Soudani
Mohammadreza Mojab
H. Beigy
34
1
0
18 May 2022
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
Kabir Ahuja
Monojit Choudhury
Sandipan Dandapat
29
3
0
12 May 2022
Lifting the Curse of Multilinguality by Pre-training Modular Transformers
Jonas Pfeiffer
Naman Goyal
Xi Lin
Xian Li
James Cross
Sebastian Riedel
Mikel Artetxe
LRM
40
139
0
12 May 2022
Analyzing Gender Representation in Multilingual Models
Hila Gonen
Shauli Ravfogel
Yoav Goldberg
25
11
0
20 Apr 2022
Cross-Lingual Phrase Retrieval
Heqi Zheng
Xiao Zhang
Zewen Chi
Heyan Huang
T. Yan
Tian Lan
Wei Wei
Xian-Ling Mao
RALM
LRM
35
3
0
19 Apr 2022
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models
Sunit Bhattacharya
Rishu Kumar
Ondrej Bojar
18
2
0
11 Apr 2022
A Dual-Contrastive Framework for Low-Resource Cross-Lingual Named Entity Recognition
Yingwen Fu
Nankai Lin
Ziyu Yang
Shengyi Jiang
24
4
0
02 Apr 2022
CL-XABSA: Contrastive Learning for Cross-lingual Aspect-based Sentiment Analysis
Nankai Lin
Yingwen Fu
Xiaotian Lin
Aimin Yang
Shengyi Jiang
45
16
0
02 Apr 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
62
23
0
21 Mar 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Ryokan Ri
Yoshimasa Tsuruoka
32
26
0
19 Mar 2022
Challenges and Strategies in Cross-Cultural NLP
Daniel Hershcovich
Stella Frank
Heather Lent
Miryam de Lhoneux
Mostafa Abdou
...
Ruixiang Cui
Constanza Fierro
Katerina Margatina
Phillip Rust
Anders Søgaard
48
163
0
18 Mar 2022
Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation
Xinyi Wang
Sebastian Ruder
Graham Neubig
42
61
0
17 Mar 2022
Combining Static and Contextualised Multilingual Embeddings
Katharina Hämmerl
Jindrich Libovický
Alexander Fraser
27
10
0
17 Mar 2022
Cross-Lingual Ability of Multilingual Masked Language Models: A Study of Language Structure
Yuan Chai
Yaobo Liang
Nan Duan
LRM
27
21
0
16 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLM
CLIP
33
137
0
14 Mar 2022
CINO: A Chinese Minority Pre-trained Language Model
Ziqing Yang
Zihang Xu
Yiming Cui
Baoxin Wang
Min Lin
Dayong Wu
Zhigang Chen
23
25
0
28 Feb 2022
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
Vésteinn Snæbjarnarson
Haukur Barri Símonarson
Pétur Orri Ragnarsson
Svanhvít Lilja Ingólfsdóttir
H. Jónsson
Vilhjálmur Þorsteinsson
H. Einarsson
24
26
0
14 Jan 2022
On Cross-Lingual Retrieval with Multilingual Text Encoders
Robert Litschko
Ivan Vulić
Simone Paolo Ponzetto
Goran Glavaš
27
38
0
21 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
34
143
0
20 Dec 2021
Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings
Rob van der Goot
Miryam de Lhoneux
37
5
0
07 Dec 2021
Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding
Qianying Liu
Fei Cheng
Sadao Kurohashi
17
1
0
10 Nov 2021
Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation
Guanhua Chen
Shuming Ma
Yun-Nung Chen
Dongdong Zhang
Jia Pan
Wenping Wang
Furu Wei
LRM
31
14
0
16 Oct 2021
An Isotropy Analysis in the Multilingual BERT Embedding Space
S. Rajaee
Mohammad Taher Pilehvar
24
33
0
09 Oct 2021
On the Prunability of Attention Heads in Multilingual BERT
Aakriti Budhraja
Madhura Pande
Pratyush Kumar
Mitesh M. Khapra
52
4
0
26 Sep 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
119
107
0
24 Sep 2021
BERT Cannot Align Characters
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
25
0
0
20 Sep 2021
Locating Language-Specific Information in Contextualized Embeddings
Sheng Liang
Philipp Dufter
Hinrich Schütze
30
7
0
16 Sep 2021
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning
Marc Tanti
Lonneke van der Plas
Claudia Borg
Albert Gatt
20
10
0
14 Sep 2021
Evaluating Transferability of BERT Models on Uralic Languages
Judit Ács
Dániel Lévai
András Kornai
33
6
0
13 Sep 2021
xGQA: Cross-Lingual Visual Question Answering
Jonas Pfeiffer
Gregor Geigle
Aishwarya Kamath
Jan-Martin O. Steitz
Stefan Roth
Ivan Vulić
Iryna Gurevych
42
56
0
13 Sep 2021
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
24
17
0
13 Sep 2021
The Impact of Positional Encodings on Multilingual Compression
Vinit Ravishankar
Anders Søgaard
25
5
0
11 Sep 2021
Subword Mapping and Anchoring across Languages
Giorgos Vernikos
Andrei Popescu-Belis
70
12
0
09 Sep 2021
Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach
Koren Lazar
Benny Saret
Asaf Yehudai
W. Horowitz
N. Wasserman
Gabriel Stanovsky
41
23
0
09 Sep 2021
Nearest Neighbour Few-Shot Learning for Cross-lingual Classification
M Saiful Bari
Batool Haider
Saab Mansour
VLM
19
13
0
06 Sep 2021
Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding
Yingmei Guo
Linjun Shou
J. Pei
Ming Gong
Mingxing Xu
Zhiyong Wu
Daxin Jiang
34
5
0
03 Sep 2021
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L. Bonifacio
Vitor Jeronymo
Hugo Queiroz Abonizio
Israel Campiotti
Marzieh Fadaee
R. Lotufo
Rodrigo Nogueira
47
108
0
31 Aug 2021
Not All Linearizations Are Equally Data-Hungry in Sequence Labeling Parsing
Alberto Muñoz-Ortiz
Michalina Strzyz
David Vilares
32
9
0
17 Aug 2021
Transfer Learning for Mining Feature Requests and Bug Reports from Tweets and App Store Reviews
Pablo Restrepo Henao
Jannik Fischbach
Dominik Spies
Julian Frattini
Andreas Vogelsang
24
25
0
02 Aug 2021
Modelling Latent Translations for Cross-Lingual Transfer
Edoardo Ponti
Julia Kreutzer
Ivan Vulić
Siva Reddy
37
18
0
23 Jul 2021
The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding
Archiki Prasad
Mohammad Ali Rehan
Shreyasi Pathak
Preethi Jyothi
32
9
0
21 Jul 2021
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
47
74
0
01 Jul 2021
Revisiting the Primacy of English in Zero-shot Cross-lingual Transfer
Iulia Turc
Kenton Lee
Jacob Eisenstein
Ming-Wei Chang
Kristina Toutanova
26
58
0
30 Jun 2021
X-FACT: A New Benchmark Dataset for Multilingual Fact Checking
Ashim Gupta
Vivek Srikumar
HILM
20
97
0
17 Jun 2021
Previous
1
2
3
4
Next