Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.14958
Cited By
A Call for More Rigor in Unsupervised Cross-lingual Learning
30 April 2020
Mikel Artetxe
Sebastian Ruder
Dani Yogatama
Gorka Labaka
Eneko Agirre
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Call for More Rigor in Unsupervised Cross-lingual Learning"
50 / 59 papers shown
Title
LangSAMP: Language-Script Aware Multilingual Pretraining
Yihong Liu
Haotian Ye
Chunlan Ma
Mingyang Wang
Hinrich Schütze
VLM
138
0
0
26 Sep 2024
When Does Unsupervised Machine Translation Work?
Kelly Marchisio
Kevin Duh
Philipp Koehn
LRM
61
75
0
12 Apr 2020
Translation Artifacts in Cross-lingual Transfer Learning
Mikel Artetxe
Gorka Labaka
Eneko Agirre
44
117
0
09 Apr 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
Yaobo Liang
Nan Duan
Yeyun Gong
Ning Wu
Fenfei Guo
...
Shuguang Liu
Fan Yang
Daniel Fernando Campos
Rangan Majumder
Ming Zhou
ELM
VLM
77
350
0
03 Apr 2020
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
ELM
154
970
0
24 Mar 2020
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
130
607
0
10 Mar 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
111
1,806
0
22 Jan 2020
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the WEB
Holger Schwenk
Guillaume Wenzek
Sergey Edunov
Edouard Grave
Armand Joulin
72
260
0
10 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
193
6,522
0
05 Nov 2019
On the Cross-lingual Transferability of Monolingual Representations
Mikel Artetxe
Sebastian Ruder
Dani Yogatama
159
793
0
25 Oct 2019
Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model
Tsung-Yuan Hsu
Chi-Liang Liu
Hung-yi Lee
47
60
0
15 Sep 2019
Lost in Evaluation: Misleading Benchmarks for Bilingual Dictionary Induction
Yova Kementchedjhieva
Mareike Hartmann
Anders Søgaard
44
36
0
12 Sep 2019
Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction
Paula Czarnowska
Sebastian Ruder
Edouard Grave
Ryan Cotterell
Ann A. Copestake
65
50
0
06 Sep 2019
Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
Ivan Vulić
Goran Glavaš
Roi Reichart
Anna Korhonen
SSL
53
89
0
04 Sep 2019
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
Aditya Siddhant
Melvin Johnson
Henry Tsai
N. Arivazhagan
Jason Riesa
Ankur Bapna
Orhan Firat
Karthik Raman
48
70
0
01 Sep 2019
Cross-Lingual Machine Reading Comprehension
Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Shijin Wang
Guoping Hu
27
48
0
01 Sep 2019
Bilingual Lexicon Induction through Unsupervised Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
55
55
0
24 Jul 2019
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
Holger Schwenk
Vishrav Chaudhary
Shuo Sun
Hongyu Gong
Francisco Guzmán
CVBM
93
404
0
10 Jul 2019
Tabula nearly rasa: Probing the Linguistic Knowledge of Character-Level Neural Language Models Trained on Unsegmented Text
Michael Hahn
Marco Baroni
LMTD
33
15
0
17 Jun 2019
How multilingual is Multilingual BERT?
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
143
1,401
0
04 Jun 2019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
99
965
0
07 May 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLM
SSeg
89
677
0
19 Apr 2019
Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing
Tal Schuster
Ori Ram
Regina Barzilay
Amir Globerson
68
210
0
25 Feb 2019
An Effective Approach to Unsupervised Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
42
154
0
04 Feb 2019
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions
Goran Glavaš
Robert Litschko
Sebastian Ruder
Ivan Vulić
ELM
62
183
0
01 Feb 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
69
2,735
0
22 Jan 2019
Unsupervised Neural Machine Translation with SMT as Posterior Regularization
Shuo Ren
Zhirui Zhang
Shujie Liu
M. Zhou
Shuai Ma
55
60
0
14 Jan 2019
Unsupervised Neural Machine Translation Initialized by Unsupervised Statistical Machine Translation
Benjamin Marie
Atsushi Fujita
OT
23
32
0
30 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.3K
94,511
0
11 Oct 2018
XNLI: Evaluating Cross-lingual Sentence Representations
Alexis Conneau
Guillaume Lample
Ruty Rinott
Adina Williams
Samuel R. Bowman
Holger Schwenk
Veselin Stoyanov
ELM
55
1,379
0
13 Sep 2018
Unsupervised Cross-lingual Transfer of Word Embedding Spaces
Ruochen Xu
Yiming Yang
Naoki Otani
Yuexin Wu
SSL
41
99
0
10 Sep 2018
Unsupervised Statistical Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
45
248
0
04 Sep 2018
Gromov-Wasserstein Alignment of Word Embedding Spaces
David Alvarez-Melis
Tommi Jaakkola
OT
54
328
0
31 Aug 2018
A Discriminative Latent-Variable Model for Bilingual Lexicon Induction
Sebastian Ruder
Ryan Cotterell
Yova Kementchedjhieva
Anders Søgaard
51
30
0
28 Aug 2018
Unsupervised Multilingual Word Embeddings
Xilun Chen
Claire Cardie
50
132
0
27 Aug 2018
Unsupervised Alignment of Embeddings with Wasserstein Procrustes
Edouard Grave
Armand Joulin
Quentin Berthet
51
199
0
29 May 2018
A Corpus for Multilingual Document Classification in Eight Languages
Holger Schwenk
Xian Li
VLM
47
143
0
24 May 2018
Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages
Jeremy Barnes
Roman Klinger
Sabine Schulte im Walde
39
69
0
23 May 2018
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings
Mikel Artetxe
Gorka Labaka
Eneko Agirre
SSL
65
589
0
16 May 2018
On the Limitations of Unsupervised Bilingual Dictionary Induction
Anders Søgaard
Sebastian Ruder
Ivan Vulić
58
261
0
09 May 2018
Phrase-Based & Neural Unsupervised Machine Translation
Guillaume Lample
Myle Ott
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
80
683
0
20 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
820
7,141
0
20 Apr 2018
Annotation Artifacts in Natural Language Inference Data
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
124
1,175
0
06 Mar 2018
Learning Word Vectors for 157 Languages
Edouard Grave
Piotr Bojanowski
Prakhar Gupta
Armand Joulin
Tomas Mikolov
SSL
FaML
90
1,425
0
19 Feb 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
161
11,541
0
15 Feb 2018
Non-Adversarial Unsupervised Word Translation
Yedid Hoshen
Lior Wolf
57
119
0
18 Jan 2018
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
SSL
98
1,094
0
31 Oct 2017
Unsupervised Neural Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Kyunghyun Cho
83
774
0
30 Oct 2017
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
MarcÁurelio Ranzato
Ludovic Denoyer
Hervé Jégou
284
1,655
0
11 Oct 2017
Learned in Translation: Contextualized Word Vectors
Bryan McCann
James Bradbury
Caiming Xiong
R. Socher
111
907
0
01 Aug 2017
1
2
Next