Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.10464
Cited By
v1
v2 (latest)
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
26 December 2018
Mikel Artetxe
Holger Schwenk
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3640★)
Papers citing
"Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond"
48 / 298 papers shown
Title
On the Importance of Word Order Information in Cross-lingual Sequence Labeling
Zihan Liu
Genta Indra Winata
Samuel Cahyawijaya
Andrea Madotto
Zhaojiang Lin
Pascale Fung
119
3
0
30 Jan 2020
PMIndia -- A Collection of Parallel Corpora of Languages of India
Barry Haddow
Faheem Kirefu
53
103
0
27 Jan 2020
Deep Learning for Hindi Text Classification: A Comparison
Ramchandra Joshi
Purvi Goel
Raviraj Joshi
VLM
59
40
0
19 Jan 2020
Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs
Pengda Qin
Xin Eric Wang
Wenhu Chen
Chunyun Zhang
Weiran Xu
William Yang Wang
GAN
93
85
0
08 Jan 2020
A Comprehensive Survey of Multilingual Neural Machine Translation
Raj Dabre
Chenhui Chu
Anoop Kunchukuttan
LRM
116
33
0
04 Jan 2020
An Empirical Study of Factors Affecting Language-Independent Models
Xiaotong Liu
Yingbei Tong
Anbang Xu
Rama Akkiraju
32
0
0
30 Dec 2019
A Comparison of Architectures and Pretraining Methods for Contextualized Multilingual Word Embeddings
Niels van der Heijden
Samira Abnar
Ekaterina Shutova
69
16
0
15 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
111
401
0
11 Dec 2019
Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering
C. Carrino
Marta R. Costa-jussá
José A. R. Fonollosa
69
89
0
11 Dec 2019
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies
Marta R. Costa-jussá
P. Lin
C. España-Bonet
SyDa
64
25
0
10 Dec 2019
Massive vs. Curated Word Embeddings for Low-Resourced Languages. The Case of Yorùbá and Twi
Jesujoba Oluwadara Alabi
Kwabena Amponsah-Kaakyire
David Ifeoluwa Adelani
C. España-Bonet
92
53
0
05 Dec 2019
COSTRA 1.0: A Dataset of Complex Sentence Transformations
P. Barancíková
Ondrej Bojar
48
7
0
03 Dec 2019
hauWE: Hausa Words Embedding for Natural Language Processing
Idris Abdulmumin
B. Galadanci
58
15
0
25 Nov 2019
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the WEB
Holger Schwenk
Guillaume Wenzek
Sergey Edunov
Edouard Grave
Armand Joulin
96
263
0
10 Nov 2019
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRM
VLM
104
13
0
10 Nov 2019
A Bilingual Generative Transformer for Semantic Sentence Embedding
John Wieting
Graham Neubig
Taylor Berg-Kirkpatrick
78
29
0
10 Nov 2019
CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs
Ahmed El-Kishky
Vishrav Chaudhary
Francisco Guzman
Philipp Koehn
114
200
0
10 Nov 2019
Evaluation of Sentence Representations in Polish
Slawomir Dadas
Michal Perelkiewicz
Rafal Poswiata
184
16
0
25 Oct 2019
Exploring Multilingual Syntactic Sentence Representations
Chen Cecilia Liu
Anderson de Andrade
Muhammad Osama
29
4
0
25 Oct 2019
Wasserstein distances for evaluating cross-lingual embeddings
Georgios Balikas
Karanjit S Kooner
31
1
0
24 Oct 2019
Facebook AI's WAT19 Myanmar-English Translation Task Submission
Peng-Jen Chen
Jiajun Shen
Matt Le
Vishrav Chaudhary
Ahmed El-Kishky
Guillaume Wenzek
Myle Ott
MarcÁurelio Ranzato
40
29
0
15 Oct 2019
Aligning Cross-Lingual Entities with Multi-Aspect Information
Hsiu-Wei Yang
Yanyan Zou
Peng Shi
Wei Lu
Jimmy J. Lin
Xu Sun
99
148
0
15 Oct 2019
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework
Zirui Wang
Jiateng Xie
Ruochen Xu
Yiming Yang
Graham Neubig
J. Carbonell
98
79
0
10 Oct 2019
Simple and Effective Paraphrastic Similarity from Parallel Translations
John Wieting
Kevin Gimpel
Graham Neubig
Taylor Berg-Kirkpatrick
87
49
0
30 Sep 2019
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation
Inigo Jauregi Unanue
E. Z. Borzeshi
Massimo Piccardi
AI4TS
42
0
0
30 Sep 2019
HateMonitors: Language Agnostic Abuse Detection in Social Media
Punyajoy Saha
Binny Mathew
Pawan Goyal
Animesh Mukherjee
50
28
0
27 Sep 2019
Cross-Lingual Natural Language Generation via Pre-Training
Zewen Chi
Li Dong
Furu Wei
Wenhui Wang
Xian-Ling Mao
Heyan Huang
101
138
0
23 Sep 2019
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings
Gregor Wiedemann
Steffen Remus
Avi Chawla
Chris Biemann
106
176
0
23 Sep 2019
Enriching BERT with Knowledge Graph Embeddings for Document Classification
Malte Ostendorff
Peter Bourgonje
Maria Berger
J. Moreno-Schneider
Georg Rehm
Bela Gipp
69
82
0
18 Sep 2019
Bridging the domain gap in cross-lingual document classification
Guokun Lai
Barlas Oğuz
Yiming Yang
Veselin Stoyanov
VLM
67
14
0
16 Sep 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Julian Martin Eisenschlos
Sebastian Ruder
Piotr Czapla
Marcin Kardas
Sylvain Gugger
Jeremy Howard
69
99
0
10 Sep 2019
Investigating Multilingual NMT Representations at Scale
Sneha Kudugunta
Ankur Bapna
Isaac Caswell
N. Arivazhagan
Orhan Firat
LRM
198
125
0
05 Sep 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
Haoyang Huang
Yaobo Liang
Nan Duan
Ming Gong
Linjun Shou
Daxin Jiang
M. Zhou
109
233
0
03 Sep 2019
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
Aditya Siddhant
Melvin Johnson
Henry Tsai
N. Arivazhagan
Jason Riesa
Ankur Bapna
Orhan Firat
Karthik Raman
86
71
0
01 Sep 2019
Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER
Phillip Keung
Y. Lu
Vikas Bhardwaj
111
81
0
31 Aug 2019
Zero-shot transfer for implicit discourse relation classification
Murathan Kurfali
Robert Östling
46
12
0
30 Jul 2019
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
Holger Schwenk
Vishrav Chaudhary
Shuo Sun
Hongyu Gong
Francisco Guzmán
CVBM
118
408
0
10 Jul 2019
Multilingual Universal Sentence Encoder for Semantic Retrieval
Yinfei Yang
Daniel Cer
Amin Ahmad
Mandy Guo
Jax Law
...
Steve Yuan
Chris Tar
Yun-hsuan Sung
B. Strope
R. Kurzweil
3DV
94
481
0
09 Jul 2019
Low-Resource Corpus Filtering using Multilingual Sentence Embeddings
Vishrav Chaudhary
Y. Tang
Francisco Guzmán
Holger Schwenk
Philipp Koehn
86
80
0
20 Jun 2019
How multilingual is Multilingual BERT?
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
230
1,416
0
04 Jun 2019
Learning Multilingual Word Embeddings Using Image-Text Data
K. Singhal
K. Raman
B. T. Cate
VLM
52
10
0
29 May 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLM
SSeg
141
681
0
19 Apr 2019
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk
Milan Straka
117
264
0
03 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
159
3,159
0
01 Apr 2019
Massively Multilingual Neural Machine Translation
Roee Aharoni
Melvin Johnson
Orhan Firat
LRM
AI4CE
90
490
0
28 Feb 2019
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions
Goran Glavaš
Robert Litschko
Sebastian Ruder
Ivan Vulić
ELM
98
183
0
01 Feb 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
152
2,750
0
22 Jan 2019
A Survey Of Cross-lingual Word Embedding Models
Sebastian Ruder
Ivan Vulić
Anders Søgaard
108
534
0
15 Jun 2017
Previous
1
2
3
4
5
6