Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.10464
Cited By
v1
v2 (latest)
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
26 December 2018
Mikel Artetxe
Holger Schwenk
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3640★)
Papers citing
"Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond"
50 / 298 papers shown
Title
Deception detection in text and its relation to the cultural dimension of individualism/collectivism
Katerina Papantoniou
P. Papadakos
Theodore Patkos
G. Flouris
Ion Androutsopoulos
Dimitris Plexousakis
87
7
0
26 May 2021
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapilíková
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Ondrej Bojar
SSL
66
36
0
21 May 2021
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Xiao Pan
Mingxuan Wang
Liwei Wu
Lei Li
97
207
0
20 May 2021
Analysing The Impact Of Linguistic Features On Cross-Lingual Transfer
B. Dolički
Gerasimos Spanakis
70
18
0
12 May 2021
Backretrieval: An Image-Pivoted Evaluation Metric for Cross-Lingual Text Representations Without Parallel Corpora
Mikhail Fain
Niall Twomey
Danushka Bollegala
29
2
0
11 May 2021
Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation
Giulio Zhou
Gerasimos Lampouras
43
3
0
07 May 2021
Russian News Clustering and Headline Selection Shared Task
I. Gusev
I. Smurov
32
7
0
03 May 2021
Evaluating the Values of Sources in Transfer Learning
Md. Rizwan Parvez
Kai-Wei Chang
73
18
0
26 Apr 2021
Deep learning for sentence clustering in essay grading support
Li-Hsin Chang
Iiro Rastas
S. Pyysalo
Filip Ginter
52
8
0
23 Apr 2021
skweak: Weak Supervision Made Easy for NLP
Pierre Lison
Jeremy Barnes
A. Hubin
66
44
0
19 Apr 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
395
206
0
18 Apr 2021
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
Abteen Ebrahimi
Manuel Mager
Arturo Oncevay
Vishrav Chaudhary
Luis Chiruzzo
...
Graham Neubig
Alexis Palmer
Rolando A. Coto Solano
Ngoc Thang Vu
Katharina Kann
160
74
0
18 Apr 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
229
1,595
0
18 Apr 2021
MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs
Zewen Chi
Li Dong
Shuming Ma
Shaohan Huang Xian-Ling Mao
Heyan Huang
Furu Wei
LRM
123
74
0
18 Apr 2021
XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment
Ahmed El-Kishky
Adithya Renduchintala
James Cross
Francisco Guzmán
Philipp Koehn
67
18
0
17 Apr 2021
BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?
Abdullah Al Ishtiaq
Masum Hasan
Md. Mahim Anjum Haque
Kazi Sajeed Mehrab
Tanveer Muttaqueen
Tahmid Hasan
Anindya Iqbal
Rifat Shahriyar
41
5
0
16 Apr 2021
Are Classes Clusters?
Kees Varekamp
16
2
0
16 Apr 2021
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Sebastian Ruder
Noah Constant
Jan A. Botha
Aditya Siddhant
Orhan Firat
...
Pengfei Liu
Junjie Hu
Dan Garrette
Graham Neubig
Melvin Johnson
ELM
AAML
LRM
93
190
0
15 Apr 2021
The Curious Case of Hallucinations in Neural Machine Translation
Vikas Raunak
Arul Menezes
Marcin Junczys-Dowmunt
229
195
0
14 Apr 2021
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Gowtham Ramesh
Sumanth Doddapaneni
Aravinth Bheemaraj
Mayank Jobanputra
AK Raghavan
...
K. Deepak
Vivek Raghavan
Anoop Kunchukuttan
Pratyush Kumar
Mitesh Khapra
LRM
106
235
0
12 Apr 2021
Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models
James Y. Huang
Kuan-Hao Huang
Kai-Wei Chang
76
21
0
11 Apr 2021
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
62
5
0
31 Mar 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
66
280
0
22 Mar 2021
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Samson Tan
Shafiq Joty
AAML
98
36
0
17 Mar 2021
Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings
Linlin Liu
Thien Hai Nguyen
Shafiq Joty
Lidong Bing
Luo Si
102
5
0
11 Mar 2021
Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling?
Taraka Rama
Sowmya Vajjala
38
6
0
25 Feb 2021
Hate-Alert@DravidianLangTech-EACL2021: Ensembling strategies for Transformer-based Offensive language Detection
Debjoy Saha
Naman Paharia
Debajit Chakraborty
Punyajoy Saha
Animesh Mukherjee
39
38
0
19 Feb 2021
"Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups
Punyajoy Saha
Binny Mathew
Kiran Garimella
Animesh Mukherjee
64
52
0
07 Feb 2021
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky
Sanjeev Khudanpur
Jacob Bremerman
R. Cattoni
Matteo Negri
Marco Turchi
Douglas W. Oard
Matt Post
79
126
0
02 Feb 2021
Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification
Yi Zhu
Ehsan Shareghi
Yingzhen Li
Roi Reichart
Anna Korhonen
VLM
41
5
0
26 Jan 2021
Meta-Learning for Effective Multi-task and Multilingual Modelling
Ishan Tarunesh
Sushil Khyalia
Vishwajeet Kumar
Ganesh Ramakrishnan
Preethi Jyothi
81
16
0
25 Jan 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
150
498
0
02 Jan 2021
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Kazi Samin Mubasshir
Md. Saiful Islam
Anindya Iqbal
M. Rahman
Rifat Shahriyar
SSL
VLM
101
180
0
01 Jan 2021
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment
Freda Shi
Luke Zettlemoyer
Sida I. Wang
SSL
84
33
0
01 Jan 2021
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters
Mengjie Zhao
Yi Zhu
Ehsan Shareghi
Ivan Vulić
Roi Reichart
Anna Korhonen
Hinrich Schütze
112
64
0
31 Dec 2020
Universal Sentence Representation Learning with Conditional Masked Language Model
Ziyi Yang
Yinfei Yang
Daniel Cer
Jax Law
Eric F. Darve
SSL
84
58
0
28 Dec 2020
Pivot Through English: Reliably Answering Multilingual Questions without Document Retrieval
Ivan Montero
Shayne Longpre
Ni Lao
Andrew J. Frank
Christopher DuBois
LRM
64
5
0
28 Dec 2020
Simple or Complex? Learning to Predict Readability of Bengali Texts
Susmoy Chakraborty
Mir Tafseer Nayeem
Wasi Uddin Ahmad
57
20
0
09 Dec 2020
Globetrotter: Connecting Languages by Connecting Images
Dídac Surís
Dave Epstein
Carl Vondrick
VLM
74
9
0
08 Dec 2020
Cross-lingual Transfer of Abstractive Summarizer to Less-resource Language
Aleš Žagar
Marko Robnik-Šikonja
68
9
0
08 Dec 2020
Score Combination for Improved Parallel Corpus Filtering for Low Resource Conditions
Muhammad N. ElNokrashy
Amr Hendy
M. Abdelghaffar
Mohamed Afify
Ahmed Tawfik
Hany Awadalla
48
3
0
16 Nov 2020
Probing Multilingual BERT for Genetic and Typological Signals
Taraka Rama
Lisa Beinborn
Steffen Eger
63
25
0
04 Nov 2020
Biased TextRank: Unsupervised Graph-Based Content Extraction
Ashkan Kazemi
Verónica Pérez-Rosas
Rada Mihalcea
147
30
0
02 Nov 2020
VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation
Fuli Luo
Wei Wang
Jiahao Liu
Yijia Liu
Bin Bi
Songfang Huang
Fei Huang
Luo Si
111
52
0
30 Oct 2020
Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora
Takashi Wada
Tomoharu Iwata
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
120
7
0
27 Oct 2020
ReadOnce Transformers: Reusable Representations of Text for Transformers
Shih-Ting Lin
Ashish Sabharwal
Tushar Khot
112
3
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
177
143
0
24 Oct 2020
DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries
Aditi Chaudhary
K. Raman
Krishna Srinivasan
Jiecao Chen
81
25
0
23 Oct 2020
Multilingual BERT Post-Pretraining Alignment
Lin Pan
Chung-Wei Hang
Haode Qi
Abhishek Shah
Saloni Potdar
Mo Yu
177
44
0
23 Oct 2020
Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond
Xin Li
Lidong Bing
Wenxuan Zhang
Zheng Li
Wai Lam
125
25
0
23 Oct 2020
Previous
1
2
3
4
5
6
Next