Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.10464
Cited By
v1
v2 (latest)
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
26 December 2018
Mikel Artetxe
Holger Schwenk
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3640★)
Papers citing
"Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond"
50 / 298 papers shown
Title
Noisy Parallel Data Alignment
Ruoyu Xie
Antonios Anastasopoulos
56
3
0
23 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
115
11
0
19 Jan 2023
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Weijia Xu
Sweta Agrawal
Eleftheria Briakou
Marianna J. Martindale
Marine Carpuat
HILM
74
49
0
18 Jan 2023
Automatic Text Simplification of News Articles in the Context of Public Broadcasting
Diego Maupomé
Fanny Rancourt
T. Soulas
Alexandre Lachance
Marie-Jean Meurs
...
Olivier Brochu Dufour
Igor Pontes
Rémi Cardon
Michel Simard
Sowmya Vajjala
100
0
0
26 Dec 2022
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
John Wieting
J. Clark
William W. Cohen
Graham Neubig
Taylor Berg-Kirkpatrick
96
6
0
21 Dec 2022
T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks
Iker García-Ferrero
Rodrigo Agerri
German Rigau
99
16
0
20 Dec 2022
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Ananya B. Sai
Vignesh Nagarajan
Tanay Dixit
Raj Dabre
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
133
24
0
20 Dec 2022
Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data
Mozhdeh Gheini
Tatiana Likhomanenko
Matthias Sperber
Hendra Setiawan
90
5
0
20 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
108
22
0
19 Dec 2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Zheng-Xin Yong
Hailey Schoelkopf
Niklas Muennighoff
Alham Fikri Aji
David Ifeoluwa Adelani
...
Genta Indra Winata
Stella Biderman
Edward Raff
Dragomir R. Radev
Vassilina Nikoulina
CLL
VLM
AI4CE
LRM
147
89
0
19 Dec 2022
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
David Dale
Elena Voita
Loïc Barrault
Marta R. Costa-jussá
HILM
227
73
0
16 Dec 2022
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Mingda Chen
Paul-Ambroise Duquenne
Pierre Yves Andrews
Justine T. Kao
Alexandre Mourachko
Holger Schwenk
Marta R. Costa-jussá
65
18
0
16 Dec 2022
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
William B. Held
Christopher Hidey
Fei Liu
Eric Zhu
Rahul Goel
Diyi Yang
Rushin Shah
92
0
0
15 Dec 2022
Advancing Multilingual Pre-training: TRIP Triangular Document-level Pre-training for Multilingual Language Models
Hongyuan Lu
Haoyang Huang
Shuming Ma
Dongdong Zhang
W. Lam
Furu Wei
60
4
0
15 Dec 2022
Decomposing a Recurrent Neural Network into Modules for Enabling Reusability and Replacement
S. Imtiaz
Fraol Batole
Astha Singh
Rangeet Pan
Breno Dantas Cruz
Hridesh Rajan
38
7
0
09 Dec 2022
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Liang Wang
Nan Yang
Xiaolong Huang
Binxing Jiao
Linjun Yang
Daxin Jiang
Rangan Majumder
Furu Wei
VLM
263
624
0
07 Dec 2022
Speech-to-Speech Translation For A Real-world Unwritten Language
Peng-Jen Chen
Ke M. Tran
Yilin Yang
Jingfei Du
Justine T. Kao
...
Sravya Popuri
Changhan Wang
J. Pino
Wei-Ning Hsu
Ann Lee
91
26
0
11 Nov 2022
English Contrastive Learning Can Learn Universal Cross-lingual Sentence Embeddings
Yau-Shian Wang
Ashley Wu
Graham Neubig
SSL
93
33
0
11 Nov 2022
Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
Louis Clouâtre
Prasanna Parthasarathi
Payel Das
Sarath Chandar
71
3
0
09 Nov 2022
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Paul-Ambroise Duquenne
Hongyu Gong
Ning Dong
Jingfei Du
Ann Lee
Vedanuj Goswani
Changhan Wang
J. Pino
Benoît Sagot
Holger Schwenk
102
38
0
08 Nov 2022
Very Low Resource Sentence Alignment: Luhya and Swahili
E. Chimoto
Bruce A. Bassett
CVBM
68
10
0
31 Oct 2022
Domain Adaptation of Machine Translation with Crowdworkers
Makoto Morishita
Jun Suzuki
Masaaki Nagata
44
3
0
28 Oct 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
80
29
0
23 Oct 2022
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations
Zilu Tang
Muhammed Yusuf Kocyigit
Derry Wijaya
110
9
0
20 Oct 2022
Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages
Idris Abdulmumin
Michael Beukman
Jesujoba Oluwadara Alabi
Chris C. Emezue
Everlyn Asiko
...
Shamsuddeen Hassan Muhammad
Mofetoluwa Adeyemi
Oreen Yousuf
Sahib Singh
T. Gwadabe
98
9
0
19 Oct 2022
A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning
Kunbo Ding
Weijie Liu
Yuejian Fang
Weiquan Mao
Zhe Zhao
Tao Zhu
Haoyan Liu
Rong Tian
Yiren Chen
68
10
0
18 Oct 2022
Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers
William B. Held
Diyi Yang
VLM
102
6
0
11 Oct 2022
Multilingual Representation Distillation with Contrastive Learning
Weiting Tan
Kevin Heffernan
Holger Schwenk
Philipp Koehn
75
16
0
10 Oct 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
97
20
0
26 Aug 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
72
15
0
16 Aug 2022
Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases
Slawomir Dadas
56
5
0
26 Jul 2022
Multilingual Transformer Encoders: a Word-Level Task-Agnostic Evaluation
Félix Gaschi
François Plesse
Parisa Rastin
Y. Toussaint
68
8
0
19 Jul 2022
Extreme compression of sentence-transformer ranker models: faster inference, longer battery life, and less storage on edge devices
Amit Chaulwar
Lukas Malik
Maciej Krajewski
Felix Reichel
Leif-Nissen Lundbæk
M. Huth
B. Matejczyk
VLM
32
3
0
29 Jun 2022
Endowing Language Models with Multimodal Knowledge Graph Representations
Ningyuan Huang
Y. Deshpande
Yibo Liu
Houda Alberts
Kyunghyun Cho
Clara Vania
Iacer Calixto
VLM
72
16
0
27 Jun 2022
Statistical and Neural Methods for Cross-lingual Entity Label Mapping in Knowledge Graphs
Gabriel Amaral
Marcis Pinnis
Inguna Skadicna
Odinaldo Rodrigues
Elena Simperl
47
3
0
17 Jun 2022
Finetuning a Kalaallisut-English machine translation system using web-crawled data
Alex Jones
51
2
0
05 Jun 2022
Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian
T. Shamardina
Vladislav Mikhailov
Daniil Chernianskii
Alena Fenogenova
Marat Saidov
A. Valeeva
Tatiana Shavrina
I. Smurov
E. Tutubalina
Ekaterina Artemova
DeLMO
62
30
0
03 Jun 2022
Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Virginia Adams
Sandeep Subramanian
Mike Chrzanowski
Oleksii Hrinchuk
Oleksii Kuchaiev
63
2
0
02 Jun 2022
Bitext Mining Using Distilled Sentence Representations for Low-Resource Languages
Kevin Heffernan
Onur cCelebi
Holger Schwenk
149
55
0
25 May 2022
Multilingual Normalization of Temporal Expressions with Masked Language Models
Lukas Lange
Jannik Strötgen
Heike Adel
Dietrich Klakow
56
6
0
20 May 2022
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Sameer Khurana
Antoine Laurent
James R. Glass
65
37
0
17 May 2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages
Kabir Ahuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
LRM
95
16
0
12 May 2022
EASE: Entity-Aware Contrastive Learning of Sentence Embedding
Sosuke Nishikawa
Ryokan Ri
Ikuya Yamada
Yoshimasa Tsuruoka
Isao Echizen
66
31
0
09 May 2022
Gender Bias in Masked Language Models for Multiple Languages
Masahiro Kaneko
Aizhan Imankulova
Danushka Bollegala
Naoaki Okazaki
108
64
0
01 May 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
Edoardo Ponti
Anna Korhonen
96
9
0
30 Apr 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
81
12
0
29 Apr 2022
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?
Zhuoyuan Mao
Chenhui Chu
Raj Dabre
Haiyue Song
Zhen Wan
Sadao Kurohashi
64
3
0
26 Apr 2022
Cross-Lingual Phrase Retrieval
Heqi Zheng
Xiao Zhang
Zewen Chi
Heyan Huang
T. Yan
Tian Lan
Wei Wei
Xian-Ling Mao
RALM
LRM
70
3
0
19 Apr 2022
The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer
Pavel Efimov
Leonid Boytsov
E. Arslanova
Pavel Braslavski
56
7
0
13 Apr 2022
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages
Gokul Karthik Kumar
Abhishek Singh Gehlot
Sahal Shaji Mullappilly
Karthik Nandakumar
83
13
0
12 Apr 2022
Previous
1
2
3
4
5
6
Next