1704.04154
Learning Joint Multilingual Sentence Representations with Neural Machine Translation
13 April 2017
Holger Schwenk
Matthijs Douze
Papers citing
"Learning Joint Multilingual Sentence Representations with Neural Machine Translation"
50 / 91 papers shown
Long-context Non-factoid Question Answering in Indic Languages
Ritwik Mishra
R. Shah
Ponnurangam Kumaraguru
33
0
0
18 Apr 2025
A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain
Jorge del Pozo Lérida
Kamil Kojs
János Máté
Mikołaj Antoni Barański
Christian Hardmeier
45
0
0
27 Jan 2025
Mitigating Semantic Leakage in Cross-lingual Embeddings via Orthogonality Constraint
Dayeon Ki
Cheonbok Park
H. Kim
FedML
33
0
0
24 Sep 2024
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise in Machine Translation
Yan Meng
Di Wu
Christof Monz
36
1
0
02 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark J. F. Gales
Kate Knill
28
1
0
01 Jul 2024
Recovering document annotations for sentence-level bitext
R. Wicks
Matt Post
Philipp Koehn
39
4
0
06 Jun 2024
Residual Quantization with Implicit Neural Codebooks
Iris A. M. Huijben
Matthijs Douze
Matthew Muckley
Ruud J. G. van Sloun
Jakob Verbeek
MQ
29
11
0
26 Jan 2024
The Faiss library
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Hervé Jégou
35
147
0
16 Jan 2024
Mapping Transformer Leveraged Embeddings for Cross-Lingual Document Representation
Tsegaye Misikir Tashu
Eduard-Raul Kontos
M. Sabatelli
Matias Valdenegro-Toro
22
0
0
12 Jan 2024
There's no Data Like Better Data: Using QE Metrics for MT Data Filtering
Jan-Thorsten Peter
David Vilar
Daniel Deutsch
Mara Finkelstein
Juraj Juraska
Markus Freitag
16
17
0
09 Nov 2023
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Haoran Sun
Xiaohu Zhao
Yikun Lei
Shaolin Zhu
Deyi Xiong
39
8
0
31 Oct 2023
Multilingual Neural Machine Translation System for Indic to Indic Languages
Sudhansu Bala Das
Divyajyoti Panda
T. K. Mishra
Bidyut Kr. Patra
Asif Ekbal
30
0
0
22 Jun 2023
Revisiting non-English Text Simplification: A Unified Multilingual Benchmark
Michael Joseph Ryan
Tarek Naous
Wei-ping Xu
31
25
0
25 May 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
Marta R. Costa-jussá
Pierre Yves Andrews
Eric Michael Smith
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Daniel Licht
Carleigh Wood
37
15
0
22 May 2023
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He
Philip N. Garner
44
4
0
16 May 2023
Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?
Sonal Sannigrahi
Josef van Genabith
C. España-Bonet
AILaw
42
4
0
28 Apr 2023
L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence representations using multilingual BERT
Samruddhi Deode
Janhavi Gadre
Aditi Kajale
Ananya Joshi
Raviraj Joshi
25
20
0
22 Apr 2023
tmn at SemEval-2023 Task 9: Multilingual Tweet Intimacy Detection using XLM-T, Google Translate, and Ensemble Learning
Anna Glazkova
17
1
0
08 Apr 2023
Preparing the Vukúzenzele and ZA-gov-multilingual South African multilingual corpora
Richard Lastrucci
Isheanesu Dzingirai
Jenalea Rajab
Andani Madodonga
Matimba Shingange
Daniel Njini
Vukosi Marivate
36
14
0
07 Mar 2023
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
John Wieting
J. Clark
William W. Cohen
Graham Neubig
Taylor Berg-Kirkpatrick
26
6
0
21 Dec 2022
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Mingda Chen
Paul-Ambroise Duquenne
Pierre Yves Andrews
Justine T. Kao
Alexandre Mourachko
Holger Schwenk
Marta R. Costa-jussá
19
17
0
16 Dec 2022
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Paul-Ambroise Duquenne
Hongyu Gong
Ning Dong
Jingfei Du
Ann Lee
Vedanuj Goswami
Changhan Wang
J. Pino
Benoît Sagot
Holger Schwenk
42
34
0
08 Nov 2022
Very Low Resource Sentence Alignment: Luhya and Swahili
E. Chimoto
Bruce A. Bassett
CVBM
11
10
0
31 Oct 2022
The VolcTrans System for WMT22 Multilingual Machine Translation Task
Xian Qian
Kai Hu
Jiaqiang Wang
Yifeng Liu
Xingyuan Pan
Jun Cao
Mingxuan Wang
26
1
0
20 Oct 2022
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation
Deng Cai
Xin Li
Jackie Chun-Sing Ho
Lidong Bing
W. Lam
23
6
0
18 Oct 2022
Decoding speech perception from non-invasive brain recordings
Alexandre Défossez
Charlotte Caucheteux
Jérémy Rapin
Ori Kabeli
J. King
35
116
0
25 Aug 2022
Multilingual Coreference Resolution in Multiparty Dialogue
Boyuan Zheng
Patrick Xia
M. Yarmohammadi
Benjamin Van Durme
58
3
0
02 Aug 2022
EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
43
1
0
31 May 2022
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Sameer Khurana
Antoine Laurent
James R. Glass
25
36
0
17 May 2022
EASE: Entity-Aware Contrastive Learning of Sentence Embedding
Sosuke Nishikawa
Ryokan Ri
Ikuya Yamada
Yoshimasa Tsuruoka
Isao Echizen
28
29
0
09 May 2022
Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation
Xiangpeng Wei
Heng Yu
Yue Hu
Rongxiang Weng
Weihua Luo
Jun Xie
Rong Jin
CLL
17
24
0
14 Apr 2022
Learning How to Translate North Korean through South Korean
Hwichan Kim
Sangwhan Moon
Naoaki Okazaki
Mamoru Komachi
22
2
0
27 Jan 2022
Cross-lingual Transfer of Monolingual Models
Evangelia Gogoulou
Ariel Ekgren
T. Isbister
Magnus Sahlgren
29
17
0
15 Sep 2021
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
M. Yarmohammadi
Shijie Wu
Marc Marone
Haoran Xu
Seth Ebner
...
Craig Harman
Kenton W. Murray
Aaron Steven White
Mark Dredze
Benjamin Van Durme
31
28
0
14 Sep 2021
Metric Learning in Multilingual Sentence Similarity Measurement for Document Alignment
Charith Rajitha
Lakmali Piyarathne
Dilan Sachintha
Surangika Ranathunga
22
3
0
21 Aug 2021
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Yun Tang
Hongyu Gong
Xian Li
Changhan Wang
J. Pino
Holger Schwenk
Naman Goyal
42
10
0
14 Jul 2021
Exploiting Parallel Corpora to Improve Multilingual Embedding based Document and Sentence Alignment
Dilan Sachintha
Lakmali Piyarathna
Charith Rajitha
Surangika Ranathunga
24
3
0
12 Jun 2021
Obtaining Better Static Word Embeddings Using Contextual Embedding Models
Prakhar Gupta
Martin Jaggi
6
30
0
08 Jun 2021
Lightweight Cross-Lingual Sentence Representation Learning
Zhuoyuan Mao
Prakhar Gupta
Pei Wang
Chenhui Chu
Martin Jaggi
Sadao Kurohashi
VLM
22
8
0
28 May 2021
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapilíková
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Ondrej Bojar
SSL
19
36
0
21 May 2021
Graph-based Multilingual Product Retrieval in E-commerce Search
Hanqing Lu
You-Heng Hu
Tong Zhao
Tony Wu
Yiwei Song
Bing Yin
17
25
0
06 May 2021
Paraphrastic Representations at Scale
John Wieting
Kevin Gimpel
Graham Neubig
Taylor Berg-Kirkpatrick
21
19
0
30 Apr 2021
Cross-lingual hate speech detection based on multilingual domain-specific word embeddings
Aymé Arango
Jorge A. Pérez
Bárbara Poblete
24
9
0
30 Apr 2021
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora
G. Kumar
Philipp Koehn
Sanjeev Khudanpur
11
1
0
11 Mar 2021
Majority Voting with Bidirectional Pre-translation For Bitext Retrieval
Alex Jones
Derry Wijaya
22
6
0
10 Mar 2021
Facebook AI's WMT20 News Translation Task Submission
Peng-Jen Chen
Ann Lee
Changhan Wang
Naman Goyal
Angela Fan
Mary Williamson
Jiatao Gu
VLM
17
37
0
16 Nov 2020
Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora
Takashi Wada
Tomoharu Iwata
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
30
6
0
27 Oct 2020
Revisiting Modularized Multilingual NMT to Meet Industrial Demands
Sungwon Lyu
Bokyung Son
Kichang Yang
Jaekyoung Bae
MoE
15
20
0
19 Oct 2020
Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic Similarity
T. Isbister
Magnus Sahlgren
6
9
0
07 Sep 2020
Transformer based Multilingual document Embedding model
Wei Li
Brian Mak
4
5
0
19 Aug 2020