Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

26 December 2018
Mikel Artetxe
Holger Schwenk
    3DV
ArXiv (abs) | PDF | HTML | Github (3640★)

Papers citing "Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond"

50 / 298 papers shown
Semantic Outlier Removal with Embedding Models and LLMs
Eren Akbiyik
João Almeida
Rik Melis
Ritu Sriram
Viviana Petrescu
Vilhjálmur Vilhjálmsson
17
0
0
19 Jun 2025
Static Word Embeddings for Sentence Semantic Representation
Takashi Wada
Yuki Hirakawa
Ryotaro Shimizu
Takahiro Kawashima
Yuki Saito
91
0
0
05 Jun 2025
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Chunkit Chan
Yauwai Yim
Hongchuan Zeng
Zhiying Zou
Xinyuan Cheng
...
Ginny Wong
Helmut Schmid
Hinrich Schütze
Simon See
Yangqiu Song
LRM
54
0
0
03 Jun 2025
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
118
0
0
27 May 2025
Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong
Hei Yi Mak
Tan Lee
29
7
0
23 May 2025
Krikri: Advancing Open Large Language Models for Greek
Dimitris Roussis
Leon Voukoutis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
ALM
89
1
0
19 May 2025
Long-context Non-factoid Question Answering in Indic Languages
Ritwik Mishra
R. Shah
Ponnurangam Kumaraguru
76
0
0
18 Apr 2025
Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations
Mahjabin Nahar
Eun-Ju Lee
Jin Won Park
Dongwon Lee
HILM
152
0
0
01 Apr 2025
Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?
Yewei Song
Lujun Li
Cedric Lothritz
Saad Ezzini
Lama Sleem
Niccolo Gentile
Radu State
Tegawende F. Bissyande
Jacques Klein
121
3
0
31 Mar 2025
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
Umer Butt
Stalin Veranasi
Günter Neumann
135
0
0
27 Mar 2025
High-Dimensional Interlingual Representations of Large Language Models
Bryan Wilie
Samuel Cahyawijaya
Junxian He
Pascale Fung
157
0
0
14 Mar 2025
A kinetic-based regularization method for data science applications
Abhisek Ganguly
Alessandro Gabbana
Vybhav Rao
Sauro Succi
Santosh Ansumali
134
0
0
06 Mar 2025
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li
Ziyue Li
Dinesh Manocha
MoE
144
0
0
27 Feb 2025
Optimal word order for non-causal text generation with Large Language Models: the Spanish case
Andrea Busto-Castiñeira
Silvia García-Méndez
Francisco de Arriba-Pérez
Francisco J. González Castaño
93
0
0
21 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality
Katharina Hämmerl
Tomasz Limisiewicz
Jindrich Libovický
Alexander Fraser
73
0
0
10 Feb 2025
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference
Nikhil Bhendawade
Mahyar Najibi
Devang Naik
Irina Belousova
MoE
127
0
0
04 Feb 2025
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
HyoJung Han
Akiko Eriguchi
Haoran Xu
Hieu T. Hoang
Marine Carpuat
Huda Khayrallah
VLM
89
3
0
12 Oct 2024
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach
Diogo Pernes
Gonçalo M. Correia
Afonso Mendes
84
1
0
01 Oct 2024
LangSAMP: Language-Script Aware Multilingual Pretraining
Yihong Liu
Haotian Ye
Chunlan Ma
Mingyang Wang
Hinrich Schütze
VLM
246
0
0
26 Sep 2024
Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer
Mingda Li
Abhijit Mishra
Utkarsh Mujumdar
102
0
0
19 Aug 2024
Meltemi: The first open Large Language Model for Greek
Leon Voukoutis
Dimitris Roussis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
VLM
72
9
0
30 Jul 2024
Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Yongxin Huang
Kexin Wang
Goran Glavaš
Iryna Gurevych
98
1
0
20 Jul 2024
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing
Shangda Wu
Yashan Wang
Xiaobing Li
Feng Yu
Maosong Sun
92
5
0
02 Jul 2024
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise in Machine Translation
Yan Meng
Di Wu
Christof Monz
101
1
0
02 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark Gales
Kate Knill
167
4
0
01 Jul 2024
Improving Zero-Shot Cross-Lingual Transfer via Progressive Code-Switching
Zhuoran Li
Chunming Hu
Jiasi Chen
Zhijun Chen
Xiaohui Guo
Richong Zhang
76
4
0
19 Jun 2024
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation
Andreea Iana
Fabian David Schmidt
Goran Glavaš
Heiko Paulheim
173
3
0
18 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
72
0
0
11 Jun 2024
KSW: Khmer Stop Word based Dictionary for Keyword Extraction
Nimol Thuon
Wangrui Zhang
Sada Thuon
21
2
0
27 May 2024
A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
Qusai Abo Obaidah
Muhy Eddin Za'ter
Adnan Jaljuli
Ali Mahboub
Asma Hakouz
Bashar Alfrou
Yazan Estaitia
56
1
0
07 Mar 2024
PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods
Slawomir Dadas
Michal Perelkiewicz
Rafal Poswiata
109
3
0
20 Feb 2024
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
Anas Himmi
Guillaume Staerman
Marine Picot
Pierre Colombo
Nuno M. Guerreiro
417
7
0
20 Feb 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
Felix Friedrich
Katharina Hämmerl
P. Schramowski
Manuel Brack
Jindrich Libovický
Kristian Kersting
Alexander Fraser
EGVM
151
14
0
29 Jan 2024
Cross-lingual neural fuzzy matching for exploiting target-language monolingual corpora in computer-aided translation
M. Esplà-Gomis
Víctor M. Sánchez-Cartagena
J. A. Pérez-Ortiz
F. Sánchez-Martínez
75
3
0
16 Jan 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
62
0
0
06 Jan 2024
Improving Text Embeddings with Large Language Models
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
SyDa
133
190
0
31 Dec 2023
SentAlign: Accurate and Scalable Sentence Alignment
Steinþór Steingrímsson
H. Loftsson
Andy Way
46
8
0
15 Nov 2023
A Material Lens on Coloniality in NLP
William B. Held
Camille Harris
Michael Best
Diyi Yang
94
14
0
14 Nov 2023
From Classification to Generation: Insights into Crosslingual Retrieval Augmented ICL
Xiaoqian Li
Ercong Nie
Sheng Liang
RALM LRM
158
12
0
11 Nov 2023
Language Models are Universal Embedders
Xin Zhang
Zehan Li
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
Min Zhang
KELM ELM
282
9
0
12 Oct 2023
Zero-shot Cross-lingual Transfer without Parallel Corpus
Yuyang Zhang
Xiaofeng Han
Baojun Wang
VLM
84
0
0
07 Oct 2023
Knowledge-Prompted Estimator: A Novel Approach to Explainable Machine Translation Assessment
Hao Yang
Min Zhang
Shimin Tao
Minghan Wang
Daimeng Wei
Yanfei Jiang
LRM
30
10
0
13 Jun 2023
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity
Katharina Hämmerl
Alina Fastowski
Jindrich Libovický
Alexander Fraser
114
7
0
01 Jun 2023
Machine-Created Universal Language for Cross-lingual Transfer
Yaobo Liang
Quanzhi Zhu
Junhe Zhao
Nan Duan
75
7
0
22 May 2023
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
Anas Himmi
Ekhine Irurozki
Nathan Noiry
Stephan Clémençon
Pierre Colombo
193
9
0
17 May 2023
HateMM: A Multi-Modal Dataset for Hate Video Classification
Mithun Das
R. Raj
Punyajoy Saha
Binny Mathew
Manish Gupta
Animesh Mukherjee
88
36
0
06 May 2023
On Evaluation of Bangla Word Analogies
Mousumi Akter
Souvik Sarkar
S. Karmaker
25
3
0
10 Apr 2023
Rediscovering Hashed Random Projections for Efficient Quantization of Contextualized Sentence Embeddings
Ulf A. Hamster
Ji-Ung Lee
Alexander Geyken
Iryna Gurevych
72
0
0
13 Mar 2023
Letz Translate: Low-Resource Machine Translation for Luxembourgish
Yewei Song
Saad Ezzini
Jacques Klein
Tegawende F. Bissyande
C. Lefebvre
A. Goujon
72
3
0
02 Mar 2023
LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation
Zhuoyuan Mao
Tetsuji Nakagawa
FedML
71
20
0
16 Feb 2023