2012.11657
Cited By
Subword Sampling for Low Resource Word Alignment
21 December 2020
Ehsaneddin Asgari
Masoud Jalili Sabet
Philipp Dufter
Christoph Ringlstetter
Hinrich Schütze
Papers citing "Subword Sampling for Low Resource Word Alignment"
4 / 4 papers shown
Title
MorphBPE: A Morpho-Aware Tokenizer Bridging Linguistic Complexity for Efficient LLM Training Across Morphologies
Ehsaneddin Asgari
Yassine El Kheir
Mohammad Ali Sadraei Javaheri
76
0
0
02 Feb 2025
Incorporating Context into Subword Vocabularies
Shaked Yehezkel
Yuval Pinter
47
8
0
13 Oct 2022
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
24
17
0
13 Sep 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
220
7,930
0
17 Aug 2015