Morphological Typology in BPE Subword Productivity and Language Modeling
31 October 2024
Iñigo Parra
Papers citing "Morphological Typology in BPE Subword Productivity and Language Modeling"
5 / 5 papers shown
Title
The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Jonne Saleva
Constantine Lignos
47
20
0
20 Mar 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation
J. Clark
Dan Garrette
Iulia Turc
John Wieting
94
220
0
11 Mar 2021
Byte Pair Encoding is Suboptimal for Language Model Pretraining
Kaj Bostrom
Greg Durrett
61
210
0
07 Apr 2020
How Much Does Tokenization Affect Neural Machine Translation?
Miguel Domingo
Mercedes García-Martínez
A. Helle
F. Casacuberta
Manuel Herranz
51
57
0
20 Dec 2018
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
212
7,735
0
31 Aug 2015