Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.13754
Cited By
Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
20 March 2024
Catherine Arnett
Pamela D. Rivière
Tyler A. Chang
Sean Trott
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement"
3 / 3 papers shown
Title
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
193
3,518
0
19 Aug 2018
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
Tal Linzen
Emmanuel Dupoux
Yoav Goldberg
101
903
0
04 Nov 2016
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
212
7,735
0
31 Aug 2015
1