arXiv: 2310.08078
To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer
12 October 2023
Md. Mushfiqur Rahman
Fardin Ahsan Sakib
Fahim Faisal
Antonios Anastasopoulos
Papers citing
"To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer"
7 / 7 papers shown
Title
Overcoming Vocabulary Constraints with Pixel-level Fallback
Jonas F. Lotz
Hendra Setiawan
Stephan Peitz
Yova Kementchedjhieva
43
0
0
02 Apr 2025
Examining Language Modeling Assumptions Using an Annotated Literary Dialect Corpus
Craig Messner
Tom Lippincott
21
1
0
03 Oct 2024
Text Rendering Strategies for Pixel Language Models
Jonas F. Lotz
Elizabeth Salesky
Phillip Rust
Desmond Elliott
VLM
29
11
0
01 Nov 2023
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages
Fahim Faisal
Antonios Anastasopoulos
AI4CE
LRM
34
26
0
19 May 2022
Systematic Inequalities in Language Technology Performance across the World's Languages
Damián E. Blasi
Antonios Anastasopoulos
Graham Neubig
127
131
0
13 Oct 2021
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
134
165
0
24 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
77
156
0
20 Oct 2020