ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.14062
14
6

Morphological Disambiguation of South Sámi with FSTs and Neural Networks

29 April 2020
Mika Hämäläinen
Linda Wiechetek
ArXivPDFHTML
Abstract

We present a method for conducting morphological disambiguation for South S\ámi, which is an endangered language. Our method uses an FST-based morphological analyzer to produce an ambiguous set of morphological readings for each word in a sentence. These readings are disambiguated with a Bi-RNN model trained on the related North S\ámi UD Treebank and some synthetically generated South S\ámi data. The disambiguation is done on the level of morphological tags ignoring word forms and lemmas; this makes it possible to use North S\ámi training data for South S\ámi without the need for a bilingual dictionary or aligned word embeddings. Our approach requires only minimal resources for South S\ámi, which makes it usable and applicable in the contexts of any other endangered language as well.

View on arXiv
Comments on this paper