23
1

How Cute is Pikachu? Gathering and Ranking Pokémon Properties from Data with Pokémon Word Embeddings

Abstract

We present different methods for obtaining descriptive properties automatically for the 151 original Pok\émon. We train several different word embeddings models on a crawled Pok\émon corpus, and use them to rank automatically English adjectives based on how characteristic they are to a given Pok\émon. Based on our experiments, it is better to train a model with domain specific data than to use a pretrained model. Word2Vec produces less noise in the results than fastText model. Furthermore, we expand the list of properties for each Pok\émon automatically. However, none of the methods is spot on and there is a considerable amount of noise in the different semantic models. Our models have been released on Zenodo.

View on arXiv
Comments on this paper