2102.07396
Beyond the English Web: Zero-Shot Cross-Lingual and Lightweight Monolingual Classification of Registers
15 February 2021
Liina Repo, Valtteri Skantsi, Samuel Rönnqvist, Saara Hellström, Miika Oinonen, Anna Salmela, D. Biber, Jesse Egbert, S. Pyysalo, Veronika Laippala
Papers citing "Beyond the English Web: Zero-Shot Cross-Lingual and Lightweight Monolingual Classification of Registers"
4 / 4 papers shown
Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
A. Myntti, Erik Henriksson, Veronika Laippala, S. Pyysalo
152
0
0
02 Apr 2025
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification
Taja Kuzman, I. Mozetič, Nikola Ljubešić
114
94
0
07 Mar 2023
The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild
Taja Kuzman, Peter Rupnik, Nikola Ljubešić
35
7
0
11 Jan 2022
Explaining Classes through Word Attribution
Samuel Rönnqvist, A. Myntti, Aki-Juhani Kyröläinen, S. Pyysalo, Veronika Laippala, Filip Ginter
FAtt
31
0
0
31 Aug 2021