2403.08693
Cited By
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages
13 March 2024
Rik van Noord
Taja Kuzman
Peter Rupnik
Nikola Ljubesic
Miquel Espla-Gomis
Gema Ramírez-Sánchez
Antonio Toral
ALM
Papers citing "Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages"
2 / 2 papers shown
Title
LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification
Taja Kuzman
Nikola Ljubesic
77
0
0
29 Nov 2024
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
136
165
0
24 Oct 2020