ResearchTrend.AI
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages

13 March 2024
Rik van Noord
Taja Kuzman
Peter Rupnik
Nikola Ljubesic
Miquel Espla-Gomis
Gema Ramírez-Sánchez
Antonio Toral
    ALM

Papers citing "Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages"

2 / 2 papers shown
LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification
Taja Kuzman
Nikola Ljubesic
77
0
0
29 Nov 2024
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
136
165
0
24 Oct 2020