ResearchTrend.AI
Mapping Languages: The Corpus of Global Language Use

2 April 2020
Jonathan Dunn

Papers citing "Mapping Languages: The Corpus of Global Language Use"

16 / 16 papers shown
Large corpora and large language models: a replicable method for automating grammatical annotation
Cameron Morin
Matti Marttinen Larsson
38
1
0
18 Nov 2024
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages
Amir Hossein Kargaran
François Yvon
Hinrich Schütze
VLM
49
5
0
31 Oct 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
45
3
0
26 Sep 2024
Geographically-Informed Language Identification
Jonathan Dunn
Lane Edwards-Brown
29
2
0
14 Mar 2024
cantnlp@LT-EDI-2024: Automatic Detection of Anti-LGBTQ+ Hate Speech in Under-resourced Languages
Sidney Gig-Jan Wong
Matthew Durward
19
0
0
28 Jan 2024
Comparing Measures of Linguistic Diversity Across Social Media Language Data and Census Data at Subnational Geographic Areas
Sidney Gig-Jan Wong
Jonathan Dunn
B. Adams
37
1
0
21 Aug 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALM
LRM
49
96
0
20 May 2023
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages
Chris C. Emezue
Sanchit Gandhi
Lewis Tunstall
Abubakar Abid
Josh Meyer
...
Douwe Kiela
Yacine Jernite
Julien Chaumond
Merve Noyan
Omar Sanseviero
33
2
0
22 Mar 2023
Exploring the Constructicon: Linguistic Analysis of a Computational CxG
Jonathan Dunn
35
5
0
30 Jan 2023
Exposure and Emergence in Usage-Based Grammar: Computational Experiments in 35 Languages
Jonathan Dunn
31
8
0
25 Nov 2022
Register Variation Remains Stable Across 60 Languages
Haipeng Li
Jonathan Dunn
A. Nini
47
8
0
20 Sep 2022
Stability of Syntactic Dialect Classification Over Space and Time
Jonathan Dunn
Sidney Gig-Jan Wong
28
5
0
11 Sep 2022
Predicting Embedding Reliability in Low-Resource Settings Using Corpus Similarity Measures
Jonathan Dunn
Haipeng Li
Damian Sastre
27
5
0
09 Jun 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Zhehuai Chen
Yonghui Wu
Macduff Hughes
56
99
0
09 May 2022
Learned Construction Grammars Converge Across Registers Given Increased Exposure
Jonathan Dunn
Harish Tayyar Madabushi
33
8
0
12 Oct 2021
The Twitter of Babel: Mapping World Languages through Microblogging Platforms
Delia Mocanu
Andrea Baronchelli
B. Gonçalves
N. Perra
Alessandro Vespignani
56
294
0
20 Dec 2012