Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.21315
Cited By
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
27 May 2025
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead"
22 / 122 papers shown
Title
Contemporary Amharic Corpus: Automatically Morpho-Syntactically Tagged Amharic Corpus
A. Gezmu
B. Seyoum
M. Gasser
A. Nürnberger
56
23
0
14 Jun 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
83
502
0
28 May 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
53
277
0
22 Mar 2021
Congolese Swahili Machine Translation for Humanitarian Response
A. Oktem
Eric DeLuca
Rodrigue Bashizi
Eric Paquin
G. Tang
37
6
0
19 Mar 2021
A Multilingual African Embedding for FAQ Chatbots
A. Mabrouk
Moez Ben Haj Hmida
Chayma Fourati
Hatem Haddad
Abir Messaoudi
46
7
0
16 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
51
14
0
13 Mar 2021
A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios
Michael A. Hedderich
Lukas Lange
Heike Adel
Jannik Strötgen
Dietrich Klakow
289
299
0
23 Oct 2020
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi
Andre Niyongabo Rubungo
Hong Qu
Julia Kreutzer
Li Huang
51
39
0
23 Oct 2020
Language-agnostic BERT Sentence Embedding
Fangxiaoyu Feng
Yinfei Yang
Daniel Cer
N. Arivazhagan
Wei Wang
151
905
0
03 Jul 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Edoardo Ponti
Goran Glavaš
Olga Majewska
Qianchu Liu
Ivan Vulić
Anna Korhonen
LRM
61
320
0
01 May 2020
The State and Fate of Linguistic Diversity and Inclusion in the NLP World
Pratik M. Joshi
Sebastin Santy
A. Budhiraja
Kalika Bali
Monojit Choudhury
LMTD
107
847
0
20 Apr 2020
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech
N. Wilkinson
A. Biswas
Emre Yilmaz
Febe de Wet
Ewald van der Westhuizen
T. Niesler
44
11
0
08 Apr 2020
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
ELM
161
970
0
24 Mar 2020
TArC: Incrementally and Semi-Automatically Collecting a Tunisian Arabish Corpus
Elisa Gugliotta
Marco Dinarelli
15
7
0
20 Mar 2020
Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá
David Ifeoluwa Adelani
Michael A. Hedderich
D. Zhu
Esther van den Berg
Dietrich Klakow
43
12
0
18 Mar 2020
Investigating an approach for low resource language dataset creation, curation and classification: Setswana and Sepedi
Vukosi Marivate
T. Sefara
Vongani Chabalala
Keamogetswe Makhaya
T. Mokgonyane
Rethabile Mokoena
Abiodun Modupe
68
29
0
18 Feb 2020
Common Voice: A Massively-Multilingual Speech Corpus
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
87
1,592
0
13 Dec 2019
CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs
Ahmed El-Kishky
Vishrav Chaudhary
Francisco Guzman
Philipp Koehn
86
199
0
10 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
195
6,538
0
05 Nov 2019
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
Yi Zhu
Benjamin Heinzerling
Ivan Vulić
Michael Strube
Roi Reichart
Anna Korhonen
39
20
0
26 Sep 2019
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
Holger Schwenk
Vishrav Chaudhary
Shuo Sun
Hongyu Gong
Francisco Guzmán
CVBM
93
404
0
10 Jul 2019
Choosing Transfer Languages for Cross-Lingual Learning
Yu-Hsiang Lin
Chian-Yu Chen
Jean Lee
Zirui Li
Yuyan Zhang
...
Zhisong Zhang
Xuezhe Ma
Antonios Anastasopoulos
Patrick Littell
Graham Neubig
79
233
0
29 May 2019
Previous
1
2
3