Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.09227
Cited By
A Large and Diverse Arabic Corpus for Language Modeling
23 January 2022
Abbas Raza Ali
Muhammad Ajmal Siddiqui
Rema Algunaibet
Hasan Raza Ali
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Large and Diverse Arabic Corpus for Language Modeling"
8 / 8 papers shown
Title
Cognitive Computing to Optimize IT Services
Abbas Raza Ali
18
7
0
28 Dec 2021
Multi-Dialect Arabic Speech Recognition
Abbas Raza Ali
37
15
0
25 Dec 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
378
2,051
0
31 Dec 2020
AraGPT2: Pre-Trained Transformer for Arabic Language Generation
Wissam Antoun
Fady Baly
Hazem M. Hajj
VLM
39
104
0
31 Dec 2020
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
51
228
0
11 Jun 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
451
4,662
0
23 Jan 2020
Neural Arabic Question Answering
Hussein Mozannar
Karl El Hajal
Elie Maamary
Hazem M. Hajj
42
135
0
12 Jun 2019
1.5 billion words Arabic Corpus
I. A. El-Khair
3DV
34
98
0
12 Nov 2016
1