2005.00085
Cited By
AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages
30 April 2020
Anoop Kunchukuttan
Divyanshu Kakwani
S. Golla
Gokul N. C.
Avik Bhattacharyya
Mitesh M. Khapra
Pratyush Kumar
Papers citing
"AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages"
18 / 18 papers shown
Title
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Hrishit Madhavi
Jacob Cherian
Yuvraj Khamkar
Dhananjay Bhagat
VLM
24
0
0
16 May 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
Muhammad Farid Adilazuarda
M. Wijanarko
Lucky Susanto
Khumaisa Nur'aini
Derry Wijaya
Alham Fikri Aji
57
0
0
25 Feb 2025
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
David Romero
Chenyang Lyu
Haryo Akbarianto Wibowo
Teresa Lynn
Injy Hamed
...
Oana Ignat
Joan Nwatu
Rada Mihalcea
Thamar Solorio
Alham Fikri Aji
48
26
0
10 Jun 2024
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
28
24
0
19 Apr 2023
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
133
2,319
0
09 Nov 2022
Re-contextualizing Fairness in NLP: The Case of India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
34
54
0
25 Sep 2022
Part-of-Speech Tagging of Odia Language Using Statistical and Deep Learning-Based Approaches
Tusarkanta Dalai
Tapas Kumar Mishra
Pankaj K. Sa
16
12
0
07 Jul 2022
Mitigating Gender Stereotypes in Hindi and Marathi
Neeraja Kirtane
Tanvi Anand
34
8
0
12 May 2022
Detecting Anchors' Opinion in Hinglish News Delivery
Siddharth Sadhwani
Nishant Grover
Md. Shad Akhtar
Tanmoy Chakraborty
27
1
0
05 Apr 2022
Cognition-aware Cognate Detection
Diptesh Kanojia
Prashant Sharma
Sayali Ghodekar
P. Bhattacharyya
Gholamreza Haffari
Malhar A. Kulkarni
22
10
0
15 Dec 2021
Crosslingual Embeddings are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study
Tamali Banerjee
V. Rudra Murthy
P. Bhattacharyya
35
9
0
09 Jun 2021
Unsupervised Machine Translation On Dravidian Languages
Sai Koneru
Danni Liu
Jan Niehues
42
7
0
29 Mar 2021
Evaluation of Deep Learning Models for Hostility Detection in Hindi Text
Ramchandra Joshi
Rushabh Karnavat
Kaustubh Jirapure
Raviraj Joshi
36
22
0
11 Jan 2021
Bangla Text Classification using Transformers
Tanvirul Alam
A. Khan
Firoj Alam
31
34
0
09 Nov 2020
Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages
Kushal Kumar Jain
Adwait Deshpande
Kumar Shridhar
F. Laumann
Ayushman Dash
51
51
0
04 Nov 2020
iNLTK: Natural Language Toolkit for Indic Languages
Gaurav Arora
VLM
22
66
0
26 Sep 2020
Revisiting Low Resource Status of Indian Languages in Machine Translation
Jerin Philip
Shashank Siripragada
Vinay P. Namboodiri
C. V. Jawahar
15
27
0
11 Aug 2020
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
Marc'Aurelio Ranzato
Ludovic Denoyer
Hervé Jégou
189
1,639
0
11 Oct 2017