2103.10730
Cited By
MuRIL: Multilingual Representations for Indian Languages
19 March 2021
Simran Khanuja
Diksha Bansal
Sarvesh Mehtani
Savya Khosla
Atreyee Dey
Balaji Gopalan
D. Margam
Pooja Aggarwal
Rajiv Teja Nagipogu
Shachi Dave
Shruti Gupta
Subhash Chandra Bose Gali
Vishnu Subramanian
Partha P. Talukdar
Papers citing
"MuRIL: Multilingual Representations for Indian Languages"
49 / 49 papers shown
Title
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling
Aditeya Baral
Allen George Ajith
Roshan Nayak
Mrityunjay Abhijeet Bhanja
14
0
0
19 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages
Sharvi Endait
Ruturaj Ghatage
Aditya Kulkarni
Rajlaxmi Patil
Raviraj Joshi
37
0
0
06 May 2025
User-Aware Multilingual Abusive Content Detection in Social Media
Mohammad Zia Ur Rehman
Somya Mehta
Kuldeep Singh
Kunal Kaushik
Nagendra Kumar
23
14
0
26 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
37
1
0
24 Oct 2024
Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning
Gaurav Arora
Srujana Merugu
Shreya Jain
Vaibhav Saxena
LRM
37
0
0
18 Oct 2024
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Raviraj Joshi
Kanishk Singla
Anusha Kamath
Raunak Kalani
Rakesh Paul
Utkarsh Vaidya
Sanjay Singh Chauhan
Niranjan Wartikar
Eileen Long
SyDa
CLL
35
2
0
18 Oct 2024
Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz
Satak Kumar Dey
Ruwad Naswan
Hasnaen Adil
Khondker Salman Sayeed
Haz Sameen Shahgir
39
0
0
29 Jun 2024
SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages
N. Ousidhoum
Shamsuddeen Hassan Muhammad
Mohamed Abdalla
Idris Abdulmumin
I. Ahmad
...
Thamar Solorio
Nirmal Surange
Krishnapriya Vishnubhotla
Seid Muhie Yimam
Saif M. Mohammad
50
11
0
27 Mar 2024
Share What You Already Know: Cross-Language-Script Transfer and Alignment for Sentiment Detection in Code-Mixed Data
Niraj Pahari
Kazutaka Shimada
32
0
0
07 Feb 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
35
1
0
25 Jan 2024
Multilingual Bias Detection and Mitigation for Indian Languages
Ankita Maity
Anubhav Sharma
Rudra Dhar
Tushar Abhishek
Manish Gupta
Vasudeva Varma
39
2
0
23 Dec 2023
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments
Neeraj Kumar Singh
Koyel Ghosh
Joy Mahapatra
Utpal Garain
Apurbalal Senapati
22
0
0
20 Dec 2023
From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues
Shivani Kumar
S. Ramaneswaran
Md. Shad Akhtar
Tanmoy Chakraborty
36
23
0
19 Oct 2023
Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi
Md. Nishat Raihan
Dhiman Goswami
Antara Mahmud
53
1
0
19 Sep 2023
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction
Vineet Bhat
P. Jyothi
P. Bhattacharyya
24
0
0
26 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
45
98
0
17 May 2023
H-AES: Towards Automated Essay Scoring for Hindi
Shubhankar K. Singh
Anirudh Pupneja
Shivaansh Mital
Cheril Shah
Manish Bawkar
Lakshman Prasad Gupta
Ajit Kumar
Yaman Kumar Singla
Rushali Gupta
R. Shah
21
6
0
28 Feb 2023
Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
A. Mhaske
Harsh Kedia
Sumanth Doddapaneni
Mitesh M. Khapra
Pratyush Kumar
V. Rudramurthy
Anoop Kunchukuttan
54
26
0
20 Dec 2022
A Twitter BERT Approach for Offensive Language Detection in Marathi
Tanmay Chavan
Shantanu Patankar
Aditya Kane
Omkar Gokhale
Raviraj Joshi
41
11
0
20 Dec 2022
L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for Devanagari based Hindi and Marathi Languages
Raviraj Joshi
52
56
0
21 Nov 2022
Cultural Re-contextualization of Fairness Research in Language Technologies in India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
40
3
0
21 Nov 2022
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi
Tharindu Ranasinghe
Kai North
Damith Premasiri
Marcos Zampieri
35
13
0
18 Nov 2022
Progressive Sentiment Analysis for Code-Switched Text Data
Sudhanshu Ranjan
Dheeraj Mekala
Jingbo Shang
29
4
0
25 Oct 2022
Spread Love Not Hate: Undermining the Importance of Hateful Pre-training for Hate Speech Detection
Omkar Gokhale
Aditya Kane
Shantanu Patankar
Tanmay Chavan
Raviraj Joshi
VLM
35
7
0
09 Oct 2022
Hate Speech and Offensive Language Detection in Bengali
Mithun Das
Somnath Banerjee
Punyajoy Saha
Animesh Mukherjee
28
27
0
07 Oct 2022
Re-contextualizing Fairness in NLP: The Case of India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
32
54
0
25 Sep 2022
Efficient Gender Debiasing of Pre-trained Indic Language Models
Neeraja Kirtane
V. Manushree
Aditya Kane
19
3
0
08 Sep 2022
MASALA: Modelling and Analysing the Semantics of Adpositions in Linguistic Annotation of Hindi
Aryaman Arora
N. Venkateswaran
Nathan Schneider
24
4
0
08 May 2022
HiNER: A Large Hindi Named Entity Recognition Dataset
Rudra Murthy
Pallab Bhattacharjee
R. Sharnagat
Jyotsana Khatri
Diptesh Kanojia
P. Bhattacharyya
40
14
0
28 Apr 2022
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
Divyanshu Aggarwal
V. Gupta
Anoop Kunchukuttan
31
27
0
19 Apr 2022
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages
Gokul Karthik Kumar
Abhishek Singh Gehlot
Sahal Shaji Mullappilly
Karthik Nandakumar
34
13
0
12 Apr 2022
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification
Mithun Das
Somnath Banerjee
Animesh Mukherjee
VLM
22
6
0
25 Mar 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
40
100
0
24 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
24
21
0
03 Mar 2022
Multilingual Abusiveness Identification on Code-Mixed Social Media Text
Ekagra Ranjan
Naman Poddar
29
0
0
01 Mar 2022
TamilEmo: Finegrained Emotion Detection Dataset for Tamil
Charangan Vasantharajan
Sean Benhur
Prasanna Kumar Kumaresan
Rahul Ponnusamy
S. Thangasamy
...
Thenmozhi Durairaj
Kanchana Sivanraju
Anbukkarasi Sampath
Bharathi Raja Chakravarthi
John P. Mccrae
27
5
0
09 Feb 2022
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages
Tushar Abhishek
Shivprasad Sagare
Bhavyajeet Singh
Anubhav Sharma
Manish Gupta
Vasudeva Varma
24
9
0
01 Feb 2022
Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification
Sean Benhur
Roshan Nayak
Kanchana Sivanraju
Adeep Hande
S. Navaneethakrishnan
R. Priyadharshini
Bharathi Raja Chakravarthi
27
1
0
31 Dec 2021
Multilingual Text Classification for Dravidian Languages
Xiaotian Lin
Nankai Lin
Kanoksak Wattanachote
Shengyi Jiang
Lianxi Wang
69
3
0
03 Dec 2021
Ceasing hate with MoH: Hate Speech Detection in Hindi-English Code-Switched Language
Arushi Sharma
Anubha Kabra
Minni Jain
29
52
0
18 Oct 2021
Pretrained Transformers for Offensive Language Identification in Tanglish
Sean Benhur
Kanchana Sivanraju
VLM
53
5
0
06 Oct 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
47
70
0
07 Sep 2021
Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling
Adeep Hande
Karthik Puranik
Konthala Yasaswini
R. Priyadharshini
Sajeetha Thavareesan
Anbukkarasi Sampath
Kogilavani Shanmugavadivel
D. Thenmozhi
Bharathi Raja Chakravarthi
29
29
0
27 Aug 2021
Do Images really do the Talking? Analysing the significance of Images in Tamil Troll meme classification
Siddhanth U Hegde
Adeep Hande
R. Priyadharshini
Sajeetha Thavareesan
Ratnasingam Sakuntharaj
S. Thangasamy
B. Bharathi
Bharathi Raja Chakravarthi
42
7
0
09 Aug 2021
The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding
Archiki Prasad
Mohammad Ali Rehan
Shreyasi Pathak
P. Jyothi
27
9
0
21 Jul 2021
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
43
74
0
01 Jul 2021
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
77
65
0
24 Oct 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,748
0
26 Sep 2016