GS_DravidianLangTech@2025: Women Targeted Abusive Texts Detection on Social Media

The growing misuse of social media is a concern, and technological solutions are being developed to moderate its content effectively. This paper focuses on detecting abusive texts that target women on social media platforms. Abusive speech refers to communication intended to harm or incite hatred against vulnerable individuals or groups. To identify such language, we used logistic regression and BERT as base models, trained on datasets sourced from DravidianLangTech@2025 for the Tamil and Malayalam languages. Evaluated on the test datasets, the models achieved macro F1 scores of 0.729 for BERT and 0.6279 for logistic regression, in Tamil and Malayalam, respectively.
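Because the results are reported as macro F1, a minimal sketch of how that metric is computed may help when reading the scores. It is not the authors' evaluation code: the function name `macro_f1`, the label strings, and the toy predictions are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Return the unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        raise ValueError("macro_f1 needs at least one label")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    per_class = []
    for label in labels:
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class.append(2 * tp[label] / denom if denom else 0.0)
    return sum(per_class) / len(per_class)


# Hypothetical toy labels, not taken from the DravidianLangTech@2025 data.
y_true = ["abusive", "abusive", "non-abusive", "non-abusive", "non-abusive"]
y_pred = ["abusive", "non-abusive", "non-abusive", "non-abusive", "abusive"]
score = macro_f1(y_true, y_pred)
print(f"macro F1 = {score:.4f}")
```

Macro averaging gives every class the same weight regardless of how many examples it has, so weak performance on a minority class lowers the score as much as weak performance on the majority class.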
@article{bade2025_2504.02863,
  title={GS_DravidianLangTech@2025: Women Targeted Abusive Texts Detection on Social Media},
  author={Girma Yohannis Bade and Zahra Ahani and Olga Kolesnikova and José Luis Oropeza and Grigori Sidorov},
  journal={arXiv preprint arXiv:2504.02863},
  year={2025}
}