9
0

BTPD: A Multilingual Hand-curated Dataset of Bengali Transnational Political Discourse Across Online Communities

Main:6 Pages
2 Figures
Bibliography:3 Pages
6 Tables
Abstract

Understanding political discourse in online spaces is crucial for analyzing public opinion and ideological polarization. While social computing and computational linguistics have explored such discussions in English, such research efforts are significantly limited in major yet under-resourced languages like Bengali due to the unavailability of datasets. In this paper, we present a multilingual dataset of Bengali transnational political discourse (BTPD) collected from three online platforms, each representing distinct community structures and interaction dynamics. Besides describing how we hand-curated the dataset through community-informed keyword-based retrieval, this paper also provides a general overview of its topics and multilingual content.

View on arXiv
@article{das2025_2506.06813,
  title={ BTPD: A Multilingual Hand-curated Dataset of Bengali Transnational Political Discourse Across Online Communities },
  author={ Dipto Das and Syed Ishtiaque Ahmed and Shion Guha },
  journal={arXiv preprint arXiv:2506.06813},
  year={ 2025 }
}
Comments on this paper