EmoSign: A Multimodal Dataset for Understanding Emotions in American Sign Language

Unlike spoken languages, where the use of prosodic features to convey emotion is well studied, indicators of emotion in sign language remain poorly understood, creating communication barriers in critical settings. Sign languages present unique challenges because facial expressions and hand movements simultaneously serve both grammatical and emotional functions. To address this gap, we introduce EmoSign, the first sign video dataset containing sentiment and emotion labels for 200 American Sign Language (ASL) videos. We also collect open-ended descriptions of emotion cues. Annotations were provided by three Deaf ASL signers with professional interpretation experience. Alongside the annotations, we include baseline models for sentiment and emotion classification. This dataset not only addresses a critical gap in existing sign language research but also establishes a new benchmark for understanding model capabilities in multimodal emotion recognition for sign languages. The dataset is made available at this https URL.
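For illustration only, the following is a minimal Python sketch of how one annotation record in a dataset of this kind might be represented. The field names (video_id, sentiment, emotions, cue_description) and example values are assumptions for clarity, not the dataset's actual schema.

from dataclasses import dataclass, field
from typing import List

# Hypothetical record layout for an EmoSign-style annotation file;
# field names are illustrative assumptions, not the released schema.
@dataclass
class EmoSignRecord:
    video_id: str                                       # identifier of the ASL video clip
    sentiment: float                                    # scalar sentiment rating (e.g., negative to positive)
    emotions: List[str] = field(default_factory=list)   # emotion labels assigned by annotators
    cue_description: str = ""                           # open-ended description of emotion cues

# Example usage with made-up values
record = EmoSignRecord(
    video_id="asl_0001",
    sentiment=0.6,
    emotions=["joy"],
    cue_description="raised eyebrows and enlarged signing space",
)
print(record)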
@article{chua2025_2505.17090,
  title={EmoSign: A Multimodal Dataset for Understanding Emotions in American Sign Language},
  author={Phoebe Chua and Cathy Mengying Fang and Takehiko Ohkawa and Raja Kushalnagar and Suranga Nanayakkara and Pattie Maes},
  journal={arXiv preprint arXiv:2505.17090},
  year={2025}
}