38
0

Improving Keystep Recognition in Ego-Video via Dexterous Focus

Main:2 Pages
1 Figures
Bibliography:2 Pages
2 Tables
Abstract

In this paper, we address the challenge of understanding human activities from an egocentric perspective. Traditional activity recognition techniques face unique challenges in egocentric videos due to the highly dynamic nature of the head during many activities. We propose a framework that seeks to address these challenges in a way that is independent of network architecture by restricting the ego-video input to a stabilized, hand-focused video. We demonstrate that this straightforward video transformation alone outperforms existing egocentric video baselines on the Ego-Exo4D Fine-Grained Keystep Recognition benchmark without requiring any alteration of the underlying model infrastructure.

View on arXiv
@article{chavis2025_2506.00827,
  title={ Improving Keystep Recognition in Ego-Video via Dexterous Focus },
  author={ Zachary Chavis and Stephen J. Guy and Hyun Soo Park },
  journal={arXiv preprint arXiv:2506.00827},
  year={ 2025 }
}
Comments on this paper