Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks

Abstract

This paper presents a novel approach to F0 detection that uses Convolutional Neural Networks and image processing techniques to estimate pitch directly from spectrogram images. The approach achieves high detection accuracy: 92% of predicted pitch contours show strong or moderate correlation with the true pitch contours. In an experimental comparison with other state-of-the-art CNN methods, it improves the detection rate by approximately 5% across various Signal-to-Noise Ratio conditions.
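
To make the contour-level evaluation concrete, the following is a minimal sketch of how a predicted pitch contour could be scored against a reference contour. It assumes the Pearson correlation coefficient and illustrative cut-offs of 0.7 ("strong") and 0.4 ("moderate"); the abstract does not state which correlation measure or thresholds were used, so the function names `pearson` and `correlation_label` and both cut-offs are hypothetical.

```python
import math


def pearson(reference, predicted):
    """Pearson correlation between two equal-length pitch contours (Hz per frame)."""
    if len(reference) != len(predicted) or len(reference) < 2:
        raise ValueError("contours must have the same length (at least 2 frames)")
    mean_ref = sum(reference) / len(reference)
    mean_pred = sum(predicted) / len(predicted)
    dev_ref = [v - mean_ref for v in reference]
    dev_pred = [v - mean_pred for v in predicted]
    numerator = sum(a * b for a, b in zip(dev_ref, dev_pred))
    denominator = math.sqrt(sum(a * a for a in dev_ref) * sum(b * b for b in dev_pred))
    if denominator == 0:
        raise ValueError("correlation is undefined for a constant contour")
    return numerator / denominator


def correlation_label(r, strong=0.7, moderate=0.4):
    """Bucket a correlation value; the thresholds are assumptions, not the paper's."""
    if r >= strong:
        return "strong"
    if r >= moderate:
        return "moderate"
    return "weak"


# Toy contours in Hz, invented purely for illustration.
true_f0 = [100.0, 110.0, 120.0, 130.0, 140.0]
pred_f0 = [101.0, 111.0, 119.0, 131.0, 139.0]
r = pearson(true_f0, pred_f0)
print(correlation_label(r))
```

Under these assumptions, a figure such as the reported 92% would correspond to the share of test utterances whose label is "strong" or "moderate".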

@article{zhao2025_2504.06165,
  title={Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks},
  author={Xufang Zhao and Omer Tsimhoni},
  journal={arXiv preprint arXiv:2504.06165},
  year={2025}
}