ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01386
23
1

Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays

3 July 2023
Yijiang Chen
Chen Liang
Xiao-Lei Zhang
ArXivPDFHTML
Abstract

The performance of speaker verification degrades significantly in adverse acoustic environments with strong reverberation and noise. To address this issue, this paper proposes a spatial-temporal graph convolutional network (GCN) method for the multi-channel speaker verification with ad-hoc microphone arrays. It includes a feature aggregation block and a channel selection block, both of which are built on graphs. The feature aggregation block fuses speaker features among different time and channels by a spatial-temporal GCN. The graph-based channel selection block discards the noisy channels that may contribute negatively to the system. The proposed method is flexible in incorporating various kinds of graphs and prior knowledge. We compared the proposed method with six representative methods in both real-world and simulated environments. Experimental results show that the proposed method achieves a relative equal error rate (EER) reduction of 15.39%\mathbf{15.39\%}15.39% lower than the strongest referenced method in the simulated datasets, and 17.70%\mathbf{17.70\%}17.70% lower than the latter in the real datasets. Moreover, its performance is robust across different signal-to-noise ratios and reverberation time.

View on arXiv
Comments on this paper