ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.03697
30
1

An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

8 January 2024
Ru Han
Xiaopeng Yan
Weiming Xu
Pengcheng Guo
Jiayao Sun
He Wang
Quan Lu
Ning Jiang
Lei Xie
ArXivPDFHTML
Abstract

This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different extraction strategies based on the audio quality, striking a balance between interference removal and speech preservation, which benifits the back-end automatic speech recognition (ASR) systems. Experiments show that our approach achieves a character error rate (CER) of 24.2% and 33.2% on the Dev and Eval set, respectively, obtaining the second place in the challenge.

View on arXiv
Comments on this paper