ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.11023
  4. Cited By
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature

22 November 2021
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
ArXiv (abs)PDFHTML

Papers citing "Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature"

9 / 9 papers shown
Title
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Ju Lin
Niko Moritz
Yiteng Huang
Ruiming Xie
Ming Sun
Christian Fuegen
Frank Seide
92
7
0
18 Jan 2024
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech
  Recognition in Multi-Channel Multi-Speaker Scenarios
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
128
0
0
31 Oct 2023
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder
  and Input Feature Analysis
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
Can Cui
Imran A. Sheikh
Mostafa Sadeghi
Emmanuel Vincent
68
4
0
16 Oct 2023
Measuring Acoustics with Collaborative Multiple Agents
Measuring Acoustics with Collaborative Multiple Agents
Yinfeng Yu
Changan Chen
Lele Cao
Fangkai Yang
Gang Hua
82
1
0
09 Oct 2023
Challenges and Insights: Exploring 3D Spatial Features and Complex
  Networks on the MISP Dataset
Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset
Yiwen Shao
47
0
0
05 Oct 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation
  and Recognition
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
66
12
0
06 Jul 2023
End-to-End Integration of Speech Recognition, Dereverberation,
  Beamforming, and Self-Supervised Learning Representation
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Yoshiki Masuyama
Xuankai Chang
Samuele Cornell
Shinji Watanabe
Nobutaka Ono
79
19
0
19 Oct 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and
  Recognition in Real Multiparty Conversational Environments
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Yicheng Du
Aditya Arie Nugraha
Kouhei Sekiguchi
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
52
0
0
15 Jul 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D
  Scenes
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Tianyi Zhou
AI4CE
117
36
0
18 May 2022
1