
v1v2 (latest)
The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition
Papers citing "The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition"
18 / 18 papers shown
Title |
---|
![]() WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech
Recognition Binbin Zhang Hang Lv Pengcheng Guo Qijie Shao Chao Yang ...Hui Bu Xiaoyu Chen Chenchen Zeng Di Wu Zhendong Peng |