Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.05267
Cited By
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
11 October 2021
Yuchen Hu
Nana Hou
Chen Chen
Chng Eng Siong
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition"
25 / 25 papers shown
Title
A 71.2-
μ
μ
μ
W Speech Recognition Accelerator with Recurrent Spiking Neural Network
Chih-Chyau Yang
Tian-Sheuan Chang
60
1
0
27 Mar 2025
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention
Ruijie Tao
Xinyuan Qian
Yidi Jiang
Junjie Li
Jiadong Wang
Haizhou Li
34
1
0
29 Apr 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
17
3
0
11 Dec 2023
FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition
Dongning Yang
Wei Wang
Yanmin Qian
13
3
0
29 Nov 2023
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection
Cunhang Fan
Mingming Ding
Jianhua Tao
Ruibo Fu
Jiangyan Yi
Zhengqi Wen
Zhao Lv
37
4
0
13 Oct 2023
Enhancing Code-switching Speech Recognition with Interactive Language Biases
Hexin Liu
Leibny Paola García
Jingze Lu
Wenchao Wang
Sanjeev Khudanpur
16
11
0
29 Sep 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
DiffM
16
9
0
16 Jul 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
Yuchen Hu
Chen Chen
Ruizhe Li
Heqing Zou
Chng Eng Siong
GAN
42
9
0
18 Jun 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Yuchen Hu
Ruizhe Li
Cheng Chen
Chengwei Qin
Qiu-shi Zhu
E. Chng
31
5
0
18 Jun 2023
speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition
Haoyu Lu
Nan Li
Tongtong Song
Longbiao Wang
J. Dang
Xiaobao Wang
Shiliang Zhang
NoLa
22
3
0
29 May 2023
On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
L. Bansal
S. P. Dubagunta
Malolan Chetlur
Pushpak Jagtap
A. Ganapathiraju
22
1
0
21 May 2023
Noise-Aware Speech Separation with Contrastive Learning
Zizheng Zhang
Cheng Chen
Hsin-Hung Chen
Xiang Liu
Yuchen Hu
E. Chng
28
5
0
18 May 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
Yuchen Hu
Ruizhe Li
Chen Chen
Heqing Zou
Qiu-shi Zhu
E. Chng
28
7
0
16 May 2023
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
38
10
0
02 May 2023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Yuchen Hu
Cheng Chen
Qiu-shi Zhu
E. Chng
22
15
0
11 Apr 2023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
40
19
0
23 Feb 2023
Unsupervised Noise adaptation using Data Simulation
Chen Chen
Yuchen Hu
Heqing Zou
Linhui Sun
Chng Eng Siong
33
13
0
23 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
35
15
0
22 Feb 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Yuchen Hu
Chen Chen
Heqing Zou
Xionghu Zhong
Chng Eng Siong
47
16
0
22 Feb 2023
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Chen Chen
Yuchen Hu
Qiang Zhang
Heqing Zou
Beier Zhu
E. Chng
33
26
0
10 Dec 2022
Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Yuhang Yang
Haihua Xu
Hao-Ming Huang
E. Chng
Sheng Li
44
7
0
01 Nov 2022
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Qiu-shi Zhu
Long Zhou
Jie Zhang
Shujie Liu
Yu-Chen Hu
Lirong Dai
VLM
SSL
60
37
0
27 Oct 2022
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition
Zixun Guo
C. Chen
Chng Eng Siong
25
5
0
01 Aug 2022
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-shi Zhu
Jie Zhang
Zitian Zhang
Lirong Dai
43
15
0
26 May 2022
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Nana Hou
Chen Chen
Chng Eng Siong
24
13
0
28 Mar 2022
1