ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.11131
  4. Cited By
Unifying Speech Enhancement and Separation with Gradient Modulation for
  End-to-End Noise-Robust Speech Separation

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation

22 February 2023
Yuchen Hu
Chen Chen
Heqing Zou
Xionghu Zhong
Chng Eng Siong
ArXivPDFHTML

Papers citing "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"

13 / 13 papers shown
Title
Leveraging Modality-specific Representations for Audio-visual Speech
  Recognition via Reinforcement Learning
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Chen Chen
Yuchen Hu
Qiang Zhang
Heqing Zou
Beier Zhu
Eng Siong Chng
58
28
0
10 Dec 2022
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by
  Combining Regression and Improved Contrastive Learning
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Qiu-shi Zhu
Long Zhou
Jie Zhang
Shujie Liu
Yu-Chen Hu
Lirong Dai
VLM
SSL
71
37
0
27 Oct 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation
  Systems
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li
Xiaolin Hu
Yi Luo
41
16
0
15 Jun 2022
Self-critical Sequence Training for Automatic Speech Recognition
Self-critical Sequence Training for Automatic Speech Recognition
Chen Chen
Yuchen Hu
Nana Hou
Xiaofeng Qi
Heqing Zou
Chng Eng Siong
51
15
0
13 Apr 2022
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain
  Data
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Chen Chen
Nana Hou
Yuchen Hu
Shashank Shirol
Chng Eng Siong
NoLa
72
43
0
29 Mar 2022
Interactive Feature Fusion for End-to-End Noise-Robust Speech
  Recognition
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Nana Hou
Chen Chen
Chng Eng Siong
41
40
0
11 Oct 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at
  IWSLT 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
Dan Liu
Mengge Du
Xiaoxi Li
Yuchen Hu
Lirong Dai
60
21
0
01 Jul 2021
Attention is All You Need in Speech Separation
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
86
557
0
25 Oct 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
78
263
0
20 Feb 2020
Dual-path RNN: efficient long sequence modeling for time-domain
  single-channel speech separation
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
70
770
0
14 Oct 2019
SDR - half-baked or well done?
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
138
1,191
0
06 Nov 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
144
1,783
0
20 Sep 2018
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
276
18,587
0
06 Feb 2015
1