Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.13002
Cited By
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
27 April 2021
Feng Dang
Hangting Chen
Pengyuan Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement"
45 / 45 papers shown
Title
PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
Zizhen Lin
Junyu Wang
Ruili Li
Fei Shen
Xi Xuan
69
0
0
27 Feb 2025
Using RLHF to align speech enhancement approaches to mean-opinion quality scores
Anurag Kumar
Andrew Perrault
Donald S. Williamson
16
0
0
17 Oct 2024
A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation
Jingyuan Wang
Jie Zhang
Shihao Chen
Miao Sun
21
0
0
19 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
34
2
0
04 Sep 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
26
1
0
09 Jul 2024
SMRU: Split-and-Merge Recurrent-based UNet for Acoustic Echo Cancellation and Noise Suppression
Zhihang Sun
Andong Li
Rilin Chen
Hao Zhang
Meng Yu
Yi Zhou
Dong Yu
66
0
0
17 Jun 2024
Diffusion Gaussian Mixture Audio Denoise
Pu Wang
Junhui Li
Jialu Li
Liangdong Guo
Youshan Zhang
DiffM
34
0
0
13 Jun 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
40
2
0
07 Jun 2024
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
25
0
10 May 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
27
0
0
19 Apr 2024
Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
George Close
Thomas Hain
Stefan Goetze
29
1
0
18 Mar 2024
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
Shiqi Zhang
Zheng Qiu
Daiki Takeuchi
Noboru Harada
Shoji Makino
13
3
0
13 Feb 2024
Single-channel speech enhancement using learnable loss mixup
Oscar Chang
Dung N. Tran
K. Koishida
45
7
0
20 Dec 2023
Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation
Chenyu Gao
Yue Gu
I. Marsic
18
0
0
20 Nov 2023
DPATD: Dual-Phase Audio Transformer for Denoising
Junhui Li
Pu Wang
Jialu Li
Xinzhe Wang
Youshan Zhang
18
4
0
30 Oct 2023
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
Kamil Akesbi
Dorian Desblancs
Benjamin Martin
37
0
0
20 Oct 2023
Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments
Junkang Yang
Hongqing Liu
Lu Gan
Yi Zhou
10
1
0
09 Oct 2023
Complexity Scaling for Speech Denoising
Hangting Chen
Jianwei Yu
Chao Weng
21
2
0
14 Sep 2023
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Hangting Chen
Jianwei Yu
Yi Luo
Rongzhi Gu
Weihua Li
Zhuocheng Lu
Chao Weng
21
6
0
21 Aug 2023
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
25
7
0
17 Aug 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
21
4
0
09 Jun 2023
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
Yonghong Yan
22
0
0
15 May 2023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Yuchen Hu
Cheng Chen
Qiu-shi Zhu
E. Chng
22
15
0
11 Apr 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
29
16
0
23 Feb 2023
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Shuo Wang
Xiangyu Kong
Xiulian Peng
H. Movassagh
Vinod Prakash
Yan Lu
23
11
0
21 Feb 2023
THLNet: two-stage heterogeneous lightweight network for monaural speech enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
15
2
0
19 Jan 2023
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
21
3
0
21 Dec 2022
NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer
Changsheng Quan
Xiaofei Li
30
2
0
05 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
29
119
0
22 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Z. Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
17
16
0
10 Nov 2022
TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain
Yiwen Wang
Zijian Lan
Xihong Wu
T. Qu
15
1
0
30 Oct 2022
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks
Vasily Zadorozhnyy
Qian Ye
K. Koishida
18
8
0
26 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
22
14
0
24 Oct 2022
Music Source Separation with Band-split RNN
Yi Luo
Jianwei Yu
54
107
0
30 Sep 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
21
61
0
22 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
74
96
0
08 Sep 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Yen-Ju Lu
Xuankai Chang
Chenda Li
Wangyou Zhang
Samuele Cornell
...
Robin Scheibler
Zhong-Qiu Wang
Yu Tsao
Y. Qian
Shinji Watanabe
VLM
19
28
0
19 Jul 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Rong-Yu Chao
Cheng Yu
Szu-Wei Fu
Xugang Lu
Yu Tsao
VLM
25
14
0
31 Mar 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
21
91
0
28 Mar 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
17
25
0
11 Dec 2021
S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement
Shubo Lv
Yihui Fu
Mengtao Xing
Jiayao Sun
Lei Xie
Jun Huang
Yannan Wang
Tao Yu
6
54
0
16 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
21
49
0
11 Nov 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
62
280
0
28 Jul 2020
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
195
5,176
0
16 Sep 2016
1