2008.04470
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
11 August 2020
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
Papers citing "PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss"
47 / 47 papers shown
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Yuewei Zhang
Huanbin Zou
Jie Zhu
44
0
0
21 Jan 2025
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
29
2
0
08 Oct 2024
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
18
0
0
02 Sep 2024
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Min-Jae Hwang
Ilia Kulikov
Benjamin Peloquin
Hongyu Gong
Peng-Jen Chen
Ann Lee
29
1
0
04 Jun 2024
SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques
Changjiang Zhao
Shulin He
Xueliang Zhang
21
7
0
22 Feb 2024
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
17
7
0
12 Sep 2023
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Iván Vallés-Pérez
Grzegorz Beringer
Piotr Bilinski
G. Cook
Roberto Barra-Chicote
19
1
0
23 Jul 2023
Inter-SubNet: Speech Enhancement with Subband Interaction
Jun Chen
Wei Rao
Z. Wang
Jiuxin Lin
Zhiyong Wu
Yannan Wang
Shidong Shang
Helen M. Meng
11
13
0
09 May 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
15
10
0
18 Apr 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
32
16
0
23 Feb 2023
A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Zhepei Wang
Ritwik Giri
Devansh P. Shah
J. Valin
Mike Goodwin
Paris Smaragdis
19
8
0
23 Feb 2023
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem
Andong Li
Guochen Yu
C. Zheng
Wenzhe Liu
Xiaodong Li
43
10
0
30 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Z. Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
17
16
0
10 Nov 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
13
6
0
28 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
25
14
0
24 Oct 2022
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations
Xucheng Wan
Kai Liu
Z.C. Du
Huan Zhou
10
0
0
24 Sep 2022
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
30
11
0
04 Jul 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
13
2
0
16 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
22
95
0
07 Jun 2022
BEHM-GAN: Bandwidth Extension of Historical Music using Generative Adversarial Networks
Eloi Moliner
Vesa Valimaki
18
18
0
13 Apr 2022
Improved singing voice separation with chromagram-based pitch-aware remixing
Siyuan Yuan
Zhepei Wang
Umut Isik
Ritwik Giri
J. Valin
M. Goodwin
A. Krishnaswamy
16
11
0
28 Mar 2022
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement
Jun Chen
Z. Wang
Deyi Tuo
Zhiyong Wu
Shiyin Kang
Helen Meng
24
107
0
23 Mar 2022
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
22
54
0
17 Feb 2022
A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings
Eloi Moliner
Vesa Valimaki
8
24
0
17 Feb 2022
Speech Denoising in the Waveform Domain with Self-Attention
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
18
61
0
15 Feb 2022
Self-Supervised Learning based Monaural Speech Enhancement with Complex-Cycle-Consistent
Yi Li
Yang Sun
S. M. Naqvi
16
1
0
21 Dec 2021
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
24
162
0
05 Nov 2021
Continual self-training with bootstrapped remixing for speech enhancement
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Anurag Kumar
18
16
0
19 Oct 2021
Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement
Zhong-Qiu Wang
G. Wichern
Jonathan Le Roux
126
15
0
01 Oct 2021
Music Demixing Challenge 2021
Yuki Mitsufuji
Giorgio Fabbro
Stefan Uhlich
Fabian-Robert Stöter
Alexandre Défossez
Minseok Kim
Woosung Choi
Chin-Yun Yu
K. Cheuk
18
80
0
31 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
G. Wichern
Jonathan Le Roux
21
71
0
11 Aug 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
11
141
0
22 Jun 2021
Training Speech Enhancement Systems with Noisy Speech Datasets
Koichi Saito
Stefan Uhlich
Giorgio Fabbro
Yuki Mitsufuji
31
11
0
26 May 2021
Dual-Stage Low-Complexity Reconfigurable Speech Enhancement
Jun Yang
Nico Brailovsky
6
1
0
17 May 2021
Separate but Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data
Efthymios Tzinis
Jonah Casebeer
Zhepei Wang
Paris Smaragdis
FedML
24
19
0
11 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
76
96
0
27 Apr 2021
Complex Spectral Mapping With Attention Based Convolution Recurrent Neural Network for Speech Enhancement
Liming Zhou
Yongyu Gao
Ziluo Wang
Jiwei Li
Wenbin Zhang
17
16
0
12 Apr 2021
Transformers with Competitive Ensembles of Independent Mechanisms
Alex Lamb
Di He
Anirudh Goyal
Guolin Ke
Chien-Feng Liao
Mirco Ravanelli
Yoshua Bengio
MoE
26
23
0
27 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
51
18
0
12 Feb 2021
Enhancing Audio Augmentation Methods with Consistency Learning
Turab Iqbal
Karim Helwani
A. Krishnaswamy
Wenwu Wang
21
4
0
09 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Hyeong-Seok Choi
Sungjin Park
Jie Hwan Lee
Hoon Heo
Dongsuk Jeon
Kyogu Lee
34
57
0
05 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
21
41
0
03 Feb 2021
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
14
194
0
29 Oct 2020
Investigating Cross-Domain Losses for Speech Enhancement
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
17
7
0
20 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
24
88
0
04 Oct 2020
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement
J. Valin
56
190
0
24 Sep 2017