ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.04470
  4. Cited By
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings,
  Semi-Supervised Conversational Data, and Biased Loss

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

11 August 2020
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
ArXivPDFHTML

Papers citing "PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss"

47 / 47 papers shown
Title
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Yuewei Zhang
Huanbin Zou
Jie Zhu
44
0
0
21 Jan 2025
FINALLY: fast and universal speech enhancement with studio-like quality
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
29
2
0
08 Oct 2024
Spectron: Target Speaker Extraction using Conditional Transformer with
  Adversarial Refinement
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
18
0
0
02 Sep 2024
Textless Acoustic Model with Self-Supervised Distillation for
  Noise-Robust Expressive Speech-to-Speech Translation
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Min-Jae Hwang
Ilia Kulikov
Benjamin Peloquin
Hongyu Gong
Peng-Jen Chen
Ann Lee
29
1
0
04 Jun 2024
SICRN: Advancing Speech Enhancement through State Space Model and
  Inplace Convolution Techniques
SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques
Changjiang Zhao
Shulin He
Xueliang Zhang
21
7
0
22 Feb 2024
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
17
7
0
12 Sep 2023
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic
  Spaces
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Iván Vallés-Pérez
Grzegorz Beringer
Piotr Bilinski
G. Cook
Roberto Barra-Chicote
19
1
0
23 Jul 2023
Inter-SubNet: Speech Enhancement with Subband Interaction
Inter-SubNet: Speech Enhancement with Subband Interaction
Jun Chen
Wei Rao
Z. Wang
Jiuxin Lin
Zhiyong Wu
Yannan Wang
Shidong Shang
Helen M. Meng
11
13
0
09 May 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and
  Complexity via Integrated Full- and Sub-Band Modeling
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
15
10
0
18 Apr 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using
  Joint Complex Masking and Complex Spectral Mapping for Monaural Speech
  Enhancement
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
32
16
0
23 Feb 2023
A Framework for Unified Real-time Personalized and Non-Personalized
  Speech Enhancement
A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Zhepei Wang
Ritwik Giri
Devansh P. Shah
J. Valin
Mike Goodwin
Paris Smaragdis
19
8
0
23 Feb 2023
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
A General Unfolding Speech Enhancement Method Motivated by Taylor's
  Theorem
A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem
Andong Li
Guochen Yu
C. Zheng
Wenzhe Liu
Xiaodong Li
43
10
0
30 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Z. Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
17
16
0
10 Nov 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
13
6
0
28 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
25
14
0
24 Oct 2022
Speech Enhancement with Perceptually-motivated Optimization and Dual
  Transformations
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations
Xucheng Wan
Kai Liu
Z.C. Du
Huan Zhou
10
0
0
24 Sep 2022
Stochastic Restoration of Heavily Compressed Musical Audio using
  Generative Adversarial Networks
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
30
11
0
04 Jul 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time
  Dereverberation Targets
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
13
2
0
16 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
22
95
0
07 Jun 2022
BEHM-GAN: Bandwidth Extension of Historical Music using Generative
  Adversarial Networks
BEHM-GAN: Bandwidth Extension of Historical Music using Generative Adversarial Networks
Eloi Moliner
Vesa Valimaki
18
18
0
13 Apr 2022
Improved singing voice separation with chromagram-based pitch-aware
  remixing
Improved singing voice separation with chromagram-based pitch-aware remixing
Siyuan Yuan
Zhepei Wang
Umut Isik
Ritwik Giri
J. Valin
M. Goodwin
A. Krishnaswamy
16
11
0
28 Mar 2022
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for
  Speech Enhancement
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement
Jun Chen
Z. Wang
Deyi Tuo
Zhiyong Wu
Shiyin Kang
Helen Meng
24
107
0
23 Mar 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
22
54
0
17 Feb 2022
A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings
A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings
Eloi Moliner
Vesa Valimaki
8
24
0
17 Feb 2022
Speech Denoising in the Waveform Domain with Self-Attention
Speech Denoising in the Waveform Domain with Self-Attention
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
18
61
0
15 Feb 2022
Self-Supervised Learning based Monaural Speech Enhancement with
  Complex-Cycle-Consistent
Self-Supervised Learning based Monaural Speech Enhancement with Complex-Cycle-Consistent
Yi Li
Yang Sun
S. M. Naqvi
16
1
0
21 Dec 2021
Hybrid Spectrogram and Waveform Source Separation
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
24
162
0
05 Nov 2021
Continual self-training with bootstrapped remixing for speech
  enhancement
Continual self-training with bootstrapped remixing for speech enhancement
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Anurag Kumar
18
16
0
19 Oct 2021
Leveraging Low-Distortion Target Estimates for Improved Speech
  Enhancement
Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement
Zhong-Qiu Wang
G. Wichern
Jonathan Le Roux
126
15
0
01 Oct 2021
Music Demixing Challenge 2021
Music Demixing Challenge 2021
Yuki Mitsufuji
Giorgio Fabbro
Stefan Uhlich
Fabian-Robert Stöter
Alexandre Défossez
Minseok Kim
Woosung Choi
Chin-Yun Yu
K. Cheuk
18
80
0
31 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
G. Wichern
Jonathan Le Roux
21
71
0
11 Aug 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel
  Speech Enhancement
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
11
141
0
22 Jun 2021
Training Speech Enhancement Systems with Noisy Speech Datasets
Training Speech Enhancement Systems with Noisy Speech Datasets
Koichi Saito
Stefan Uhlich
Giorgio Fabbro
Yuki Mitsufuji
31
11
0
26 May 2021
Dual-Stage Low-Complexity Reconfigurable Speech Enhancement
Dual-Stage Low-Complexity Reconfigurable Speech Enhancement
Jun Yang
Nico Brailovsky
6
1
0
17 May 2021
Separate but Together: Unsupervised Federated Learning for Speech
  Enhancement from Non-IID Data
Separate but Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data
Efthymios Tzinis
Jonah Casebeer
Zhepei Wang
Paris Smaragdis
FedML
24
19
0
11 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion
  Network for Speech Enhancement
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
76
96
0
27 Apr 2021
Complex Spectral Mapping With Attention Based Convolution Recurrent
  Neural Network for Speech Enhancement
Complex Spectral Mapping With Attention Based Convolution Recurrent Neural Network for Speech Enhancement
Liming Zhou
Yongyu Gao
Ziluo Wang
Jiwei Li
Wenbin Zhang
17
16
0
12 Apr 2021
Transformers with Competitive Ensembles of Independent Mechanisms
Transformers with Competitive Ensembles of Independent Mechanisms
Alex Lamb
Di He
Anirudh Goyal
Guolin Ke
Chien-Feng Liao
Mirco Ravanelli
Yoshua Bengio
MoE
26
23
0
27 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with
  Vector-Quantized Autoencoders
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
51
18
0
12 Feb 2021
Enhancing Audio Augmentation Methods with Consistency Learning
Enhancing Audio Augmentation Methods with Consistency Learning
Turab Iqbal
Karim Helwani
A. Krishnaswamy
Wenwu Wang
21
4
0
09 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Hyeong-Seok Choi
Sungjin Park
Jie Hwan Lee
Hoon Heo
Dongsuk Jeon
Kyogu Lee
34
57
0
05 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention
  Module and Joint Time Frequency Losses
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
21
41
0
03 Feb 2021
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time
  Single-Channel Speech Enhancement
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
14
194
0
29 Oct 2020
Investigating Cross-Domain Losses for Speech Enhancement
Investigating Cross-Domain Losses for Speech Enhancement
Sherif Abdulatif
Karim Armanious
Jayasankar T. Sajeev
Karim Guirguis
B. Yang
17
7
0
20 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and
  Continuous Speech Separation
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
24
88
0
04 Oct 2020
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech
  Enhancement
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement
J. Valin
56
190
0
24 Sep 2017
1