ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.17004
  4. Cited By
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain

Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain

31 March 2022
Simon Welker
Julius Richter
Timo Gerkmann
    DiffM
ArXivPDFHTML

Papers citing "Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain"

50 / 67 papers shown
Title
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
Da-Hee Yang
Jaeuk Lee
Joon-Hyuk Chang
VLM
AI4CE
33
0
0
03 May 2025
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers
Heitor R. Guimarães
Jiaqi Su
Rithesh Kumar
Tiago H. Falk
Zeyu Jin
DiffM
30
2
0
13 Apr 2025
On the Design of Diffusion-based Neural Speech Codecs
On the Design of Diffusion-based Neural Speech Codecs
Pietro Foti
Andreas Brendel
DiffM
36
0
0
11 Apr 2025
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation
Congyi Fan
Jian Guan
Xuanjia Zhao
Dongli Xu
Youtian Lin
Tong Ye
Pengming Feng
Haiwei Pan
49
0
0
21 Mar 2025
Bilingual Dual-Head Deep Model for Parkinson's Disease Detection from Speech
Moreno La Quatra
Juan Rafael Orozco-Arroyave
Marco Sabato Siniscalchi
50
0
0
13 Mar 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Shangwen Zhu
Han Zhang
Zhantao Yang
Qianyu Peng
Zhao Pu
Haoran Wang
Fan Cheng
DiffM
48
0
0
12 Mar 2025
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
H. Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
67
0
0
10 Mar 2025
FlowDec: A flow-based full-band general audio codec with high perceptual quality
Simon Welker
Matthew Le
Ricky T. Q. Chen
Wei-Ning Hsu
Timo Gerkmann
Alexander Richard
Yi-Chiao Wu
60
0
0
03 Mar 2025
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Boyi Kang
Xinfa Zhu
Zihan Zhang
Zhen Ye
Mingshuai Liu
...
Jun Chen
Longshuai Xiao
Chao Weng
Wei Xue
Lei Xie
AuLLM
55
3
0
01 Mar 2025
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec
Haoyang Li
J. Yip
Tianyu Fan
Eng Siong Chng
54
0
0
22 Feb 2025
RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior
RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior
Ching Hua Lee
Chouchang Yang
Jaejin Cho
Yashas Malur Saidutta
R. S. Srinivasa
Yilin Shen
Hongxia Jin
DiffM
85
0
0
19 Feb 2025
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Brandon Woodard
Margarita Geleta
Joseph J. LaViola Jr.
Andrea Fanelli
Rhonda Wilson
57
2
0
05 Feb 2025
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Yi-Chiao Wu
Dejan Marković
Steven Krenn
I. D. Gebru
Alexander Richard
66
0
0
04 Feb 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Jinwei Dong
Xinsheng Wang
Qirong Mao
63
0
0
28 Jan 2025
Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
Sagnik Bhattacharya
Muhammad Ahmed Mohsin
Ahsan Bilal
John M. Cioffi
43
1
0
20 Jan 2025
FINALLY: fast and universal speech enhancement with studio-like quality
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
29
2
0
08 Oct 2024
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
Chengzhong Wang
Jianjun Gu
Dingding Yao
Junfeng Li
Yonghong Yan
DiffM
131
0
0
23 Sep 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based
  Speech and Vocal Enhancement
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
46
0
0
15 Sep 2024
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow
  Matching
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching
Zhengyang Chen
Bing Han
Shuai Wang
Yidi Jiang
Yanmin Qian
48
0
0
07 Sep 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Jean-Marie Lemercier
Eloi Moliner
Simon Welker
Vesa Valimaki
Timo Gerkmann
51
2
0
14 Aug 2024
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech
  Enhancement
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech Enhancement
Zhongshu Hou
Tong Lei
Qinwen Hu
Zhanzhong Cao
Ming Tang
Jing Lu
32
0
0
24 Jun 2024
Stability and Generalizability in SDE Diffusion Models with
  Measure-Preserving Dynamics
Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics
Weitong Zhang
Chengqi Zang
Liu Li
Sarah Cechnicka
Cheng Ouyang
Bernhard Kainz
DiffM
28
2
0
19 Jun 2024
Diffusion-based Generative Modeling with Discriminative Guidance for
  Streamable Speech Enhancement
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
Chenda Li
Samuele Cornell
Shinji Watanabe
Yanmin Qian
DiffM
34
2
0
19 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
34
3
0
11 Jun 2024
Thunder : Unified Regression-Diffusion Speech Enhancement with a Single
  Reverse Step using Brownian Bridge
Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge
Thanapat Trachu
Chawan Piansaddhayanon
E. Chuangsuwanich
34
2
0
10 Jun 2024
A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
Yiyuan Yang
Ming Jin
Haomin Wen
Chaoli Zhang
Yuxuan Liang
...
Bin Yang
Zenglin Xu
Jiang Bian
Shirui Pan
Qingsong Wen
DiffM
AI4TS
SyDa
37
39
0
29 Apr 2024
An Overview of Diffusion Models: Applications, Guided Generation,
  Statistical Rates and Optimization
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
37
48
0
11 Apr 2024
On the Asymptotic Mean Square Error Optimality of Diffusion Models
On the Asymptotic Mean Square Error Optimality of Diffusion Models
B. Fesl
Benedikt Bock
Florian Strasser
Michael Baur
M. Joham
Wolfgang Utschick
DiffM
33
0
0
05 Mar 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up
  Speech Diffusion Model
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model
Xiangyu Zhang
Daijiao Liu
Hexin Liu
Qiquan Zhang
Hanyu Meng
Leibny Paola García
Chng Eng Siong
Lina Yao
DiffM
25
2
0
16 Feb 2024
Diffusion Models for Audio Restoration
Diffusion Models for Audio Restoration
Jean-Marie Lemercier
Julius Richter
Simon Welker
Eloi Moliner
Vesa Valimaki
Timo Gerkmann
38
16
0
15 Feb 2024
An Analysis of the Variance of Diffusion-based Speech Enhancement
An Analysis of the Variance of Diffusion-based Speech Enhancement
Bunlong Lay
Timo Gerkmann
DiffM
17
0
0
01 Feb 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic
  Control Using Multi-Objective Learning
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
17
3
0
11 Dec 2023
Investigating the Design Space of Diffusion Models for Speech
  Enhancement
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
30
6
0
07 Dec 2023
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions
  Using a Heun-Based Sampler
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
21
4
0
05 Dec 2023
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust
  Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
21
24
0
08 Nov 2023
Conditional Diffusion Model for Target Speaker Extraction
Conditional Diffusion Model for Target Speaker Extraction
Theodor Nguyen
Guangzhi Sun
Xianrui Zheng
Chao Zhang
0031 Philip C. Woodland
DiffM
41
4
0
07 Oct 2023
Unsupervised speech enhancement with diffusion-based generative models
Unsupervised speech enhancement with diffusion-based generative models
Berné Nortier
Mostafa Sadeghi
Romain Serizel
DiffM
22
7
0
19 Sep 2023
Single and Few-step Diffusion for Generative Speech Enhancement
Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay
Jean-Marie Lemercier
Julius Richter
Timo Gerkmann
DiffM
24
9
0
18 Sep 2023
Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Zilu Guo
Jun Du
Chin-Hui Lee
25
0
0
17 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
37
3
0
03 Sep 2023
Target Speech Extraction with Conditional Diffusion Model
Target Speech Extraction with Conditional Diffusion Model
Naoyuki Kamo
Marc Delcroix
Tomohiro Nakatan
DiffM
31
16
0
08 Aug 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
DiffM
16
9
0
16 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic
  Literature Review
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
29
25
0
07 Jul 2023
Self-supervised learning with diffusion-based multichannel speech
  enhancement for speaker verification under noisy conditions
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
19
1
0
05 Jul 2023
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration
  Model
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
Jean-Marie Lemercier
J. Thiemann
Raphael Koning
Timo Gerkmann
DiffM
31
1
0
22 Jun 2023
Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Jean-Marie Lemercier
Simon Welker
Timo Gerkmann
DiffM
23
5
0
21 Jun 2023
Variance-Preserving-Based Interpolation Diffusion Models for Speech
  Enhancement
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Zilu Guo
Jun Du
Chin-Hui Lee
Yu Gao
Wen-bo Zhang
DiffM
29
10
0
14 Jun 2023
Audio-Visual Speech Enhancement with Score-Based Generative Models
Audio-Visual Speech Enhancement with Score-Based Generative Models
Julius Richter
Simone Frintrop
Timo Gerkmann
DiffM
18
10
0
02 Jun 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
19
7
0
02 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
51
4
0
23 May 2023
12
Next