Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.17004
Cited By
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
31 March 2022
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain"
50 / 67 papers shown
Title
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
Da-Hee Yang
Jaeuk Lee
Joon-Hyuk Chang
VLM
AI4CE
33
0
0
03 May 2025
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers
Heitor R. Guimarães
Jiaqi Su
Rithesh Kumar
Tiago H. Falk
Zeyu Jin
DiffM
30
2
0
13 Apr 2025
On the Design of Diffusion-based Neural Speech Codecs
Pietro Foti
Andreas Brendel
DiffM
36
0
0
11 Apr 2025
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation
Congyi Fan
Jian Guan
Xuanjia Zhao
Dongli Xu
Youtian Lin
Tong Ye
Pengming Feng
Haiwei Pan
49
0
0
21 Mar 2025
Bilingual Dual-Head Deep Model for Parkinson's Disease Detection from Speech
Moreno La Quatra
Juan Rafael Orozco-Arroyave
Marco Sabato Siniscalchi
50
0
0
13 Mar 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Shangwen Zhu
Han Zhang
Zhantao Yang
Qianyu Peng
Zhao Pu
Haoran Wang
Fan Cheng
DiffM
48
0
0
12 Mar 2025
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
H. Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
67
0
0
10 Mar 2025
FlowDec: A flow-based full-band general audio codec with high perceptual quality
Simon Welker
Matthew Le
Ricky T. Q. Chen
Wei-Ning Hsu
Timo Gerkmann
Alexander Richard
Yi-Chiao Wu
60
0
0
03 Mar 2025
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Boyi Kang
Xinfa Zhu
Zihan Zhang
Zhen Ye
Mingshuai Liu
...
Jun Chen
Longshuai Xiao
Chao Weng
Wei Xue
Lei Xie
AuLLM
55
3
0
01 Mar 2025
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec
Haoyang Li
J. Yip
Tianyu Fan
Eng Siong Chng
54
0
0
22 Feb 2025
RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior
Ching Hua Lee
Chouchang Yang
Jaejin Cho
Yashas Malur Saidutta
R. S. Srinivasa
Yilin Shen
Hongxia Jin
DiffM
85
0
0
19 Feb 2025
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Brandon Woodard
Margarita Geleta
Joseph J. LaViola Jr.
Andrea Fanelli
Rhonda Wilson
57
2
0
05 Feb 2025
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Yi-Chiao Wu
Dejan Marković
Steven Krenn
I. D. Gebru
Alexander Richard
66
0
0
04 Feb 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Jinwei Dong
Xinsheng Wang
Qirong Mao
63
0
0
28 Jan 2025
Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
Sagnik Bhattacharya
Muhammad Ahmed Mohsin
Ahsan Bilal
John M. Cioffi
43
1
0
20 Jan 2025
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
29
2
0
08 Oct 2024
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
Chengzhong Wang
Jianjun Gu
Dingding Yao
Junfeng Li
Yonghong Yan
DiffM
131
0
0
23 Sep 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
46
0
0
15 Sep 2024
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching
Zhengyang Chen
Bing Han
Shuai Wang
Yidi Jiang
Yanmin Qian
48
0
0
07 Sep 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Jean-Marie Lemercier
Eloi Moliner
Simon Welker
Vesa Valimaki
Timo Gerkmann
51
2
0
14 Aug 2024
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech Enhancement
Zhongshu Hou
Tong Lei
Qinwen Hu
Zhanzhong Cao
Ming Tang
Jing Lu
32
0
0
24 Jun 2024
Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics
Weitong Zhang
Chengqi Zang
Liu Li
Sarah Cechnicka
Cheng Ouyang
Bernhard Kainz
DiffM
28
2
0
19 Jun 2024
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
Chenda Li
Samuele Cornell
Shinji Watanabe
Yanmin Qian
DiffM
34
2
0
19 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
34
3
0
11 Jun 2024
Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge
Thanapat Trachu
Chawan Piansaddhayanon
E. Chuangsuwanich
34
2
0
10 Jun 2024
A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
Yiyuan Yang
Ming Jin
Haomin Wen
Chaoli Zhang
Yuxuan Liang
...
Bin Yang
Zenglin Xu
Jiang Bian
Shirui Pan
Qingsong Wen
DiffM
AI4TS
SyDa
37
39
0
29 Apr 2024
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
37
48
0
11 Apr 2024
On the Asymptotic Mean Square Error Optimality of Diffusion Models
B. Fesl
Benedikt Bock
Florian Strasser
Michael Baur
M. Joham
Wolfgang Utschick
DiffM
33
0
0
05 Mar 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model
Xiangyu Zhang
Daijiao Liu
Hexin Liu
Qiquan Zhang
Hanyu Meng
Leibny Paola García
Chng Eng Siong
Lina Yao
DiffM
25
2
0
16 Feb 2024
Diffusion Models for Audio Restoration
Jean-Marie Lemercier
Julius Richter
Simon Welker
Eloi Moliner
Vesa Valimaki
Timo Gerkmann
38
16
0
15 Feb 2024
An Analysis of the Variance of Diffusion-based Speech Enhancement
Bunlong Lay
Timo Gerkmann
DiffM
17
0
0
01 Feb 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
17
3
0
11 Dec 2023
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
30
6
0
07 Dec 2023
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
21
4
0
05 Dec 2023
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
21
24
0
08 Nov 2023
Conditional Diffusion Model for Target Speaker Extraction
Theodor Nguyen
Guangzhi Sun
Xianrui Zheng
Chao Zhang
0031 Philip C. Woodland
DiffM
41
4
0
07 Oct 2023
Unsupervised speech enhancement with diffusion-based generative models
Berné Nortier
Mostafa Sadeghi
Romain Serizel
DiffM
22
7
0
19 Sep 2023
Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay
Jean-Marie Lemercier
Julius Richter
Timo Gerkmann
DiffM
24
9
0
18 Sep 2023
Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Zilu Guo
Jun Du
Chin-Hui Lee
25
0
0
17 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
37
3
0
03 Sep 2023
Target Speech Extraction with Conditional Diffusion Model
Naoyuki Kamo
Marc Delcroix
Tomohiro Nakatan
DiffM
31
16
0
08 Aug 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
DiffM
16
9
0
16 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
29
25
0
07 Jul 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
19
1
0
05 Jul 2023
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
Jean-Marie Lemercier
J. Thiemann
Raphael Koning
Timo Gerkmann
DiffM
31
1
0
22 Jun 2023
Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Jean-Marie Lemercier
Simon Welker
Timo Gerkmann
DiffM
23
5
0
21 Jun 2023
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Zilu Guo
Jun Du
Chin-Hui Lee
Yu Gao
Wen-bo Zhang
DiffM
29
10
0
14 Jun 2023
Audio-Visual Speech Enhancement with Score-Based Generative Models
Julius Richter
Simone Frintrop
Timo Gerkmann
DiffM
18
10
0
02 Jun 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
19
7
0
02 Jun 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
51
4
0
23 May 2023
1
2
Next