ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.17287
  4. Cited By
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech
  Enhancement

Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement

27 October 2022
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
    DiffM
ArXivPDFHTML

Papers citing "Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement"

31 / 31 papers shown
Title
Diffusion-based Generative Speech Source Separation
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
80
47
0
31 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative
  Models
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
61
200
0
11 Aug 2022
Universal Speech Enhancement with Score-based Diffusion
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
71
105
0
07 Jun 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
42
57
0
12 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
56
116
0
31 Mar 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
43
189
0
10 Feb 2022
Denoising Diffusion Restoration Models
Denoising Diffusion Restoration Models
Bahjat Kawar
Michael Elad
Stefano Ermon
Jiaming Song
DiffM
262
823
0
27 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the
  Cocktail Party Problem
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
50
19
0
17 Dec 2021
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric
  to Evaluate Noise Suppressors
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
73
215
0
05 Oct 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Yen-Ju Lu
Yu Tsao
Shinji Watanabe
DiffM
30
74
0
25 Jul 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
181
7,765
0
11 May 2021
Restoring degraded speech via a modified diffusion model
Restoring degraded speech via a modified diffusion model
Jianwei Zhang
Suren Jayasuriya
Visar Berisha
DiffM
38
21
0
22 Apr 2021
Improved Denoising Diffusion Probabilistic Models
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
289
3,648
0
18 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net
Hyeong-Seok Choi
Sungjin Park
Jie Hwan Lee
Hoon Heo
Dongsuk Jeon
Kyogu Lee
57
57
0
05 Feb 2021
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time
  Single-Channel Speech Enhancement
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
61
196
0
29 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern
  Recognition and Self-Supervised Models
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
46
34
0
22 Oct 2020
Denoising Diffusion Implicit Models
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
213
7,294
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
498
17,888
0
19 Jun 2020
Improved Techniques for Training Score-Based Generative Models
Improved Techniques for Training Score-Based Generative Models
Yang Song
Stefano Ermon
DiffM
201
1,145
0
16 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
331
10,591
0
17 Feb 2020
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores
  Optimization for Speech Enhancement
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
43
331
0
13 May 2019
Non-intrusive speech quality assessment using neural networks
Non-intrusive speech quality assessment using neural networks
Anderson R. Avila
H. Gamper
Chandan K. A. Reddy
Ross Cutler
I. Tashev
J. Gehrke
33
110
0
16 Mar 2019
Phase-aware Speech Enhancement with Deep Complex U-Net
Hyeong-Seok Choi
Jang-Hyun Kim
Jaesung Huh
A. Kim
Jung-Woo Ha
Kyogu Lee
53
331
0
07 Mar 2019
SDR - half-baked or well done?
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
140
1,191
0
06 Nov 2018
DNN-based Source Enhancement to Increase Objective Sound Quality
  Assessment Score
DNN-based Source Enhancement to Increase Objective Sound Quality Assessment Score
Yuma Koizumi
Kenta Niwa
Yusuke Hioka
Kazunori Kobayashi
Y. Haneda
32
63
0
22 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
150
1,783
0
20 Sep 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source
  Separation
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
128
595
0
08 Jun 2018
Building state-of-the-art distant speech recognition using the CHiME-4
  challenge with a setup of speech enhancement baseline
Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline
Szu-Jui Chen
Aswin Shanmugam Subramanian
Hainan Xu
Shinji Watanabe
31
76
0
27 Mar 2018
SEGAN: Speech Enhancement Generative Adversarial Network
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual
Antonio Bonafonte
Joan Serrà
GAN
76
1,146
0
28 Mar 2017
Variational Inference with Normalizing Flows
Variational Inference with Normalizing Flows
Danilo Jimenez Rezende
S. Mohamed
DRL
BDL
294
4,167
0
21 May 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.5K
149,842
0
22 Dec 2014
1