Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement

27 October 2022

Papers citing "Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement"

31 / 31 papers shown

Title
Diffusion-based Generative Speech Source Separation Robin Scheibler Youna Ji Soo-Whan Chung J. Byun Soyeon Choe Min-Seok Choi DiffM 80 47 0 31 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models Julius Richter Simon Welker Jean-Marie Lemercier Bunlong Lay Timo Gerkmann DiffM 61 200 0 11 Aug 2022
Universal Speech Enhancement with Score-based Diffusion Joan Serrà Santiago Pascual Jordi Pons R. O. Araz D. Scaini DiffM 71 105 0 07 Jun 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration Haohe Liu Xubo Liu Qiuqiang Kong Qiao Tian Yan Zhao DeLiang Wang Chuanzeng Huang Yuxuan Wang 42 57 0 12 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain Simon Welker Julius Richter Timo Gerkmann DiffM 56 116 0 31 Mar 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement Yen-Ju Lu Zhongqiu Wang Shinji Watanabe Alexander Richard Cheng Yu Yu Tsao DiffM 43 189 0 10 Feb 2022
Denoising Diffusion Restoration Models Bahjat Kawar Michael Elad Stefano Ermon Jiaming Song DiffM 262 823 0 27 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem Jing Shi Xuankai Chang Tomoki Hayashi Yen-Ju Lu Shinji Watanabe Bo Xu 50 19 0 17 Dec 2021
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors Chandan K. A. Reddy Vishak Gopal Ross Cutler 73 215 0 05 Oct 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model Yen-Ju Lu Yu Tsao Shinji Watanabe DiffM 30 74 0 25 Jul 2021
Diffusion Models Beat GANs on Image Synthesis Prafulla Dhariwal Alex Nichol 181 7,765 0 11 May 2021
Restoring degraded speech via a modified diffusion model Jianwei Zhang Suren Jayasuriya Visar Berisha DiffM 38 21 0 22 Apr 2021
Improved Denoising Diffusion Probabilistic Models Alex Nichol Prafulla Dhariwal DiffM 289 3,648 0 18 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net Hyeong-Seok Choi Sungjin Park Jie Hwan Lee Hoon Heo Dongsuk Jeon Kyogu Lee 57 57 0 05 Feb 2021
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement Xiang Hao Xiangdong Su Radu Horaud Xiaofei Li 61 196 0 29 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models Saurabh Kataria Jesús Villalba Najim Dehak VLM SSL 46 34 0 22 Oct 2020
Denoising Diffusion Implicit Models Jiaming Song Chenlin Meng Stefano Ermon VLM DiffM 213 7,294 0 06 Oct 2020
Denoising Diffusion Probabilistic Models Jonathan Ho Ajay Jain Pieter Abbeel DiffM 498 17,888 0 19 Jun 2020
Improved Techniques for Training Score-Based Generative Models Yang Song Stefano Ermon DiffM 201 1,145 0 16 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes Romain Lopez Pierre Boyeau Nir Yosef Michael I. Jordan Jeffrey Regier BDL 331 10,591 0 17 Feb 2020
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement Szu-Wei Fu Chien-Feng Liao Yu Tsao Shou-De Lin 43 331 0 13 May 2019
Non-intrusive speech quality assessment using neural networks Anderson R. Avila H. Gamper Chandan K. A. Reddy Ross Cutler I. Tashev J. Gehrke 33 110 0 16 Mar 2019
Phase-aware Speech Enhancement with Deep Complex U-Net Hyeong-Seok Choi Jang-Hyun Kim Jaesung Huh A. Kim Jung-Woo Ha Kyogu Lee 53 331 0 07 Mar 2019
SDR - half-baked or well done? F. Sánchez-Martínez M. Esplà-Gomis Hakan Erdogan J. Hershey 140 1,191 0 06 Nov 2018
DNN-based Source Enhancement to Increase Objective Sound Quality Assessment Score Yuma Koizumi Kenta Niwa Yusuke Hioka Kazunori Kobayashi Y. Haneda 32 63 0 22 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Yi Luo N. Mesgarani 150 1,783 0 20 Sep 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation Daniel Stoller Sebastian Ewert S. Dixon AI4TS 128 595 0 08 Jun 2018
Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline Szu-Jui Chen Aswin Shanmugam Subramanian Hainan Xu Shinji Watanabe 31 76 0 27 Mar 2018
SEGAN: Speech Enhancement Generative Adversarial Network Santiago Pascual Antonio Bonafonte Joan Serrà GAN 76 1,146 0 28 Mar 2017
Variational Inference with Normalizing Flows Danilo Jimenez Rezende S. Mohamed DRL BDL 294 4,167 0 21 May 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.5K 149,842 0 22 Dec 2014