ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.05841
  4. Cited By
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration

12 April 2022
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
ArXivPDFHTML

Papers citing "VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration"

37 / 37 papers shown
Title
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Ziling Huang
Haixin Guan
Yanhua Long
2
0
0
18 May 2025
ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
Wataru Nakata
Yuma Koizumi
Shigeki Karita
Robin Scheibler
Haruko Ishikawa
Adriana Guevara-Rukoz
Heiga Zen
M. Bacchiani
50
0
0
08 May 2025
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
Shigeki Karita
Yuma Koizumi
Heiga Zen
Haruko Ishikawa
Robin Scheibler
M. Bacchiani
VLM
196
1
0
07 May 2025
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
Da-Hee Yang
Jaeuk Lee
Joon-Hyuk Chang
VLM
AI4CE
33
0
0
03 May 2025
How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios
How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios
Satvik Venkatesh
Philip Coleman
Arthur Benilov
Simon Brown
Selim Sheta
Frederic Roskam
27
0
0
02 May 2025
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Boyi Kang
Xinfa Zhu
Zihan Zhang
Zhen Ye
Mingshuai Liu
...
Jun Chen
Longshuai Xiao
Chao Weng
Wei Xue
Lei Xie
AuLLM
55
3
0
01 Mar 2025
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
Nian Shao
Rui Zhou
Pengyu Wang
Xian Li
Ying Fang
Yujie Yang
Xiaofei Li
41
0
0
27 Feb 2025
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
Junan Zhang
Jing Yang
Zihao Fang
Yansen Wang
Zehua Zhang
Zhuo Wang
Fan Fan
Zhikai Wu
41
3
0
26 Jan 2025
Enhancing Crowdsourced Audio for Text-to-Speech Models
Enhancing Crowdsourced Audio for Text-to-Speech Models
José Giraldo
Martí Llopart-Font
Alex Peiró-Lilja
Carme Armentano-Oller
Gerard Sant
Baybars Külebi
DiffM
26
0
0
17 Oct 2024
FINALLY: fast and universal speech enhancement with studio-like quality
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
29
2
0
08 Oct 2024
High-Resolution Speech Restoration with Latent Diffusion Model
High-Resolution Speech Restoration with Latent Diffusion Model
Tushar Dhyani
Florian Lux
Michele Mancusi
Giorgio Fabbro
Fritz Hohl
Ngoc Thang Vu
DiffM
37
0
0
17 Sep 2024
DM: Dual-path Magnitude Network for General Speech Restoration
DM: Dual-path Magnitude Network for General Speech Restoration
Da-Hee Yang
Dail Kim
Joon-Hyuk Chang
Jeonghwan Choi
Han-gil Moon
18
0
0
13 Sep 2024
VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and
  Voice Conversion
VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion
Kyungguen Byun
Jason Filos
Erik Visser
Sunkuk Moon
34
0
0
10 Sep 2024
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
Yuan Fang
Jinglin Bai
Jiajie Wang
Xueliang Zhang
25
0
0
09 Sep 2024
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Bingsong Bai
Fengping Wang
Yingming Gao
Ya Li
51
0
0
09 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For
  Speech Enhancement
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
37
7
0
07 Jun 2024
MaskSR: Masked Language Model for Full-band Speech Restoration
MaskSR: Masked Language Model for Full-band Speech Restoration
Xu Li
Qirui Wang
Xiaoyu Liu
47
8
0
04 Jun 2024
Self-Supervised Speech Quality Estimation and Enhancement Using Only
  Clean Speech
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
Szu-Wei Fu
Kuo-Hsuan Hung
Yu Tsao
Yu-Chiang Frank Wang
SSL
19
11
0
26 Feb 2024
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Jae-Yeol Im
Juhan Nam
DiffM
20
3
0
16 Jan 2024
AV-RIR: Audio-Visual Room Impulse Response Estimation
AV-RIR: Audio-Visual Room Impulse Response Estimation
Anton Ratnarajah
Sreyan Ghosh
Sonal Kumar
Purva Chiniya
Dinesh Manocha
41
14
0
30 Nov 2023
The IMS Toucan System for the Blizzard Challenge 2023
The IMS Toucan System for the Blizzard Challenge 2023
Florian Lux
Julia Koch
Sarina Meyer
Thomas Bott
Nadja Schauffler
Pavel Denisov
Antje Schweitzer
Ngoc Thang Vu
24
6
0
26 Oct 2023
Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and
  Textually Described Voices
Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Matthew Baas
Herman Kamper
23
3
0
12 Oct 2023
Super Denoise Net: Speech Super Resolution with Noise Cancellation in
  Low Sampling Rate Noisy Environments
Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments
Junkang Yang
Hongqing Liu
Lu Gan
Yi Zhou
12
1
0
09 Oct 2023
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained
  Generative Methods for Speech Enhancement in Adverse Conditions
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions
Heming Wang
Meng Yu
Huan Zhang
Chunlei Zhang
Zhongweiyang Xu
Muqiao Yang
Yixuan Zhang
Dong Yu
34
3
0
16 Sep 2023
Sparks of Large Audio Models: A Survey and Outlook
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Min Zhang
Björn W. Schuller
LM&MA
AuLLM
33
38
0
24 Aug 2023
AdVerb: Visually Guided Audio Dereverberation
AdVerb: Visually Guided Audio Dereverberation
Sanjoy Chowdhury
Sreyan Ghosh
Subhrajyoti Dasgupta
Anton Ratnarajah
Utkarsh Tyagi
Tianyi Zhou
30
11
0
23 Aug 2023
WavJourney: Compositional Audio Creation with Large Language Models
WavJourney: Compositional Audio Creation with Large Language Models
Xubo Liu
Zhongkai Zhu
Haohe Liu
Yiitan Yuan
Meng Cui
...
Jinhua Liang
Yin Cao
Qiuqiang Kong
Mark D. Plumbley
Wenwu Wang
AuLLM
29
25
0
26 Jul 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
21
7
0
02 Jun 2023
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
M. Bacchiani
Yu Zhang
Wei Han
Ankur Bapna
43
66
0
30 May 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
30
0
0
18 May 2023
Extending Audio Masked Autoencoders Toward Audio Restoration
Extending Audio Masked Autoencoders Toward Audio Restoration
Zhi-Wei Zhong
Hao Shi
M. Hirano
Kazuki Shimada
Kazuya Tateishi
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
32
4
0
11 May 2023
Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement
  Challenge
Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge
Mingshuai Liu
Shubo Lv
Zihan Zhang
Ru Han
Xiang Hao
Xianjun Xia
Li Chen
Yijian Xiao
Linfu Xie
13
6
0
14 Mar 2023
Guided Speech Enhancement Network
Guided Speech Enhancement Network
Yang Yang
Shao-fu Shih
Hakan Erdogan
J. Lin
C. Lee
Yunpeng Li
George Sung
Matthias Grundmann
33
6
0
13 Mar 2023
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
30
21
0
01 Dec 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech
  Enhancement
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
34
15
0
27 Oct 2022
HiFi++: a Unified Framework for Bandwidth Extension and Speech
  Enhancement
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement
Pavel Andreev
Aibek Alanov
Oleg Ivanov
Dmitry Vetrov
36
38
0
24 Mar 2022
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music
  Source Separation
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
118
96
0
12 Sep 2021
1