2309.17384
Toward Universal Speech Enhancement for Diverse Input Conditions
29 September 2023
Wangyou Zhang
Kohei Saijo
Zhong-Qiu Wang
Shinji Watanabe
Yanmin Qian
VLM
Papers citing "Toward Universal Speech Enhancement for Diverse Input Conditions"
16 / 16 papers shown
Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
54
0
0
04 Jun 2025
TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network
Xiaobin Rong
Dahan Wang
Qinwen Hu
Yushi Wang
Yuxiang Hu
Jing Lu
21
0
0
24 May 2025
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Ziling Huang
Haixin Guan
Yanhua Long
96
0
0
18 May 2025
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
85
1
0
28 Apr 2025
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
Junan Zhang
Jing Yang
Zihao Fang
Yansen Wang
Zehua Zhang
Zhuo Wang
Fan Fan
Zhikai Wu
DiffM
143
4
0
26 Jan 2025
Task-Aware Unified Source Separation
Kohei Saijo
Janek Ebbers
François Germain
Gordon Wichern
Jonathan Le Roux
77
2
0
31 Oct 2024
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Wen Huang
Bing Han
Zhengyang Chen
Shuai Wang
Yanmin Qian
VLM
SSL
52
0
0
22 Oct 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
104
1
0
15 Sep 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
Gordon Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
67
9
0
06 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement
Zhong-Qiu Wang
62
1
0
28 Jul 2024
Improving Real-Time Music Accompaniment Separation with MMDenseNet
Chun-Hsiang Wang
Chung-Che Wang
Jun Wang
Jyh-Shing Roger Jang
Yen-Hsun Chu
82
0
0
30 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
90
20
0
07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
Wangyou Zhang
Kohei Saijo
Jee-weon Jung
Chenda Li
Shinji Watanabe
Yanmin Qian
74
7
0
06 Jun 2024
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition
Yihan Wu
Soumi Maiti
Yifan Peng
Wangyou Zhang
Chenda Li
Yuyue Wang
Xihua Wang
Shinji Watanabe
Ruihua Song
80
4
0
31 Jan 2024
Improving Design of Input Condition Invariant Speech Enhancement
Wangyou Zhang
Jee-weon Jung
Shinji Watanabe
Yanmin Qian
AAML
47
4
0
25 Jan 2024
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
77
11
0
31 May 2023