Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.05873
Cited By
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
14 February 2020
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention"
49 / 49 papers shown
Title
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
53
0
0
24 Feb 2025
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
46
0
0
15 Sep 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
40
2
0
07 Jun 2024
Mamba in Speech: Towards an Alternative to Self-Attention
Xiangyu Zhang
Qiquan Zhang
Hexin Liu
Tianyi Xiao
Xinyuan Qian
Beena Ahmed
E. Ambikairajah
Haizhou Li
Julien Epps
Mamba
54
36
0
21 May 2024
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
25
0
10 May 2024
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
24
3
0
04 Sep 2023
Attention-Based Acoustic Feature Fusion Network for Depression Detection
Xiao Xu
Yang Wang
Xinru Wei
Fei Wang
Xizhe Zhang
22
5
0
24 Aug 2023
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation
Ruixin Zheng
Yang Ai
Zhenhua Ling
26
8
0
24 May 2023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
37
19
0
23 Feb 2023
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Zhe Zuo
Changxin Gao
Rong Jin
Nong Sang
25
23
0
09 Jan 2023
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
22
3
0
27 Oct 2022
A Monotonicity Constrained Attention Module for Emotion Classification with Limited EEG Data
Dongyang Kuang
C. Michoski
Wenting Li
R. Guo
15
2
0
17 Aug 2022
A two-stage full-band speech enhancement model with effective spectral compression mapping
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
31
0
0
27 Jun 2022
Adversarial Multi-Task Learning for Disentangling Timbre and Pitch in Singing Voice Synthesis
Tae-Woo Kim
Minguk Kang
Gyeong-Hoon Lee
AAML
22
6
0
23 Jun 2022
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Takaaki Saeki
Shinnosuke Takamichi
Tomohiko Nakamura
Naoko Tanji
Hiroshi Saruwatari
31
6
0
24 Mar 2022
SepTr: Separable Transformer for Audio Spectrogram Processing
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
F. Khan
ViT
18
30
0
17 Mar 2022
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
22
54
0
17 Feb 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
31
35
0
16 Feb 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
21
3
0
24 Jan 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
20
25
0
11 Dec 2021
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
Hsin-Yi Lin
H. Tseng
Xugang Lu
Yu Tsao
OT
14
31
0
11 Nov 2021
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
28
75
0
03 Nov 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
35
84
0
13 Oct 2021
Self-Attention for Audio Super-Resolution
Nathanaël Carraz Rakotonirina
SupR
30
23
0
26 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
Learning to Inference with Early Exit in the Progressive Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
11
5
0
22 Jun 2021
A Flow-Based Neural Network for Time Domain Speech Enhancement
Martin Strauss
B. Edler
15
33
0
16 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments
Yunzhe Hao
Jiaming Xu
Peng Zhang
Bo Xu
17
17
0
13 Jun 2021
Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization
Ashutosh Pandey
DeLiang Wang
17
39
0
26 May 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Aswin Sivaraman
Minje Kim
13
9
0
08 May 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Xiyun Li
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Jiaming Xu
Bo Xu
Dong Yu
14
14
0
17 Apr 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
17
209
0
08 Apr 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
13
9
0
30 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
10
50
0
24 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
21
41
0
03 Feb 2021
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
17
12
0
26 Nov 2020
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
19
4
0
15 Nov 2020
Listening to Sounds of Silence for Speech Denoising
Ruilin Xu
Rundi Wu
Y. Ishiwaka
Carl Vondrick
Changxi Zheng
25
32
0
22 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
21
34
0
22 Oct 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
8
134
0
03 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
20
29
0
02 Sep 2020
Incorporating Broad Phonetic Information for Speech Enhancement
Yen-Ju Lu
Chien-Feng Liao
Xugang Lu
J. Hung
Yu Tsao
15
14
0
13 Aug 2020
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
11
1
0
16 Jul 2020
Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing
Szu-Wei Fu
Chien-Feng Liao
Tsun-An Hsieh
Kuo-Hsuan Hung
Syu-Siang Wang
...
Ryandhimas E. Zezario
You-Jin Li
Shang-Yi Chuang
Yen-Ju Lu
Yu Tsao
19
6
0
18 Jun 2020
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net
Hyeong-Seok Choi
Hoon Heo
Jie Hwan Lee
Kyogu Lee
43
19
0
01 Jun 2020
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
M. Kawanaka
Yuma Koizumi
Ryoichi Miyazaki
Kohei Yatabe
AAML
19
22
0
14 Feb 2020
Real-time speech enhancement using equilibriated RNN
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
N. Harada
12
34
0
14 Feb 2020
Invertible DNN-based nonlinear time-frequency transform for speech enhancement
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
N. Harada
22
10
0
25 Nov 2019
1