ResearchTrend.AI
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention

14 February 2020
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi

Papers citing "Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention"

49 / 49 papers shown
Title
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
53
0
0
24 Feb 2025
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
46
0
0
15 Sep 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
40
2
0
07 Jun 2024
Mamba in Speech: Towards an Alternative to Self-Attention
Xiangyu Zhang
Qiquan Zhang
Hexin Liu
Tianyi Xiao
Xinyuan Qian
Beena Ahmed
E. Ambikairajah
Haizhou Li
Julien Epps
Mamba
54
36
0
21 May 2024
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
53
25
0
10 May 2024
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
24
3
0
04 Sep 2023
Attention-Based Acoustic Feature Fusion Network for Depression Detection
Xiao Xu
Yang Wang
Xinru Wei
Fei Wang
Xizhe Zhang
22
5
0
24 Aug 2023
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation
Ruixin Zheng
Yang Ai
Zhenhua Ling
26
8
0
24 May 2023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
37
19
0
23 Feb 2023
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Zhe Zuo
Changxin Gao
Rong Jin
Nong Sang
25
23
0
09 Jan 2023
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
22
3
0
27 Oct 2022
A Monotonicity Constrained Attention Module for Emotion Classification with Limited EEG Data
Dongyang Kuang
C. Michoski
Wenting Li
R. Guo
15
2
0
17 Aug 2022
A two-stage full-band speech enhancement model with effective spectral compression mapping
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
31
0
0
27 Jun 2022
Adversarial Multi-Task Learning for Disentangling Timbre and Pitch in Singing Voice Synthesis
Tae-Woo Kim
Minguk Kang
Gyeong-Hoon Lee
AAML
22
6
0
23 Jun 2022
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Takaaki Saeki
Shinnosuke Takamichi
Tomohiko Nakamura
Naoko Tanji
Hiroshi Saruwatari
31
6
0
24 Mar 2022
SepTr: Separable Transformer for Audio Spectrogram Processing
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
F. Khan
ViT
18
30
0
17 Mar 2022
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
22
54
0
17 Feb 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
31
35
0
16 Feb 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
21
3
0
24 Jan 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
20
25
0
11 Dec 2021
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
Hsin-Yi Lin
H. Tseng
Xugang Lu
Yu Tsao
OT
14
31
0
11 Nov 2021
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
Ryandhimas E. Zezario
Szu-Wei Fu
Fei Chen
C. Fuh
Hsin-Min Wang
Yu Tsao
DiffM
28
75
0
03 Nov 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
35
84
0
13 Oct 2021
Self-Attention for Audio Super-Resolution
Nathanaël Carraz Rakotonirina
SupR
30
23
0
26 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
Learning to Inference with Early Exit in the Progressive Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
11
5
0
22 Jun 2021
A Flow-Based Neural Network for Time Domain Speech Enhancement
Martin Strauss
B. Edler
15
33
0
16 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments
Yunzhe Hao
Jiaming Xu
Peng Zhang
Bo Xu
17
17
0
13 Jun 2021
Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization
Ashutosh Pandey
DeLiang Wang
17
39
0
26 May 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Aswin Sivaraman
Minje Kim
13
9
0
08 May 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Xiyun Li
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Jiaming Xu
Bo Xu
Dong Yu
14
14
0
17 Apr 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
17
209
0
08 Apr 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
13
9
0
30 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
10
50
0
24 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
21
41
0
03 Feb 2021
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
17
12
0
26 Nov 2020
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
19
4
0
15 Nov 2020
Listening to Sounds of Silence for Speech Denoising
Ruilin Xu
Rundi Wu
Y. Ishiwaka
Carl Vondrick
Changxi Zheng
25
32
0
22 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
21
34
0
22 Oct 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
8
134
0
03 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
20
29
0
02 Sep 2020
Incorporating Broad Phonetic Information for Speech Enhancement
Yen-Ju Lu
Chien-Feng Liao
Xugang Lu
J. Hung
Yu Tsao
15
14
0
13 Aug 2020
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
11
1
0
16 Jul 2020
Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing
Szu-Wei Fu
Chien-Feng Liao
Tsun-An Hsieh
Kuo-Hsuan Hung
Syu-Siang Wang
...
Ryandhimas E. Zezario
You-Jin Li
Shang-Yi Chuang
Yen-Ju Lu
Yu Tsao
19
6
0
18 Jun 2020
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net
Hyeong-Seok Choi
Hoon Heo
Jie Hwan Lee
Kyogu Lee
43
19
0
01 Jun 2020
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
M. Kawanaka
Yuma Koizumi
Ryoichi Miyazaki
Kohei Yatabe
AAML
19
22
0
14 Feb 2020
Real-time speech enhancement using equilibriated RNN
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
N. Harada
12
34
0
14 Feb 2020
Invertible DNN-based nonlinear time-frequency transform for speech enhancement
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
N. Harada
22
10
0
25 Nov 2019