ResearchTrend.AI
Attention is All You Need in Speech Separation

25 October 2020
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong

Papers citing "Attention is All You Need in Speech Separation"

50 / 219 papers shown
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
29
5
0
15 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
J. Yip
Dianwen Ng
Bin Ma
Chng Eng Siong
23
0
0
14 Sep 2023
Spiking Structured State Space Model for Monaural Speech Enhancement
Yu Du
Xu Liu
Yansong Chua
19
15
0
07 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
32
5
0
05 Sep 2023
ReZero: Region-customizable Sound Extraction
Rongzhi Gu
Yi Luo
30
12
0
31 Aug 2023
Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders
Matthew B. Webster
Joonnyong Lee
24
1
0
31 Aug 2023
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation
Kai Li
Run Yang
Fuchun Sun
Xiaolin Hu
29
5
0
16 Aug 2023
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio
Yang Zhang
Krishna C. Puvvada
Vitaly Lavrukhin
Boris Ginsburg
32
14
0
09 Aug 2023
Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Sankalpa Rijal
Rajan Neupane
Saroj Prasad Mainali
Shishir K. Regmi
Shanta Maharjan
19
0
0
29 Jul 2023
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Yoshiki Masuyama
Xuankai Chang
Wangyou Zhang
Samuele Cornell
Zhongqiu Wang
Nobutaka Ono
Y. Qian
Shinji Watanabe
28
6
0
23 Jul 2023
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Jiuxin Lin
X. Cai
Heinrich Dinkel
Jun Chen
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Zhiyong Wu
Yujun Wang
Helen M. Meng
22
21
0
25 Jun 2023
Mixture Encoder for Joint Speech Separation and Recognition
Simon Berger
Peter Vieting
Christoph Boeddeker
Ralf Schlüter
Reinhold Häb-Umbach
16
6
0
21 Jun 2023
A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks
Saidul Islam
Hanae Elmekki
Ahmed Elsebai
Jamal Bentahar
Najat Drawel
Gaith Rjoub
Witold Pedrycz
ViT
MedIm
24
171
0
11 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
22
1
0
09 Jun 2023
RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain
Sangeet Sagar
Mirco Ravanelli
B. Kiefer
Ivana Kruijff Korbayova
Josef van Genabith
19
1
0
06 Jun 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
H. Martel
Julius Richter
Kai Li
Xiaolin Hu
Timo Gerkmann
VLM
19
9
0
31 May 2023
Adaptive Sparsity Level during Training for Efficient Time Series Forecasting with Transformers
Zahra Atashgahi
Mykola Pechenizkiy
Raymond N. J. Veldhuis
D. Mocanu
AI4TS
AI4CE
26
1
0
28 May 2023
A Neural State-Space Model Approach to Efficient Speech Separation
Chen Chen
Chao-Han Huck Yang
Kai Li
Yuchen Hu
Pin-Jui Ku
Chng Eng Siong
34
11
0
26 May 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
16
0
0
25 May 2023
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM
MGen
34
49
0
25 May 2023
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
J. Yip
Tuan Truong
Dianwen Ng
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
Shengkui Zhao
Chng Eng Siong
Bin Ma
17
2
0
20 May 2023
Noise-Aware Speech Separation with Contrastive Learning
Zizheng Zhang
Cheng Chen
Hsin-Hung Chen
Xiang Liu
Yuchen Hu
E. Chng
23
5
0
18 May 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
24
0
0
18 May 2023
Ripple sparse self-attention for monaural speech enhancement
Qiquan Zhang
Hongxu Zhu
Qi Song
Xinyuan Qian
Zhaoheng Ni
Haizhou Li
14
5
0
15 May 2023
Diffusion-based Signal Refiner for Speech Separation
M. Hirano
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
DiffM
33
4
0
10 May 2023
AudioSlots: A slot-centric generative model for audio separation
P. Reddy
Scott Wisdom
Klaus Greff
J. Hershey
Thomas Kipf
OCL
VLM
20
6
0
09 May 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
15
10
0
18 Apr 2023
Fast Random Approximation of Multi-channel Room Impulse Response
Yi Luo
Rongzhi Gu
10
4
0
17 Apr 2023
On Data Sampling Strategies for Training Neural Network Speech Separation Models
William Ravenscroft
Stefan Goetze
Thomas Hain
VLM
11
6
0
14 Apr 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments
Julian Neri
Sebastian Braun
9
1
0
14 Mar 2023
A two-stage speaker extraction algorithm under adverse acoustic conditions using a single-microphone
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
8
2
0
13 Mar 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
19
5
0
07 Mar 2023
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments
Zhaoxi Mu
Xinyu Yang
Xiangyuan Yang
Wenjing Zhu
13
5
0
07 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
Shengkui Zhao
Bin Ma
28
52
0
23 Feb 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Yuchen Hu
Chen Chen
Heqing Zou
Xionghu Zhong
Chng Eng Siong
47
16
0
22 Feb 2023
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Shuo Wang
Xiangyu Kong
Xiulian Peng
H. Movassagh
Vinod Prakash
Yan Lu
26
11
0
21 Feb 2023
RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness
Heitor R. Guimarães
Arthur Pimentel
Anderson R. Avila
Mehdi Rezagholizadeh
Boxing Chen
Tiago H. Falk
120
10
0
18 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
78
0
0
18 Feb 2023
Local spectral attention for full-band speech enhancement
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
28
0
0
11 Feb 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
36
14
0
25 Jan 2023
Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings
Kai Liu
Xucheng Wan
Z.C. Du
Huan Zhou
VLM
27
1
0
16 Jan 2023
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
29
14
0
21 Dec 2022
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
22
24
0
16 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
27
25
0
15 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
30
0
0
14 Dec 2022
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj
Daniel Povey
Sanjeev Khudanpur
20
34
0
10 Dec 2022
NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer
Changsheng Quan
Xiaofei Li
30
2
0
05 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
29
119
0
22 Nov 2022