ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.06379
  4. Cited By
Dual-path RNN: efficient long sequence modeling for time-domain
  single-channel speech separation

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation

14 October 2019
Yi Luo
Zhuo Chen
Takuya Yoshioka
    AI4TS
ArXivPDFHTML

Papers citing "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation"

50 / 117 papers shown
Title
Semi-supervised Time Domain Target Speaker Extraction with Attention
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
24
7
0
18 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
43
27
0
24 May 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D
  Scenes
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Tianyi Zhou
AI4CE
32
36
0
18 May 2022
Efficient dynamic filter for robust and low computational feature
  extraction
Efficient dynamic filter for robust and low computational feature extraction
Donghyeon Kim
Gwantae Kim
Bokyeung Lee
Jeong-gi Kwak
D. Han
Hanseok Ko
31
3
0
03 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural
  Speech Enhancement
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Andong Li
Shan You
Guochen Yu
C. Zheng
Xiaodong Li
30
26
0
30 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of
  Unsupervised Speech Separation
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
28
6
0
23 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation
  System
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
27
20
0
14 Apr 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
35
31
0
04 Apr 2022
End-to-End Integration of Speech Recognition, Speech Enhancement, and
  Self-Supervised Learning Representation
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Xuankai Chang
Takashi Maekaku
Yuya Fujita
Shinji Watanabe
VLM
54
45
0
01 Apr 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain
  Target Speaker Extraction
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
21
17
0
31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue
Speaker Extraction with Co-Speech Gestures Cue
Zexu Pan
Xinyuan Qian
Haizhou Li
SLR
21
27
0
31 Mar 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Tal Peer
Timo Gerkmann
22
21
0
30 Mar 2022
Single microphone speaker extraction using unified time-frequency
  Siamese-Unet
Single microphone speaker extraction using unified time-frequency Siamese-Unet
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
30
3
0
06 Mar 2022
MANNER: Multi-view Attention Network for Noise Erasure
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
30
48
0
04 Mar 2022
L-SpEx: Localized Target Speaker Extraction
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
Eng Siong Chng
J. Dang
Haizhou Li
30
21
0
21 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech
  Separation
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
32
25
0
26 Jan 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech
  Enhancement
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
23
25
0
11 Dec 2021
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel
  Neural Separation Systems
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
29
14
0
07 Dec 2021
Uformer: A Unet based dilated complex & real dual-path conformer network
  for simultaneous speech enhancement and dereverberation
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
27
49
0
11 Nov 2021
Learning Filterbanks for End-to-End Acoustic Beamforming
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
38
7
0
08 Nov 2021
Hybrid Spectrogram and Waveform Source Separation
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
24
162
0
05 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with
  Recurrent Neural Networks
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
25
10
0
02 Nov 2021
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Jiasong Wu
Qingchun Li
Guanyu Yang
Lei Li
L. Senhadji
H. Shu
21
10
0
30 Oct 2021
SA-SDR: A novel loss function for separation of meeting style data
SA-SDR: A novel loss function for separation of meeting style data
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
29
20
0
29 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
27
7
0
28 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain
  Multichannel Speech Enhancement
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
19
40
0
20 Oct 2021
Progressive Learning for Stabilizing Label Selection in Speech
  Separation with Mapping-based Method
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
38
0
0
20 Oct 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription
  Challenge
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
8
106
0
14 Oct 2021
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Takuya Yoshioka
Xiaofei Wang
Dongmei Wang
M. Tang
Zirun Zhu
Zhuo Chen
Naoyuki Kanda
17
37
0
12 Oct 2021
Location-based training for multi-channel talker-independent speaker
  separation
Location-based training for multi-channel talker-independent speaker separation
H. Taherian
Ke Tan
DeLiang Wang
27
10
0
08 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant
  Sound Source Separation
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation
Ali Aroudi
Stefan Uhlich
M. Font
ViT
27
5
0
08 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
34
41
0
30 Sep 2021
Graph-PIT: Generalized permutation invariant training for continuous
  separation of arbitrary numbers of speakers
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
28
23
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
Speeding Up Permutation Invariant Training for Source Separation
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
16
6
0
30 Jul 2021
Multi-Task Audio Source Separation
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
41
8
0
14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech
  Enhancement
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Xiaohuai Le
Hongsheng Chen
Kai-Jyun Chen
Jing Lu
23
78
0
12 Jul 2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse
  Response Simulation for Sound Event Localization and Detection
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Kazuki Shimada
Naoya Takahashi
Yuichiro Koyama
Shusuke Takahashi
E. Tsunoo
Masafumi Takahashi
Yuki Mitsufuji
30
23
0
21 Jun 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice
  Communication
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
30
1
0
05 Jun 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion
  Network for Speech Enhancement
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
76
96
0
27 Apr 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation
  Training
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training
Shaked Dovrat
Eliya Nachmani
Lior Wolf
VLM
6
21
0
18 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single
  and Multi-talker Speech
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
34
32
0
30 Mar 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement
  in the Time Domain
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
41
165
0
18 Mar 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with
  Improved Numerical Stability and Advanced Frontend
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Wangyou Zhang
Christoph Boeddeker
Shinji Watanabe
Tomohiro Nakatani
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Naoyuki Kamo
Reinhold Haeb-Umbach
Y. Qian
20
32
0
23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on
  Transformer
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
36
21
0
19 Feb 2021
LEAF: A Learnable Frontend for Audio Classification
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
85
144
0
21 Jan 2021
Group Communication with Context Codec for Lightweight Source Separation
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
26
20
0
14 Dec 2020
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation
  and Dereverberation
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation
Zhaoheng Ni
Yong-mei Xu
Meng Yu
Bo Wu
Shi-Xiong Zhang
Dong Yu
Michael I. Mandel
22
8
0
18 Nov 2020
FedSL: Federated Split Learning on Distributed Sequential Data in
  Recurrent Neural Networks
FedSL: Federated Split Learning on Distributed Sequential Data in Recurrent Neural Networks
Ali Abedi
Shehroz S. Khan
FedML
39
53
0
06 Nov 2020
Single channel voice separation for unknown number of speakers under
  reverberant and noisy settings
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
29
29
0
04 Nov 2020
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation,
  Enhancement and Separation
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Yihui Fu
Jian Wu
Yanxin Hu
Mengtao Xing
Lei Xie
20
23
0
04 Nov 2020
Previous
123
Next