1703.06284
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
18 March 2017
Morten Kolbaek
Dong Yu
Zheng-Hua Tan
Jesper Jensen
Papers citing
"Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks"
50 / 107 papers shown
Title
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
16
23
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
16
6
0
30 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
41
8
0
14 Jul 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
24
1
0
05 Jun 2021
A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Jens Heitkaemper
Joerg Schmalenstroeer
Joerg Ullmann
Valentin Ion
Reinhold Haeb-Umbach
16
3
0
04 Jun 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
30
32
0
30 Mar 2021
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Shota Orihashi
Ryo Masumura
22
8
0
02 Mar 2021
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
21
21
0
19 Feb 2021
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
23
20
0
14 Dec 2020
Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation
Ziye Yang
Shanzheng Guan
Xiao-Lei Zhang
14
14
0
01 Dec 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
9
46
0
11 Nov 2020
Surrogate Source Model Learning for Determined Source Separation
Robin Scheibler
M. Togami
20
22
0
11 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
19
29
0
04 Nov 2020
Recent Developments on ESPnet Toolkit Boosted by Conformer
Pengcheng Guo
Florian Boyer
Xuankai Chang
Tomoki Hayashi
Yosuke Higuchi
...
Jing Shi
Shinji Watanabe
Kun Wei
Wangyou Zhang
Yuekai Zhang
34
262
0
26 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
27
536
0
25 Oct 2020
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Yin Cao
Turab Iqbal
Qiuqiang Kong
Y. Zhong
Wenwu Wang
Mark D. Plumbley
16
75
0
25 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
30
11
0
20 Oct 2020
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
11
9
0
19 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
17
88
0
04 Oct 2020
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
11
20
0
07 Sep 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
6
6
0
01 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
41
20
0
25 Jun 2020
Listen to What You Want: Neural Network-based Universal Sound Selector
Tsubasa Ochiai
Marc Delcroix
Yuma Koizumi
Hiroaki Ito
K. Kinoshita
S. Araki
8
61
0
10 Jun 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
Yuichiro Koyama
Oluwafemi Azeez
Bhiksha Raj
22
4
0
23 May 2020
Jointly optimal denoising, dereverberation, and source separation
Tomohiro Nakatani
Christoph Boeddeker
K. Kinoshita
Rintaro Ikeshita
Marc Delcroix
Reinhold Haeb-Umbach
16
46
0
20 May 2020
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
Nils L. Westhausen
B. Meyer
17
99
0
15 May 2020
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
21
66
0
14 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
J. Dang
Haizhou Li
21
141
0
10 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
21
151
0
08 May 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Yi Luo
N. Mesgarani
16
29
0
27 Mar 2020
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
21
97
0
09 Mar 2020
Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system
K. Kinoshita
Marc Delcroix
S. Araki
Tomohiro Nakatani
194
30
0
09 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
20
175
0
29 Feb 2020
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
12
49
0
24 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
27
261
0
20 Feb 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
12
125
0
14 Feb 2020
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization
Jeroen Zegers
Hugo Van hamme
BDL
26
7
0
19 Dec 2019
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Yi Luo
Zhuo Chen
N. Mesgarani
Takuya Yoshioka
11
178
0
30 Oct 2019
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
33
22
0
28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
David Ditter
Timo Gerkmann
9
57
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
18
69
0
23 Oct 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation
Matthew Maciejewski
G. Wichern
E. McQuinn
Jonathan Le Roux
6
179
0
22 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
60
125
0
03 Sep 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
34
80
0
17 May 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
14
35
0
09 May 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
11
184
0
08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
27
157
0
25 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
20
114
0
07 Apr 2019
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss
Chenglin Xu
Wei Rao
Chng Eng Siong
Haizhou Li
29
53
0
24 Mar 2019