Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks

18 March 2017
Morten Kolbaek
Dong Yu
Zheng-Hua Tan
Jesper Jensen

Papers citing "Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks"

13 / 113 papers shown
Title
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
19
35
0
09 May 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
11
184
0
08 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
27
157
0
25 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
20
114
0
07 Apr 2019
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss
Chenglin Xu
Wei Rao
Chng Eng Siong
Haizhou Li
34
53
0
24 Mar 2019
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation
Ziqiang Shi
Huibin Lin
L. Liu
Rujie Liu
Shoji Hayakawa
Shouji Harada
Jiqing Han
17
22
0
02 Feb 2019
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
50
95
0
22 Nov 2018
Trainable Adaptive Window Switching for Speech Enhancement
Yuma Koizumi
N. Harada
Y. Haneda
16
8
0
05 Nov 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
22
81
0
08 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
G. Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
16
76
0
02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
19
1,748
0
20 Sep 2018
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
22
271
0
12 Sep 2017