ResearchTrend.AI
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

16 April 2019
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee

Papers citing "Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering"

8 / 8 papers shown
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
56
22
0
28 Oct 2019
Phase-aware Speech Enhancement with Deep Complex U-Net
Hyeong-Seok Choi
Jang-Hyun Kim
Jaesung Huh
A. Kim
Jung-Woo Ha
Kyogu Lee
53
331
0
07 Mar 2019
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
150
1,783
0
20 Sep 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
136
712
0
29 Jul 2018
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
117
2,273
0
26 Jun 2017
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual
Antonio Bonafonte
Joan Serrà
GAN
76
1,146
0
28 Mar 2017
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
362
7,381
0
12 Sep 2016
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
Yajie Miao
M. Gowayyed
Florian Metze
95
753
0
29 Jul 2015