Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.03737
Cited By
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
7 March 2023
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning"
20 / 20 papers shown
Title
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation
Zhaoxi Mu
Xinyu Yang
Gang Wang
AuLLM
KELM
VLM
136
1
0
06 May 2025
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
83
100
0
28 Mar 2022
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
AI4TS
114
49
0
01 Mar 2021
FcaNet: Frequency Channel Attention Networks
Zequn Qin
Pengyi Zhang
Leilei Gan
Xi Li
98
712
0
22 Dec 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
97
565
0
25 Oct 2020
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
68
130
0
13 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
110
288
0
28 Jul 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,164
0
16 May 2020
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques
Jenthe Thienpondt
Kris Demuynck
78
1,346
0
14 May 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
84
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
108
265
0
20 Feb 2020
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
100
775
0
14 Oct 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
Cunhang Fan
B. Liu
J. Tao
Jiangyan Yi
Zhengqi Wen
38
22
0
23 Jul 2019
WHAM!: Extending Speech Separation to Noisy Environments
Gordon Wichern
J. Antognini
Michael Flynn
Licheng Richard Zhu
E. McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
86
354
0
02 Jul 2019
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,205
0
06 Nov 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
175
1,796
0
20 Sep 2018
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
358
2,287
0
14 Jun 2018
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
427
26,605
0
05 Sep 2017
Multi-scale Multi-band DenseNets for Audio Source Separation
Naoya Takahashi
Yuki Mitsufuji
61
152
0
29 Jun 2017
Deep clustering: Discriminative embeddings for segmentation and separation
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
64
1,321
0
18 Aug 2015
1