Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.07454
Cited By
v1
v2
v3 (latest)
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 773 papers shown
Title
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
SSL
50
17
0
12 Feb 2021
Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform
Qinglong Li
Fei Gao
Haixing Guan
Kaichi Ma
57
24
0
09 Feb 2021
Speaker and Direction Inferred Dual-channel Speech Separation
Chenxing Li
Jiaming Xu
N. Mesgarani
Bo Xu
38
8
0
08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
44
13
0
07 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
51
43
0
03 Feb 2021
Multimodal Attention Fusion for Target Speaker Extraction
Hiroshi Sato
Tsubasa Ochiai
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
S. Araki
40
29
0
02 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
386
336
0
24 Jan 2021
Towards efficient models for real-time deep noise suppression
Sebastian Braun
H. Gamper
Chandan K. A. Reddy
I. Tashev
79
111
0
22 Jan 2021
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
131
148
0
21 Jan 2021
A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
N. Ito
Rintaro Ikeshita
H. Sawada
Tomohiro Nakatani
35
26
0
21 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
89
29
0
13 Jan 2021
Neural Network-based Virtual Microphone Estimator
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
S. Araki
36
10
0
12 Jan 2021
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
Yong-mei Xu
Z. Zhang
Meng Yu
Shi-Xiong Zhang
Dong Yu
35
1
0
04 Jan 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
38
29
0
24 Dec 2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
108
38
0
23 Dec 2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
58
8
0
17 Dec 2020
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
87
20
0
14 Dec 2020
Towards speech enhancement using a variational U-Net architecture
E. J. Nustede
Jörn Anemüller
46
1
0
07 Dec 2020
Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation
Ziye Yang
Shanzheng Guan
Xiao-Lei Zhang
37
14
0
01 Dec 2020
Audio-visual Speech Separation with Adversarially Disentangled Visual Representation
Peng Zhang
Jiaming Xu
Jing Shi
Yunzhe Hao
Bo Xu
377
5
0
29 Nov 2020
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
38
4
0
29 Nov 2020
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
61
12
0
26 Nov 2020
Speech Denoising with Auditory Models
Mark R. Saddler
Andrew Francl
J. Feather
Kaizhi Qian
Yang Zhang
Josh H. McDermott
22
6
0
21 Nov 2020
One Shot Learning for Speech Separation
Yuan-Kuei Wu
Kuan-Po Huang
Yu Tsao
Hung-yi Lee
VLM
68
8
0
20 Nov 2020
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
Jianwu Dang
Haizhou Li
46
44
0
19 Nov 2020
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation
Zhaoheng Ni
Yong-mei Xu
Meng Yu
Bo Wu
Shi-Xiong Zhang
Dong Yu
Michael I. Mandel
53
9
0
18 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
36
10
0
17 Nov 2020
Ultra-Lightweight Speech Separation via Group Communication
Yi Luo
Cong Han
N. Mesgarani
VLM
80
30
0
17 Nov 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
76
46
0
11 Nov 2020
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop
M. Marge
C. Espy-Wilson
Roger K. Moore
77
79
0
11 Nov 2020
Informed Source Extraction With Application to Acoustic Echo Reduction
Mohamed Elminshawi
Wolfgang Mack
Emanuel Habets
50
2
0
09 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
95
82
0
07 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
72
29
0
04 Nov 2020
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Yihui Fu
Jian Wu
Yanxin Hu
Mengtao Xing
Lei Xie
72
24
0
04 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
63
88
0
03 Nov 2020
Two Heads Are Better Than One: A Two-Stage Approach for Monaural Noise Reduction in the Complex Domain
Andong Li
C. Zheng
Renhua Peng
Xiaodong Li
78
10
0
03 Nov 2020
What's All the FUSS About Free Universal Sound Separation Data?
Scott Wisdom
Hakan Erdogan
D. Ellis
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
Justin Salamon
Prem Seetharaman
J. Hershey
98
82
0
02 Nov 2020
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
80
201
0
29 Oct 2020
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang
Shun-Po Chuang
Da-Rong Liu
Yi-Chen Chen
Gene-Ping Yang
Hung-yi Lee
SSL
92
14
0
29 Oct 2020
Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
Efthymios Tzinis
Dimitrios Bralios
Paris Smaragdis
89
1
0
25 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
109
567
0
25 Oct 2020
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
31
3
0
25 Oct 2020
A Study of Transfer Learning in Music Source Separation
Andreas Bugler
Bryan Pardo
Prem Seetharaman
45
3
0
23 Oct 2020
Speech enhancement aided end-to-end multi-task learning for voice activity detection
Xu Tan
Xiao-Lei Zhang
83
33
0
23 Oct 2020
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Takuya Yoshioka
Shujie Liu
Jinyu Li
94
26
0
23 Oct 2020
Listening to Sounds of Silence for Speech Denoising
Ruilin Xu
Rundi Wu
Y. Ishiwaka
Carl Vondrick
Changxi Zheng
66
33
0
22 Oct 2020
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision
Yun-Ning Hung
Gordon Wichern
Jonathan Le Roux
63
12
0
22 Oct 2020
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Hideyuki Tachibana
70
14
0
22 Oct 2020
DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation
Ali Aroudi
Sebastian Braun
50
7
0
22 Oct 2020
BERT for Joint Multichannel Speech Dereverberation with Spatial-aware Tasks
Yang Jiao
29
0
0
21 Oct 2020
Previous
1
2
3
...
12
13
14
15
16
Next