1809.07454
Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 754 papers shown
Title
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
27
71
0
11 Aug 2021
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
63
36
0
06 Aug 2021
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
S. Araki
32
19
0
04 Aug 2021
A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations
Debottam Dutta
Purvi Agrawal
Sriram Ganapathy
16
2
0
30 Jul 2021
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
28
23
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
16
6
0
30 Jul 2021
Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Prerak Srivastava
Antoine Deleforge
Emmanuel Vincent
34
17
0
29 Jul 2021
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization
Haici Yang
Shivani Firodiya
Nicholas J. Bryan
Minje Kim
32
7
0
28 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
21
0
0
23 Jul 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Duo Ma
Nana Hou
Van Tung Pham
Haihua Xu
Chng Eng Siong
33
22
0
22 Jul 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
29
5
0
20 Jul 2021
Joint Echo Cancellation and Noise Suppression based on Cascaded Magnitude and Complex Mask Estimation
Xiaofeng Shu
Yehang Zhu
Yanjie Chen
Li Chen
Haohe Liu
Chuanzeng Huang
Yuxuan Wang
10
11
0
20 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
41
8
0
14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Xiaohuai Le
Hongsheng Chen
Kai-Jyun Chen
Jing Lu
23
78
0
12 Jul 2021
Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Shu-Tong Niu
Jun Du
Lei Sun
Chin-Hui Lee
6
4
0
06 Jul 2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Jian Wu
Zhuo Chen
Sanyuan Chen
Yu-Huan Wu
Takuya Yoshioka
Naoyuki Kanda
Shujie Liu
Jinyu Li
30
17
0
05 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Y. Kawaguchi
36
37
0
04 Jul 2021
TENET: A Time-reversal Enhancement Network for Noise-robust ASR
Fu-An Chao
Shao-Wei Fan-Jiang
Bi-Cheng Yan
J. Hung
Berlin Chen
18
12
0
04 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation
Ori Kabeli
Yossi Adi
Zhenyu Tang
Buye Xu
Anurag Kumar
20
2
0
25 Jun 2021
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition
Zhengxi Liu
Y. Qian
DRL
19
10
0
25 Jun 2021
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling
Andong Li
Wenzhe Liu
Xiaoxue Luo
Guochen Yu
C. Zheng
Xiaodong Li
31
59
0
24 Jun 2021
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair
Shanshan Wang
Gaurav Naithani
A. Politis
Tuomas Virtanen
40
10
0
22 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
19
142
0
22 Jun 2021
Multi-accent Speech Separation with One Shot Learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
41
4
0
22 Jun 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
37
64
0
20 Jun 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation
Martin Strauss
Jouni Paulus
Matteo Torcoli
B. Edler
31
8
0
16 Jun 2021
DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement
Shubo Lv
Yanxin Hu
Shimin Zhang
Lei Xie
24
93
0
16 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
23
22
0
15 Jun 2021
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement
Shimin Zhang
Yuxiang Kong
Shubo Lv
Yanxin Hu
Lei Xie
24
44
0
14 Jun 2021
Few-shot learning of new sound classes for target sound extraction
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
S. Araki
VLM
29
11
0
14 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments
Yunzhe Hao
Jiaming Xu
Peng Zhang
Bo Xu
17
17
0
13 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
24
752
0
08 Jun 2021
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
Max W. Y. Lam
Jun Wang
Chao Weng
Dan Su
Dong Yu
31
6
0
08 Jun 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
30
1
0
05 Jun 2021
Classification of Audio Segments in Call Center Recordings using Convolutional Recurrent Neural Networks
Şükrü Ozan
11
0
0
04 Jun 2021
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Keitaro Tanaka
Ryosuke Sawata
Shusuke Takahashi
22
0
0
04 Jun 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoyuki Kamo
33
23
0
02 Jun 2021
Multi-Scale Attention Neural Network for Acoustic Echo Cancellation
Lu Ma
Song Yang
Y. Gong
Zhongqin Wu
14
7
0
31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection
Lu Ma
Xintian Wang
Song Yang
Y. Gong
Zhongqin Wu
9
1
0
31 May 2021
EchoFilter: End-to-End Neural Network for Acoustic Echo Cancellation
Lu Ma
Song Yang
Y. Gong
Xintian Wang
Zhongqin Wu
6
11
0
31 May 2021
DPLM: A Deep Perceptual Spatial-Audio Localization Metric
Pranay Manocha
Anurag Kumar
Buye Xu
Anjali Menon
I. D. Gebru
V. Ithapu
P. Calamia
18
10
0
29 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
25
23
0
17 May 2021
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Koichi Saito
Tomohiko Nakamura
Kohei Yatabe
Yuma Koizumi
Hiroshi Saruwatari
BDL
VLM
31
7
0
10 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation
Sunwoo Kim
Minje Kim
36
19
0
08 May 2021
Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Dengfeng Ke
Jinsong Zhang
Yanlu Xie
Yanyan Xu
Binghuai Lin
24
2
0
06 May 2021
Self-Supervised Learning from Automatically Separated Sound Scenes
Eduardo Fonseca
A. Jansen
D. Ellis
Scott Wisdom
Marco Tagliasacchi
J. Hershey
Manoj Plakal
Shawn Hershey
R. C. Moore
Xavier Serra
SSL
31
13
0
05 May 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
27
21
0
05 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers
S. Hu
Md Rifat Arefin
V. Nguyen
Alish Dipani
Xaq Pitkow
A. Tolias
38
4
0
03 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
76
96
0
27 Apr 2021