Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.07454
Cited By
v1
v2
v3 (latest)
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 773 papers shown
Title
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement
Shimin Zhang
Yuxiang Kong
Shubo Lv
Yanxin Hu
Lei Xie
67
44
0
14 Jun 2021
Few-shot learning of new sound classes for target sound extraction
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
S. Araki
VLM
58
11
0
14 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments
Yunzhe Hao
Jiaming Xu
Peng Zhang
Bo Xu
32
17
0
13 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
129
769
0
08 Jun 2021
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
Max W. Y. Lam
Jun Wang
Chao Weng
Dan Su
Dong Yu
65
6
0
08 Jun 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
87
1
0
05 Jun 2021
Classification of Audio Segments in Call Center Recordings using Convolutional Recurrent Neural Networks
¸Sükrü Ozan
18
0
0
04 Jun 2021
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Keitaro Tanaka
Ryosuke Sawata
Shusuke Takahashi
36
0
0
04 Jun 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoyuki Kamo
68
23
0
02 Jun 2021
Multi-Scale Attention Neural Network for Acoustic Echo Cancellation
Lu Ma
Song Yang
Y. Gong
Zhongqin Wu
48
7
0
31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection
Lu Ma
Xintian Wang
Song Yang
Y. Gong
Zhongqin Wu
36
1
0
31 May 2021
EchoFilter: End-to-End Neural Network for Acoustic Echo Cancellation
Lu Ma
Song Yang
Y. Gong
Xintian Wang
Zhongqin Wu
44
12
0
31 May 2021
DPLM: A Deep Perceptual Spatial-Audio Localization Metric
Pranay Manocha
Anurag Kumar
Buye Xu
Anjali Menon
I. D. Gebru
V. Ithapu
P. Calamia
62
10
0
29 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Koichi Saito
Tomohiko Nakamura
Kohei Yatabe
Yuma Koizumi
Hiroshi Saruwatari
BDL
VLM
36
7
0
10 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation
Sunwoo Kim
Minje Kim
88
20
0
08 May 2021
Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Dengfeng Ke
Jinsong Zhang
Yanlu Xie
Yanyan Xu
Binghuai Lin
39
2
0
06 May 2021
Self-Supervised Learning from Automatically Separated Sound Scenes
Eduardo Fonseca
A. Jansen
D. Ellis
Scott Wisdom
Marco Tagliasacchi
J. Hershey
Manoj Plakal
Shawn Hershey
R. C. Moore
Xavier Serra
SSL
81
13
0
05 May 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
72
22
0
05 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers
S. Hu
Md Rifat Arefin
V. Nguyen
Alish Dipani
Xaq Pitkow
A. Tolias
64
4
0
03 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
131
105
0
27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
113
6
0
26 Apr 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training
Shaked Dovrat
Eliya Nachmani
Lior Wolf
VLM
96
22
0
18 Apr 2021
Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
Haoyu Li
Junichi Yamagishi
27
9
0
17 Apr 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Xiyun Li
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Jiaming Xu
Bo Xu
Dong Yu
52
14
0
17 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration
V. Narayanaswamy
Jayaraman J. Thiagarajan
A. Spanias
AI4CE
49
5
0
14 Apr 2021
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
E. Guizzo
R. F. Gramaccioni
Saeid Jamili
Christian Marinoni
Edoardo Massaro
...
Marco Pennese
Sveva Pepe
Enrico Rocchi
A. Uncini
Danilo Comminiello
154
27
0
12 Apr 2021
Learning to Rank Microphones for Distant Speech Recognition
Samuele Cornell
Alessio Brutti
M. Matassoni
S. Squartini
45
4
0
06 Apr 2021
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
126
107
0
06 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Aswin Sivaraman
Sunwoo Kim
Minje Kim
100
23
0
05 Apr 2021
Efficient Personalized Speech Enhancement through Self-Supervised Learning
Aswin Sivaraman
Minje Kim
67
20
0
05 Apr 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu
Chunlei Zhang
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
55
31
0
02 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
64
4
0
01 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
68
32
0
30 Mar 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
74
9
0
30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
81
2
0
27 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming
Lukas Pfeifenberger
Franz Pernkopf
36
5
0
24 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
76
30
0
19 Mar 2021
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
69
2
0
07 Mar 2021
Compute and memory efficient universal sound source separation
Efthymios Tzinis
Zhepei Wang
Xilin Jiang
Paris Smaragdis
90
40
0
03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
55
6
0
02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
AI4TS
121
49
0
01 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
78
51
0
24 Feb 2021
Handling Background Noise in Neural Speech Generation
Tom Denton
Alejandro Luebs
Felicia S. C. Lim
Andrew Storus
Hengchin Yeh
W. Kleijn
Jan Skoglund
52
2
0
23 Feb 2021
Dual-Path Modeling for Long Recording Speech Separation in Meetings
Chenda Li
Zhuo Chen
Yi Luo
Cong Han
Tianyan Zhou
K. Kinoshita
Marc Delcroix
Shinji Watanabe
Y. Qian
41
10
0
23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
62
23
0
19 Feb 2021
Speech enhancement with weakly labelled data from AudioSet
Qiuqiang Kong
Haohe Liu
Xingjian Du
Li Chen
Rui Xia
Yuxuan Wang
82
18
0
19 Feb 2021
CatNet: music source separation system with mix-audio augmentation
Xuchen Song
Qiuqiang Kong
Xingjian Du
Yuxuan Wang
56
10
0
19 Feb 2021
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
65
68
0
18 Feb 2021
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms
Kleanthis Avramidis
Agelos Kratimenos
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
30
8
0
13 Feb 2021
Previous
1
2
3
...
11
12
13
14
15
16
Next