Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown

Title
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement Shimin Zhang Yuxiang Kong Shubo Lv Yanxin Hu Lei Xie 67 44 0 14 Jun 2021
Few-shot learning of new sound classes for target sound extraction Marc Delcroix Jorge Bennasar Vázquez Tsubasa Ochiai K. Kinoshita S. Araki VLM 58 11 0 14 Jun 2021
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments Yunzhe Hao Jiaming Xu Peng Zhang Bo Xu 32 17 0 13 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit Mirco Ravanelli Titouan Parcollet Peter William VanHarn Plantinga Aku Rouhe Samuele Cornell ... William Aris Hwidong Na Yan Gao R. Mori Yoshua Bengio 129 769 0 08 Jun 2021
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition Max W. Y. Lam Jun Wang Chao Weng Dan Su Dong Yu 65 6 0 08 Jun 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication Yuanyuan Bao Yanze Xu Na Xu Wenjing Yang Hongfeng Li Shicong Li Y. Jia Fei Xiang Jincheng He Ming Li 87 1 0 05 Jun 2021
Classification of Audio Segments in Call Center Recordings using Convolutional Recurrent Neural Networks Şükrü Ozan 18 0 0 04 Jun 2021
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex Keitaro Tanaka Ryosuke Sawata Shusuke Takahashi 36 0 0 04 Jun 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition Hiroshi Sato Tsubasa Ochiai Marc Delcroix K. Kinoshita Takafumi Moriya Naoyuki Kamo 68 23 0 02 Jun 2021
Multi-Scale Attention Neural Network for Acoustic Echo Cancellation Lu Ma Song Yang Y. Gong Zhongqin Wu 48 7 0 31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection Lu Ma Xintian Wang Song Yang Y. Gong Zhongqin Wu 36 1 0 31 May 2021
EchoFilter: End-to-End Neural Network for Acoustic Echo Cancellation Lu Ma Song Yang Y. Gong Xintian Wang Zhongqin Wu 44 12 0 31 May 2021
DPLM: A Deep Perceptual Spatial-Audio Localization Metric Pranay Manocha Anurag Kumar Buye Xu Anjali Menon I. D. Gebru V. Ithapu P. Calamia 62 10 0 29 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics V. Jayaram John Thickstun DiffM 107 25 0 17 May 2021
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method Koichi Saito Tomohiko Nakamura Kohei Yatabe Yuma Koizumi Hiroshi Saruwatari BDL VLM 36 7 0 10 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation Sunwoo Kim Minje Kim 88 20 0 08 May 2021
Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU Dengfeng Ke Jinsong Zhang Yanlu Xie Yanyan Xu Binghuai Lin 39 2 0 06 May 2021
Self-Supervised Learning from Automatically Separated Sound Scenes Eduardo Fonseca A. Jansen D. Ellis Scott Wisdom Marco Tagliasacchi J. Hershey Manoj Plakal Shawn Hershey R. C. Moore Xavier Serra SSL 81 13 0 05 May 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings Soumi Maiti Hakan Erdogan K. Wilson Scott Wisdom Shinji Watanabe J. Hershey 72 22 0 05 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers S. Hu Md Rifat Arefin V. Nguyen Alish Dipani Xaq Pitkow A. Tolias 64 4 0 03 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement Feng Dang Hangting Chen Pengyuan Zhang 131 105 0 27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes Francesc Lluís V. Chatziioannou A. Hofmann 3DPC 113 6 0 26 Apr 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training Shaked Dovrat Eliya Nachmani Lior Wolf VLM 96 22 0 18 Apr 2021
Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement Haoyu Li Junichi Yamagishi 27 9 0 17 Apr 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation Xiyun Li Yong-mei Xu Meng Yu Shi-Xiong Zhang Jiaming Xu Bo Xu Dong Yu 52 14 0 17 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration V. Narayanaswamy Jayaraman J. Thiagarajan A. Spanias AI4CE 49 5 0 14 Apr 2021
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing E. Guizzo R. F. Gramaccioni Saeid Jamili Christian Marinoni Edoardo Massaro ... Marco Pennese Sveva Pepe Enrico Rocchi A. Uncini Danilo Comminiello 154 27 0 12 Apr 2021
Learning to Rank Microphones for Distant Speech Recognition Samuele Cornell Alessio Brutti M. Matassoni S. Squartini 45 4 0 06 Apr 2021
Noise Estimation for Generative Diffusion Models Robin San-Roman Eliya Nachmani Lior Wolf DiffM 126 107 0 06 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification Aswin Sivaraman Sunwoo Kim Minje Kim 100 23 0 05 Apr 2021
Efficient Personalized Speech Enhancement through Self-Supervised Learning Aswin Sivaraman Minje Kim 67 20 0 05 Apr 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment Meng Yu Chunlei Zhang Yong-mei Xu Shi-Xiong Zhang Dong Yu 55 31 0 02 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN Giorgio Barnabò Giovanni Trappolini L. Lastilla Cesare Campagnano Angela Fan Fabio Petroni Fabrizio Silvestri 64 4 0 01 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech Chenglin Xu Wei Rao Jibin Wu Haizhou Li 68 32 0 30 Mar 2021
Time-domain Speech Enhancement with Generative Adversarial Learning Feiyang Xiao Jian Guan Qiuqiang Kong Wenwu Wang GAN 74 9 0 30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement Morten Kolbæk Zheng-Hua Tan S. H. Jensen Jesper Jensen 81 2 0 27 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming Lukas Pfeifenberger Franz Pernkopf 36 5 0 24 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge Yuxuan Wang Maokui He Shutong Niu Lei Sun Tian Gao Xin Fang Jia Pan Jun Du Chin-Hui Lee 76 30 0 19 Mar 2021
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation C. Garoufis Athanasia Zlatintsi Petros Maragos 69 2 0 07 Mar 2021
Compute and memory efficient universal sound source separation Efthymios Tzinis Zhepei Wang Xilin Jiang Paris Smaragdis 90 40 0 03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect Jun Wang Max W. Y. Lam Dan Su Dong Yu 55 6 0 02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation Max W. Y. Lam Jun Wang Dan Su Dong Yu AI4TS 121 49 0 01 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks Ju Lin A. Wijngaarden Kuang-Ching Wang M. C. Smith 78 51 0 24 Feb 2021
Handling Background Noise in Neural Speech Generation Tom Denton Alejandro Luebs Felicia S. C. Lim Andrew Storus Hengchin Yeh W. Kleijn Jan Skoglund 52 2 0 23 Feb 2021
Dual-Path Modeling for Long Recording Speech Separation in Meetings Chenda Li Zhuo Chen Yi Luo Cong Han Tianyan Zhou K. Kinoshita Marc Delcroix Shinji Watanabe Y. Qian 41 10 0 23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on Transformer Zining Zhang Bingsheng He Zhenjie Zhang 62 23 0 19 Feb 2021
Speech enhancement with weakly labelled data from AudioSet Qiuqiang Kong Haohe Liu Xingjian Du Li Chen Rui Xia Yuxuan Wang 82 18 0 19 Feb 2021
CatNet: music source separation system with mix-audio augmentation Xuchen Song Qiuqiang Kong Xingjian Du Yuxuan Wang 56 10 0 19 Feb 2021
Generative Speech Coding with Predictive Variance Regularization W. Kleijn Andrew Storus Michael Chinen Tom Denton Felicia S. C. Lim Alejandro Luebs Jan Skoglund Hengchin Yeh 65 68 0 18 Feb 2021
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms Kleanthis Avramidis Agelos Kratimenos C. Garoufis Athanasia Zlatintsi Petros Maragos 30 8 0 13 Feb 2021