v1v2v3 (latest)

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown

Title
Music Source Separation with Band-split RNN Yi Luo Jianwei Yu 121 120 0 30 Sep 2022
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization Xiaokang Zhao Qiu-shi Zhu Jie Zhang 113 5 0 28 Sep 2022
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations Xucheng Wan Kai Liu Z.C. Du Huan Zhou 36 0 0 24 Sep 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement Sherif Abdulatif Ru Cao Bin Yang 107 75 0 22 Sep 2022
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement Jianrong Wang Xiaomin Li Xuewei Li Mei Yu Qiang Fang Li Liu 60 0 0 15 Sep 2022
Streaming Target-Speaker ASR with Neural Transducer Takafumi Moriya Hiroshi Sato Tsubasa Ochiai Marc Delcroix T. Shinozaki 81 21 0 09 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation Zhong-Qiu Wang Samuele Cornell Shukjae Choi Younglo Lee Byeonghak Kim Shinji Watanabe 149 108 0 08 Sep 2022
Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments Kai Chen Hao-Wen Dong Yi Luo Julian McAuley Taylor Berg-Kirkpatrick M. Puckette Shlomo Dubnov 72 5 0 07 Sep 2022
Automatic music mixing with deep learning and out-of-domain data Marco A. Martínez-Ramírez Wei-Hsiang Liao Giorgio Fabbro Stefan Uhlich Chihiro Nagashima Yuki Mitsufuji 80 27 0 24 Aug 2022
Exploiting Temporal Structures of Cyclostationary Signals for Data-Driven Single-Channel Source Separation Gary C. F. Lee Amir Weiss A. Lancho Jennifer Tang Yuheng Bu Yury Polyanskiy G. Wornell 57 6 0 22 Aug 2022
Analysis of impact of emotions on target speech extraction and speech separation Jan vSvec Katevrina vZmolíková M. Kocour Marc Delcroix Tsubasa Ochiai Ladislav Movsner JanHonza'' vCernocký 44 4 0 15 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models Julius Richter Simon Welker Jean-Marie Lemercier Bunlong Lay Timo Gerkmann DiffM 93 207 0 11 Aug 2022
Conv-NILM-Net, a causal and multi-appliance model for energy source separation Mohamed Alami Chehboune Jérémie Decock Rim Kaddah Jesse Read 44 1 0 03 Aug 2022
Spatial Aware Multi-Task Learning Based Speech Separation Wei Sun Mei Wang L. Qiu 33 3 0 20 Jul 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding Yen-Ju Lu Xuankai Chang Chenda Li Wangyou Zhang Samuele Cornell ... Robin Scheibler Zhong-Qiu Wang Yu Tsao Y. Qian Shinji Watanabe VLM 74 28 0 19 Jul 2022
PodcastMix: A dataset for separating music and speech in podcasts Nico M. Schmidt Jordi Pons M. Miron 46 3 0 15 Jul 2022
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate Nabarun Goswami Tatsuya Harada 78 5 0 13 Jul 2022
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction Zhongweiyang Xu Xulin Fan M. Hasegawa-Johnson 46 3 0 09 Jul 2022
Learning to Separate Voices by Spatial Regions Alan Xu Romit Roy Choudhury 110 10 0 09 Jul 2022
Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain Dejan Marković Alexandre Défossez Alexander Richard 86 16 0 30 Jun 2022
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion Ahmad Aloradi Wolfgang Mack Mohamed Elminshawi Emanuel Habets 63 5 0 28 Jun 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation Jian Luo Jianzong Wang Ning Cheng Edward Xiao Xulong Zhang Jing Xiao ViT 78 12 0 28 Jun 2022
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement Ishan Chatterjee Maruchi Kim V. Jayaram Shyamnath Gollakota Ira Kemelmacher-Shlizerman Shwetak N. Patel S. M. Seitz 72 25 0 27 Jun 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes Danilo de Oliveira Tal Peer Timo Gerkmann 61 21 0 23 Jun 2022
Restoring speech intelligibility for hearing aid users with deep learning P. U. Diehl Y. Singer Hannes Zilly U. Schonfeld Paul Meyer-Rachner Mark Berry Henning Sprekeler Elias Sprengel A. Pudszuhn V. Hofmann 36 20 0 23 Jun 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models Rahil Parikh G. Rochette C. Espy-Wilson S. Shamma UQCV 35 0 0 20 Jun 2022
Resource-Efficient Separation Transformer Luca Della Libera Cem Subakan Mirco Ravanelli Samuele Cornell Frédéric Lepoutre François Grondin VLM 99 18 0 19 Jun 2022
GMM based multi-stage Wiener filtering for low SNR speech enhancement Wageesha Manamperi P. Samarasinghe T. Abhayapala J. Zhang 30 6 0 19 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention Zhepei Wang Ritwik Giri Shrikant Venkataramani Umut Isik J. Valin Paris Smaragdis Mike Goodwin A. Krishnaswamy 59 7 0 18 Jun 2022
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios Bang Zeng Weiqing Wang Yuanyuan Bao Ming Li 59 0 0 17 Jun 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations Hiroshi Sato Tsubasa Ochiai Marc Delcroix K. Kinoshita Takafumi Moriya Naoki Makishima Mana Ihori Tomohiro Tanaka Ryo Masumura 18 5 0 16 Jun 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems Kai Li Xiaolin Hu Yi Luo 72 16 0 15 Jun 2022
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems Kai Li Yi Luo 86 13 0 15 Jun 2022
LPCSE: Neural Speech Enhancement through Linear Predictive Coding Yang Liu Na Tang Xia Chu Yang Yang Jun Wang 66 1 0 14 Jun 2022
Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate Model Predictive Trajectory Tracking Alessandro Saviolo Guanrui Li Giuseppe Loianno 97 52 0 07 Jun 2022
Sampling Frequency Independent Dialogue Separation Jouni Paulus Matteo Torcoli 50 13 0 05 Jun 2022
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR Qiu-shi Zhu Jie Zhang Zitian Zhang Lirong Dai 90 15 0 26 May 2022
SepIt: Approaching a Single Channel Speech Separation Bound Shahar Lutati Eliya Nachmani Lior Wolf VLM 133 27 0 24 May 2022
Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments Joseph Peter Caroselli A. Narayanan Yiteng Huang 32 1 0 17 May 2022
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation William Ravenscroft Stefan Goetze Thomas Hain 47 6 0 17 May 2022
A deep representation learning speech enhancement method using $β$ -VAE Yang Xiang Jesper Lisby Højvang M. Rasmussen M. G. Christensen DRL 55 2 0 11 May 2022
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition Catalin Zorila R. Doddipatla 38 11 0 09 May 2022
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking Tsubasa Ochiai Marc Delcroix Tomohiro Nakatani S. Araki 36 20 0 07 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement Andong Li Shan You Guochen Yu C. Zheng Xiaodong Li 65 28 0 30 Apr 2022
Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers Joseph Peter Caroselli A. Narayanan N. Howard Tom O'Malley 62 5 0 25 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation Jiangyu Han Yanhua Long 65 6 0 23 Apr 2022
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency Zhong-Qiu Wang Gordon Wichern Shinji Watanabe Jonathan Le Roux 87 36 0 21 Apr 2022
Music Source Separation with Generative Flow Ge Zhu Jordan Darefsky Fei Jiang A. Selitskiy Z. Duan 88 8 0 19 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction Zifeng Zhao Rongzhi Gu Dongchao Yang Jinchuan Tian Yuexian Zou 59 2 0 15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System M. Z. Ozturk Chenshu Wu Beibei Wang Min Wu K. Liu 62 21 0 14 Apr 2022