v1v2v3 (latest)

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown

Title
All-neural beamformer for continuous speech separation Zhuohuang Zhang Takuya Yoshioka Naoyuki Kanda Zhuo Chen Xiaofei Wang Dongmei Wang Sefik Emre Eskimez 69 16 0 13 Oct 2021
Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models Ryosuke Sawata Yosuke Kashiwagi Shusuke Takahashi 52 6 0 12 Oct 2021
Source Mixing and Separation Robust Audio Steganography Naoya Takahashi M. Singh Yuki Mitsufuji 56 6 0 11 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain Zengwei Yao Wenjie Pei Fanglin Chen Guangming Lu David C. Zhang 74 12 0 10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions Ge Zhu Frank Cwitkowitz Z. Duan 55 2 0 08 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation Ali Aroudi Stefan Uhlich M. Font ViT 51 5 0 08 Oct 2021
An Investigation of the Effectiveness of Phase for Audio Classification Shunsuke Hidaka Kohei Wakamiya T. Kaburagi 28 4 0 06 Oct 2021
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression Karn N. Watcharasupat Thi Ngoc Tho Nguyen W. Gan Shengkui Zhao Bin Ma 75 12 0 02 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue Zexu Pan Meng Ge Haizhou Li 75 44 0 30 Sep 2021
VoiceFixer: Toward General Speech Restoration with Neural Vocoder Haohe Liu Qiuqiang Kong Qiao Tian Yan Zhao DeLiang Wang Chuanzeng Huang Yuxuan Wang 87 58 0 28 Sep 2021
FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures Li Li Hirokazu Kameoka S. Makino DRL 77 8 0 28 Sep 2021
Noisy-to-Noisy Voice Conversion Framework with Denoising Model Chao Xie Yi-Chiao Wu Patrick Lumban Tobing Wen-Chin Huang Tomoki Toda 57 8 0 22 Sep 2021
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References Pranay Manocha Buye Xu Anurag Kumar 98 49 0 16 Sep 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation Qiuqiang Kong Yin Cao Haohe Liu Keunwoo Choi Yuxuan Wang 190 100 0 12 Sep 2021
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems Yangyang Xia Buye Xu Anurag Kumar 40 7 0 11 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection Siqi Zheng Shiliang Zhang Weilong Huang Qian Chen Hongbin Suo Ming Lei Jinwei Feng Zhijie Yan 75 8 0 09 Sep 2021
A Survey of Sound Source Localization with Deep Learning Methods Pierre-Amaury Grumiaux Srdjan Kitić Laurent Girin Alexandre Guérin 80 257 0 08 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR Fu-An Chao J. Hung Berlin Chen 34 7 0 26 Aug 2021
Learning Sparse Analytic Filters for Piano Transcription Frank Cwitkowitz M. Heydari Z. Duan 67 2 0 23 Aug 2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 97 33 0 16 Aug 2021
Convolutive Prediction for Reverberant Speech Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 87 12 0 16 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 81 74 0 11 Aug 2021
The Right to Talk: An Audio-Visual Transformer Approach Thanh-Dat Truong C. Duong T. D. Vu H. Pham Bhiksha Raj Ngan Le Khoa Luu 120 36 0 06 Aug 2021
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation Tomohiro Nakatani Rintaro Ikeshita K. Kinoshita H. Sawada S. Araki 60 19 0 04 Aug 2021
A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations Debottam Dutta Purvi Agrawal Sriram Ganapathy 41 2 0 30 Jul 2021
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Thilo von Neumann K. Kinoshita Christoph Boeddeker Marc Delcroix Reinhold Haeb-Umbach 65 23 0 30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation Thilo von Neumann Christoph Boeddeker K. Kinoshita Marc Delcroix Reinhold Haeb-Umbach 58 6 0 30 Jul 2021
Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings Prerak Srivastava Antoine Deleforge Emmanuel Vincent 68 17 0 29 Jul 2021
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization Haici Yang Shivani Firodiya Nicholas J. Bryan Minje Kim 69 7 0 28 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model Quandong Wang Junnan Wu Zhao Yan Sichong Qian Liyong Guo Lichun Fan Weiji Zhuang Peng Gao Yujun Wang 72 0 0 23 Jul 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech Duo Ma Nana Hou Van Tung Pham Haihua Xu Chng Eng Siong 69 22 0 22 Jul 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model Cheng-Hung Hu Yu-Huai Peng Junichi Yamagishi Yu Tsao Hsin-Min Wang 46 5 0 20 Jul 2021
Joint Echo Cancellation and Noise Suppression based on Cascaded Magnitude and Complex Mask Estimation Xiaofeng Shu Yehang Zhu Yanjie Chen Li Chen Haohe Liu Chuanzeng Huang Yuxuan Wang 51 11 0 20 Jul 2021
Multi-Task Audio Source Separation Lu Zhang Chenxing Li Feng Deng Xiaorui Wang 67 9 0 14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement Xiaohuai Le Hongsheng Chen Kai-Jyun Chen Jing Lu 71 83 0 12 Jul 2021
Separation Guided Speaker Diarization in Realistic Mismatched Conditions Shu-Tong Niu Jun Du Lei Sun Chin-Hui Lee 43 5 0 06 Jul 2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR Jian Wu Zhuo Chen Sanyuan Chen Yu-Huan Wu Takuya Yoshioka Naoyuki Kanda Shujie Liu Jinyu Li 68 17 0 05 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yawen Xue Yuki Takashima Yohei Kawaguchi 79 38 0 04 Jul 2021
TENET: A Time-reversal Enhancement Network for Noise-robust ASR Fu-An Chao Shao-Wei Fan-Jiang Bi-Cheng Yan J. Hung Berlin Chen 60 13 0 04 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement Yuma Koizumi Shigeki Karita Scott Wisdom Hakan Erdogan J. Hershey Llion Jones M. Bacchiani 86 41 0 30 Jun 2021
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation Ori Kabeli Yossi Adi Zhenyu Tang Buye Xu Anurag Kumar 34 2 0 25 Jun 2021
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition Zhengxi Liu Y. Qian DRL 49 10 0 25 Jun 2021
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling Andong Li Wenzhe Liu Xiaoxue Luo Guochen Yu C. Zheng Xiaodong Li 75 60 0 24 Jun 2021
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair Shanshan Wang Gaurav Naithani Archontis Politis Tuomas Virtanen 61 10 0 22 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement Andong Li C. Zheng Lu Zhang Xiaodong Li 91 147 0 22 Jun 2021
Multi-accent Speech Separation with One Shot Learning Kuan-Po Huang Yuan-Kuei Wu Hung-yi Lee 100 4 0 22 Jun 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization Shota Horiguchi Yusuke Fujita Shinji Watanabe Yawen Xue Leibny Paola García-Perera 74 68 0 20 Jun 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation Martin Strauss Jouni Paulus Matteo Torcoli B. Edler 51 9 0 16 Jun 2021
DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement Shubo Lv Yanxin Hu Shimin Zhang Lei Xie 61 94 0 16 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation Jisi Zhang Catalin Zorila R. Doddipatla Jon Barker 56 22 0 15 Jun 2021