Filterbank design for end-to-end speech separation

23 October 2019

Antoine Deleforge

Papers citing "Filterbank design for end-to-end speech separation"

31 / 31 papers shown

Title
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling Jakob Poncelet Hugo Van hamme 69 0 0 05 Feb 2025
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms Seung-bin Kim Chan-yeong Lim Jungwoo Heo Ju-ho Kim Hyun-Seo Shin Kyo-Won Koo Ha-Jin Yu 52 0 0 11 Jun 2024
To what extent can ASV systems naturally defend against spoofing attacks? Jee-weon Jung Xin Eric Wang Nicholas W. D. Evans Shinji Watanabe Hye-jin Shim Hemlata Tak Sidhhant Arora Junichi Yamagishi Joon Son Chung AAML 43 3 0 08 Jun 2024
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet Satvik Venkatesh Arthur Benilov Philip Coleman Frederic Roskam 37 5 0 27 Feb 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection Théo Mariotte Anthony Larcher Silvio Montrésor Jean-Hugh Thomas 27 2 0 13 Feb 2024
A Convolutional Network Adaptation for Cortical Classification During Mobile Brain Imaging B. Cichy J. Lukos Mohammad Alam J. C. Bradford Nicholas Wymbs 15 0 0 11 Oct 2023
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints P. Magron Tuomas Virtanen 24 0 0 03 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions Shengkui Zhao Bin Ma 33 52 0 23 Feb 2023
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes Danilo de Oliveira Tal Peer Timo Gerkmann 18 18 0 23 Jun 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length Tal Peer Timo Gerkmann 19 21 0 30 Mar 2022
Pushing the limits of raw waveform speaker recognition Jee-weon Jung You Jin Kim Hee-Soo Heo Bong-Jin Lee Youngki Kwon Joon Son Chung 31 87 0 16 Mar 2022
Learning Filterbanks for End-to-End Acoustic Beamforming Samuele Cornell Manuel Pariente François Grondin S. Squartini 35 7 0 08 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition Yuma Koizumi Shigeki Karita A. Narayanan S. Panchapagesan M. Bacchiani 27 14 0 01 Nov 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain Zengwei Yao Wenjie Pei Fanglin Chen Guangming Lu David C. Zhang 21 12 0 10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions Ge Zhu Frank Cwitkowitz Z. Duan 22 2 0 08 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification Xuechen Liu Md. Sahidullah Tomi Kinnunen 32 6 0 24 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription Frank Cwitkowitz M. Heydari Z. Duan 27 2 0 23 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement Yuma Koizumi Shigeki Karita Scott Wisdom Hakan Erdogan J. Hershey Llion Jones M. Bacchiani 19 41 0 30 Jun 2021
A Modulation Front-End for Music Audio Tagging Cyrus Vahidi C. Saitis Gyorgy Fazekas 21 2 0 25 May 2021
Learnable MFCCs for Speaker Verification Xuechen Liu Md. Sahidullah Tomi Kinnunen 26 17 0 20 Feb 2021
LEAF: A Learnable Frontend for Audio Classification Neil Zeghidour O. Teboul Félix de Chaumont Quitry Marco Tagliasacchi VLM AAML 85 144 0 21 Jan 2021
A comparison of handcrafted, parameterized, and learnable features for speech separation Wenbo Zhu Mou Wang Xiao-Lei Zhang S. Rahardja 21 4 0 29 Nov 2020
Attention-based scaling adaptation for target speech extraction Jiangyu Han Wei Rao Yanhua Long Jiaen Liang 16 9 0 19 Oct 2020
Vector-Quantized Timbre Representation Adrien Bitton P. Esling Tatsuya Harada 20 12 0 13 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training Scott Wisdom Efthymios Tzinis Hakan Erdogan Ron J. Weiss K. Wilson J. Hershey 16 27 0 23 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 27 151 0 08 May 2020
Unsupervised Interpretable Representation Learning for Singing Voice Separation S. I. Mimilakis K. Drossos G. Schuller 27 8 0 03 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani Yossi Adi Lior Wolf 20 175 0 29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering Neil Zeghidour David Grangier VLM 27 261 0 20 Feb 2020
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet David Ditter Timo Gerkmann 17 57 0 25 Oct 2019
Deep Ad-hoc Beamforming Xiao-Lei Zhang 10 21 0 03 Nov 2018