ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10400
  4. Cited By
Filterbank design for end-to-end speech separation

Filterbank design for end-to-end speech separation

23 October 2019
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
ArXivPDFHTML

Papers citing "Filterbank design for end-to-end speech separation"

31 / 31 papers shown
Title
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling
Jakob Poncelet
Hugo Van hamme
69
0
0
05 Feb 2025
MR-RawNet: Speaker verification system with multiple temporal
  resolutions for variable duration utterances using raw waveforms
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms
Seung-bin Kim
Chan-yeong Lim
Jungwoo Heo
Ju-ho Kim
Hyun-Seo Shin
Kyo-Won Koo
Ha-Jin Yu
52
0
0
11 Jun 2024
To what extent can ASV systems naturally defend against spoofing
  attacks?
To what extent can ASV systems naturally defend against spoofing attacks?
Jee-weon Jung
Xin Eric Wang
Nicholas W. D. Evans
Shinji Watanabe
Hye-jin Shim
Hemlata Tak
Sidhhant Arora
Junichi Yamagishi
Joon Son Chung
AAML
43
3
0
08 Jun 2024
Real-time Low-latency Music Source Separation using Hybrid
  Spectrogram-TasNet
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
37
5
0
27 Feb 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and
  Overlapped Speech Detection
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
27
2
0
13 Feb 2024
A Convolutional Network Adaptation for Cortical Classification During
  Mobile Brain Imaging
A Convolutional Network Adaptation for Cortical Classification During Mobile Brain Imaging
B. Cichy
J. Lukos
Mohammad Alam
J. C. Bradford
Nicholas Wymbs
15
0
0
11 Oct 2023
Spectrogram Inversion for Audio Source Separation via Consistency,
  Mixing, and Magnitude Constraints
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints
P. Magron
Tuomas Virtanen
24
0
0
03 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation
  using Gated Single-Head Transformer with Convolution-Augmented Joint
  Self-Attentions
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
Shengkui Zhao
Bin Ma
33
52
0
23 Feb 2023
Efficient Transformer-based Speech Enhancement Using Long Frames and
  STFT Magnitudes
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
18
18
0
23 Jun 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Tal Peer
Timo Gerkmann
19
21
0
30 Mar 2022
Pushing the limits of raw waveform speaker recognition
Pushing the limits of raw waveform speaker recognition
Jee-weon Jung
You Jin Kim
Hee-Soo Heo
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
31
87
0
16 Mar 2022
Learning Filterbanks for End-to-End Acoustic Beamforming
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
35
7
0
08 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
SNRi Target Training for Joint Speech Enhancement and Recognition
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
27
14
0
01 Nov 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in
  High-order Latent Domain
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
21
12
0
10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under
  mismatched conditions
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
Ge Zhu
Frank Cwitkowitz
Z. Duan
22
2
0
08 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep
  Speaker Verification
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
32
6
0
24 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
27
2
0
23 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
A Modulation Front-End for Music Audio Tagging
A Modulation Front-End for Music Audio Tagging
Cyrus Vahidi
C. Saitis
Gyorgy Fazekas
21
2
0
25 May 2021
Learnable MFCCs for Speaker Verification
Learnable MFCCs for Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
26
17
0
20 Feb 2021
LEAF: A Learnable Frontend for Audio Classification
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
85
144
0
21 Jan 2021
A comparison of handcrafted, parameterized, and learnable features for
  speech separation
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
21
4
0
29 Nov 2020
Attention-based scaling adaptation for target speech extraction
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
16
9
0
19 Oct 2020
Vector-Quantized Timbre Representation
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
20
12
0
13 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
16
27
0
23 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
27
151
0
08 May 2020
Unsupervised Interpretable Representation Learning for Singing Voice
  Separation
Unsupervised Interpretable Representation Learning for Singing Voice Separation
S. I. Mimilakis
K. Drossos
G. Schuller
27
8
0
03 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
20
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
27
261
0
20 Feb 2020
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
David Ditter
Timo Gerkmann
17
57
0
25 Oct 2019
Deep Ad-hoc Beamforming
Deep Ad-hoc Beamforming
Xiao-Lei Zhang
10
21
0
03 Nov 2018
1