SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning

8 April 2022

Marc Delcroix

Jorge Bennasar Vázquez

Papers citing "SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning"

Language-Queried Target Sound Extraction Without Parallel Training Data Hao Ma Zhiyuan Peng Xu Li Yukai Li Mingjie Shao Qiuqiang Kong Xuelong Li VLM 128 2 0 14 Sep 2024
Listen only to me! How well can target speech extraction handle false alarms? Marc Delcroix K. Kinoshita Tsubasa Ochiai Kateřina Žmolíková Hiroshi Sato Tomohiro Nakatani 63 15 0 11 Apr 2022
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data Ke Chen Xingjian Du Bilei Zhu Zejun Ma Taylor Berg-Kirkpatrick Shlomo Dubnov 49 46 0 15 Dec 2021
Environmental Sound Extraction Using Onomatopoeic Words Yuki Okamoto Shota Horiguchi Masaaki Yamamoto Keisuke Imoto Yohei Kawaguchi 47 9 0 01 Dec 2021
Visual Scene Graphs for Audio Source Separation Moitreya Chatterjee Jonathan Le Roux Narendra Ahuja A. Cherian 64 37 0 24 Sep 2021
Few-shot learning of new sound classes for target sound extraction Marc Delcroix Jorge Bennasar Vázquez Tsubasa Ochiai K. Kinoshita S. Araki VLM 46 11 0 14 Jun 2021
What's All the FUSS About Free Universal Sound Separation Data? Scott Wisdom Hakan Erdogan D. Ellis Romain Serizel Nicolas Turpault Eduardo Fonseca Justin Salamon Prem Seetharaman J. Hershey 66 82 0 02 Nov 2020
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision Yun-Ning Hung Gordon Wichern Jonathan Le Roux 51 12 0 22 Oct 2020
LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation Woosung Choi Minseok Kim Jaehwa Chung Soonyoung Jung 75 33 0 22 Oct 2020
FSD50K: An Open Dataset of Human-Labeled Sound Events Eduardo Fonseca Xavier Favory Jordi Pons F. Font Xavier Serra 75 458 0 01 Oct 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation Efthymios Tzinis Zhepei Wang Paris Smaragdis 84 128 0 14 Jul 2020
Improving Sound Event Detection In Domestic Environments Using Sound Separation Nicolas Turpault Scott Wisdom Hakan Erdogan J. Hershey Romain Serizel Eduardo Fonseca Prem Seetharaman Justin Salamon 74 49 0 08 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training Scott Wisdom Efthymios Tzinis Hakan Erdogan Ron J. Weiss K. Wilson J. Hershey 56 27 0 23 Jun 2020
Listen to What You Want: Neural Network-based Universal Sound Selector Tsubasa Ochiai Marc Delcroix Yuma Koizumi Hiroaki Ito K. Kinoshita S. Araki 45 62 0 10 Jun 2020
Foreground-Background Ambient Sound Scene Separation Michel Olvera Emmanuel Vincent Romain Serizel Gilles Gasso 52 9 0 11 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 63 155 0 08 May 2020
Conditioned Source Separation for Music Instrument Performances Olga Slizovskaia G. Haro E. Gómez 47 39 0 08 Apr 2020
Meta-learning Extractors for Music Source Separation David Samuel Aditya Ganeshan Jason Naradowsky 56 62 0 17 Feb 2020
Source separation with weakly labelled data: An approach to computational auditory scene analysis Qiuqiang Kong Yuxuan Wang Xuchen Song Yin Cao Wenwu Wang Mark D. Plumbley 68 47 0 06 Feb 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam Marc Delcroix Tsubasa Ochiai Kateřina Žmolíková K. Kinoshita Naohiro Tawara Tomohiro Nakatani S. Araki 117 122 0 23 Jan 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition Qiuqiang Kong Yin Cao Turab Iqbal Yuxuan Wang Wenwu Wang Mark D. Plumbley VLM SSL 186 1,076 0 21 Dec 2019
Improving Universal Sound Separation Using Sound Classification Efthymios Tzinis Scott Wisdom J. Hershey A. Jansen D. Ellis VLM 65 73 0 18 Nov 2019
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision Fatemeh Pishdadian Gordon Wichern Jonathan Le Roux 56 43 0 06 Nov 2019
Audio query-based music source separation Jie Hwan Lee Hyeong-Seok Choi Kyogu Lee 49 45 0 19 Aug 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations Gabriel Meseguer-Brocal Geoffroy Peeters 46 61 0 02 Jul 2019
Universal Sound Separation Ilya Kavalerov Scott Wisdom Hakan Erdogan Brian Patton K. Wilson Jonathan Le Roux J. Hershey 44 187 0 08 May 2019
Co-Separating Sounds of Visual Objects Ruohan Gao Kristen Grauman 126 209 0 16 Apr 2019
Class-conditional embeddings for music source separation A. Labatie Gordon Wichern Shrikant Venkataramani Jonathan Le Roux BDL 65 42 0 07 Nov 2018
End-to-End Sound Source Separation Conditioned On Instrument Labels Olga Slizovskaia Leo Kim G. Haro Emilia Gómez 49 32 0 05 Nov 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Quan Wang Hannah Muckenhirn K. Wilson Prashant Sridhar Zelin Wu J. Hershey Rif A. Saurous Ron J. Weiss Ye Jia Ignacio López Moreno 68 368 0 11 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Yi Luo N. Mesgarani 156 1,786 0 20 Sep 2018
General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline Eduardo Fonseca Manoj Plakal F. Font D. Ellis Xavier Favory Jordi Pons Xavier Serra 83 148 0 26 Jul 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures Jun Wang Jie Chen Dan Su Lianwu Chen Meng Yu Y. Qian Dong Yu 69 90 0 24 Jul 2018
An Overview of Lead and Accompaniment Separation in Music Z. Rafii Antoine Liutkus Fabian-Robert Stöter S. I. Mimilakis D. Fitzgerald Bryan Pardo 49 103 0 23 Apr 2018
The Sound of Pixels Hang Zhao Chuang Gan Andrew Rouditchenko Carl Vondrick Josh H. McDermott Antonio Torralba VLM 102 535 0 09 Apr 2018
Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction Daniel Stoller Sebastian Ewert S. Dixon 67 72 0 31 Oct 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks Morten Kolbaek Dong Yu Zheng-Hua Tan Jesper Jensen 55 725 0 18 Mar 2017
Deep Clustering and Conventional Networks for Music Separation: Stronger Together Yi Luo Zhuo Chen J. Hershey Jonathan Le Roux N. Mesgarani 58 162 0 18 Nov 2016
CNN Architectures for Large-Scale Audio Classification Shawn Hershey Sourish Chaudhuri D. Ellis J. Gemmeke A. Jansen ... Rif A. Saurous Bryan Seybold M. Slaney Ron J. Weiss K. Wilson 120 2,500 0 29 Sep 2016
Deep clustering: Discriminative embeddings for segmentation and separation J. Hershey Zhuo Chen Jonathan Le Roux Shinji Watanabe 60 1,317 0 18 Aug 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.8K 150,039 0 22 Dec 2014
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks Ian Goodfellow M. Berk Mirza Xia Da Aaron Courville Yoshua Bengio 149 1,442 0 21 Dec 2013