Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.03895
Cited By
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning
8 April 2022
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
Yasunori Ohishi
S. Araki
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning"
42 / 42 papers shown
Title
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Xuelong Li
VLM
128
2
0
14 Sep 2024
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Kateřina Žmolíková
Hiroshi Sato
Tomohiro Nakatani
63
15
0
11 Apr 2022
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
49
46
0
15 Dec 2021
Environmental Sound Extraction Using Onomatopoeic Words
Yuki Okamoto
Shota Horiguchi
Masaaki Yamamoto
Keisuke Imoto
Yohei Kawaguchi
47
9
0
01 Dec 2021
Visual Scene Graphs for Audio Source Separation
Moitreya Chatterjee
Jonathan Le Roux
Narendra Ahuja
A. Cherian
64
37
0
24 Sep 2021
Few-shot learning of new sound classes for target sound extraction
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
S. Araki
VLM
46
11
0
14 Jun 2021
What's All the FUSS About Free Universal Sound Separation Data?
Scott Wisdom
Hakan Erdogan
D. Ellis
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
Justin Salamon
Prem Seetharaman
J. Hershey
66
82
0
02 Nov 2020
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision
Yun-Ning Hung
Gordon Wichern
Jonathan Le Roux
51
12
0
22 Oct 2020
LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation
Woosung Choi
Minseok Kim
Jaehwa Chung
Soonyoung Jung
75
33
0
22 Oct 2020
FSD50K: An Open Dataset of Human-Labeled Sound Events
Eduardo Fonseca
Xavier Favory
Jordi Pons
F. Font
Xavier Serra
75
458
0
01 Oct 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
84
128
0
14 Jul 2020
Improving Sound Event Detection In Domestic Environments Using Sound Separation
Nicolas Turpault
Scott Wisdom
Hakan Erdogan
J. Hershey
Romain Serizel
Eduardo Fonseca
Prem Seetharaman
Justin Salamon
74
49
0
08 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
56
27
0
23 Jun 2020
Listen to What You Want: Neural Network-based Universal Sound Selector
Tsubasa Ochiai
Marc Delcroix
Yuma Koizumi
Hiroaki Ito
K. Kinoshita
S. Araki
45
62
0
10 Jun 2020
Foreground-Background Ambient Sound Scene Separation
Michel Olvera
Emmanuel Vincent
Romain Serizel
Gilles Gasso
52
9
0
11 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
63
155
0
08 May 2020
Conditioned Source Separation for Music Instrument Performances
Olga Slizovskaia
G. Haro
E. Gómez
47
39
0
08 Apr 2020
Meta-learning Extractors for Music Source Separation
David Samuel
Aditya Ganeshan
Jason Naradowsky
56
62
0
17 Feb 2020
Source separation with weakly labelled data: An approach to computational auditory scene analysis
Qiuqiang Kong
Yuxuan Wang
Xuchen Song
Yin Cao
Wenwu Wang
Mark D. Plumbley
68
47
0
06 Feb 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
117
122
0
23 Jan 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM
SSL
186
1,076
0
21 Dec 2019
Improving Universal Sound Separation Using Sound Classification
Efthymios Tzinis
Scott Wisdom
J. Hershey
A. Jansen
D. Ellis
VLM
65
73
0
18 Nov 2019
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision
Fatemeh Pishdadian
Gordon Wichern
Jonathan Le Roux
56
43
0
06 Nov 2019
Audio query-based music source separation
Jie Hwan Lee
Hyeong-Seok Choi
Kyogu Lee
49
45
0
19 Aug 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
Gabriel Meseguer-Brocal
Geoffroy Peeters
46
61
0
02 Jul 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
44
187
0
08 May 2019
Co-Separating Sounds of Visual Objects
Ruohan Gao
Kristen Grauman
126
209
0
16 Apr 2019
Class-conditional embeddings for music source separation
A. Labatie
Gordon Wichern
Shrikant Venkataramani
Jonathan Le Roux
BDL
65
42
0
07 Nov 2018
End-to-End Sound Source Separation Conditioned On Instrument Labels
Olga Slizovskaia
Leo Kim
G. Haro
Emilia Gómez
49
32
0
05 Nov 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
68
368
0
11 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
156
1,786
0
20 Sep 2018
General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline
Eduardo Fonseca
Manoj Plakal
F. Font
D. Ellis
Xavier Favory
Jordi Pons
Xavier Serra
83
148
0
26 Jul 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
69
90
0
24 Jul 2018
An Overview of Lead and Accompaniment Separation in Music
Z. Rafii
Antoine Liutkus
Fabian-Robert Stöter
S. I. Mimilakis
D. Fitzgerald
Bryan Pardo
49
103
0
23 Apr 2018
The Sound of Pixels
Hang Zhao
Chuang Gan
Andrew Rouditchenko
Carl Vondrick
Josh H. McDermott
Antonio Torralba
VLM
102
535
0
09 Apr 2018
Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction
Daniel Stoller
Sebastian Ewert
S. Dixon
67
72
0
31 Oct 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
Morten Kolbaek
Dong Yu
Zheng-Hua Tan
Jesper Jensen
55
725
0
18 Mar 2017
Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Yi Luo
Zhuo Chen
J. Hershey
Jonathan Le Roux
N. Mesgarani
58
162
0
18 Nov 2016
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
120
2,500
0
29 Sep 2016
Deep clustering: Discriminative embeddings for segmentation and separation
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
60
1,317
0
18 Aug 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,039
0
22 Dec 2014
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
Ian Goodfellow
M. Berk Mirza
Xia Da
Aaron Courville
Yoshua Bengio
149
1,442
0
21 Dec 2013
1