Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.15463
Cited By
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
24 December 2023
Yuanyuan Wang
Hangting Chen
Dongchao Yang
Jianwei Yu
Chao Weng
Zhiyong Wu
Helen M. Meng
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Consistent and Relevant: Rethink the Query Embedding in General Sound Separation"
18 / 18 papers shown
Title
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Xubo Liu
Wenbo Wang
Shuhan Qi
Kejia Zhang
Jianyuan Sun
Wenwu Wang
66
7
0
06 Jul 2024
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Hangting Chen
Jianwei Yu
Yi Luo
Rongzhi Gu
Weihua Li
Zhuocheng Lu
Chao Weng
65
7
0
21 Aug 2023
Improving Target Sound Extraction with Timestamp Information
Helin Wang
Dongchao Yang
Chao Weng
Jianwei Yu
Yuexian Zou
64
10
0
02 Apr 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
159
274
0
02 Feb 2022
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
64
46
0
15 Dec 2021
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Hu Cao
Yueyue Wang
Jieneng Chen
Dongsheng Jiang
Xiaopeng Zhang
Qi Tian
Manning Wang
ViT
MedIm
141
2,922
0
12 May 2021
AST: Audio Spectrogram Transformer
Yuan Gong
Yu-An Chung
James R. Glass
ViT
145
883
0
05 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
467
21,603
0
25 Mar 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
186
147
0
02 Feb 2021
Source separation with weakly labelled data: An approach to computational auditory scene analysis
Qiuqiang Kong
Yuxuan Wang
Xuchen Song
Yin Cao
Wenwu Wang
Mark D. Plumbley
73
47
0
06 Feb 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM
SSL
199
1,084
0
21 Dec 2019
Audio query-based music source separation
Jie Hwan Lee
Hyeong-Seok Choi
Kyogu Lee
51
45
0
19 Aug 2019
Class-conditional embeddings for music source separation
A. Labatie
Gordon Wichern
Shrikant Venkataramani
Jonathan Le Roux
BDL
69
42
0
07 Nov 2018
End-to-End Sound Source Separation Conditioned On Instrument Labels
Olga Slizovskaia
Leo Kim
G. Haro
Emilia Gómez
58
32
0
05 Nov 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
140
598
0
08 Jun 2018
MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation
Naoya Takahashi
Nabarun Goswami
Yuki Mitsufuji
93
143
0
07 May 2018
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
78
633
0
01 Nov 2017
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
1.9K
77,441
0
18 May 2015
1