Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.08372
Cited By
Target Sound Extraction with Variable Cross-modality Clues
15 March 2023
Chenda Li
Yao Qian
Zhuo Chen
Dongmei Wang
Takuya Yoshioka
Shujie Liu
Y. Qian
Michael Zeng
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Target Sound Extraction with Variable Cross-modality Clues"
11 / 11 papers shown
Title
Beyond Speaker Identity: Text Guided Target Speech Extraction
Mingyue Huo
Abhinav Jain
Cong Phuoc Huynh
Fanjie Kong
Pichao Wang
Zhu Liu
Vimal Bhat
51
0
0
17 Jan 2025
Language-based Audio Moment Retrieval
Hokuto Munakata
Taichi Nishimura
Shota Nakada
Tatsuya Komatsu
35
1
0
24 Sep 2024
Multichannel-to-Multichannel Target Sound Extraction Using Direction and Timestamp Clues
Dayun Choi
Jung-Woo Choi
37
0
0
19 Sep 2024
DENSE: Dynamic Embedding Causal Target Speech Extraction
Yiwen Wang
Zeyu Yuan
Xihong Wu
41
0
0
10 Sep 2024
Interaural time difference loss for binaural target sound extraction
Carlos Hernandez-Olivan
Marc Delcroix
Tsubasa Ochiai
Naohiro Tawara
Tomohiro Nakatani
Shoko Araki
21
1
0
01 Aug 2024
TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information
Yiwen Wang
Xihong Wu
46
2
0
13 Jun 2024
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction
Xiang Hao
Jibin Wu
Jianwei Yu
Chenglin Xu
Kay Chen Tan
24
10
0
11 Oct 2023
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning
Xuenan Xu
Zeyu Xie
Mengyue Wu
K. Yu
34
13
0
11 May 2022
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
190
198
0
08 Jan 2021
Source separation with weakly labelled data: An approach to computational auditory scene analysis
Qiuqiang Kong
Yuxuan Wang
Xuchen Song
Yin Cao
Wenwu Wang
Mark D. Plumbley
27
47
0
06 Feb 2020
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
104
588
0
08 Jun 2018
1