Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.07454
Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 753 papers shown
Title
A Near-Real-Time Processing Ego Speech Filtering Pipeline Designed for Speech Interruption During Human-Robot Interaction
Yue Li
Florian A. Kunneman
Koen V. Hindriks
31
2
0
22 May 2024
Look Once to Hear: Target Speech Hearing with Noisy Examples
Bandhav Veluri
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
38
14
0
10 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
Federico Nicolás Peccia
Oliver Bringmann
36
0
0
06 May 2024
TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms
Yueyuan Sui
Minghui Zhao
Junxi Xia
Xiaofan Jiang
S. Xia
Mamba
45
11
0
02 May 2024
Deep low-latency joint speech transmission and enhancement over a gaussian channel
Mohammad Bokaei
Jesper Jensen
Simon Doclo
Jan Østergaard
21
0
0
30 Apr 2024
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention
Ruijie Tao
Xinyuan Qian
Yidi Jiang
Junjie Li
Jiadong Wang
Haizhou Li
34
1
0
29 Apr 2024
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Tsubasa Ochiai
Kazuma Iwamoto
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
Shoko Araki
Shigeru Katagiri
29
2
0
23 Apr 2024
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation
Ye Bai
Chenxing Li
Hao Li
Yuanyuan Zhao
Xiaorui Wang
24
0
0
17 Apr 2024
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
38
19
0
15 Apr 2024
What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions
Hanyu Meng
V. Sethu
E. Ambikairajah
35
2
0
10 Apr 2024
Gull: A Generative Multifunctional Audio Codec
Yi Luo
Jianwei Yu
Hangting Chen
Rongzhi Gu
Chao Weng
AuLLM
41
3
0
07 Apr 2024
SPMamba: State-space model is all you need in speech separation
Kai Li
Guo Chen
Mamba
50
26
0
02 Apr 2024
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz
Michele Santacatterina
Ramin Zabih
44
31
0
29 Mar 2024
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation
Xilin Jiang
Cong Han
N. Mesgarani
Mamba
39
41
0
27 Mar 2024
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy
Wenxuan Wu
Xueyuan Chen
Xixin Wu
Haizhou Li
Helen M. Meng
34
1
0
24 Mar 2024
CATSE: A Context-Aware Framework for Causal Target Sound Extraction
Shrishail Baligar
M. Kegler
Bryce Irvin
Marko Stamenovic
Shawn Newsam
36
0
0
21 Mar 2024
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Changsheng Quan
Xiaofei Li
47
23
0
12 Mar 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
44
4
0
11 Mar 2024
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Qu Yang
Qianhui Liu
Nan Li
Meng Ge
Zeyang Song
Haizhou Li
32
5
0
09 Mar 2024
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation
Vahid Ahmadi Kalkhorani
DeLiang Wang
38
3
0
06 Mar 2024
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning
Kuan-Hsun Ho
J. Hung
Berlin Chen
42
0
0
04 Mar 2024
What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution
Kuan-Hsun Ho
J. Hung
Berlin Chen
26
1
0
04 Mar 2024
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
37
5
0
27 Feb 2024
SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques
Changjiang Zhao
Shulin He
Xueliang Zhang
21
7
0
22 Feb 2024
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
Shiqi Zhang
Zheng Qiu
Daiki Takeuchi
Noboru Harada
Shoji Makino
13
3
0
13 Feb 2024
Sound Source Separation Using Latent Variational Block-Wise Disentanglement
Karim Helwani
M. Togami
Paris Smaragdis
Michael M. Goodwin
BDL
DRL
23
1
0
08 Feb 2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
KELM
28
4
0
06 Feb 2024
Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Marvin Tammen
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
S. Araki
Simon Doclo
18
0
0
05 Feb 2024
Spiking Music: Audio Compression with Event Based Auto-encoders
Martim Lisboa
Guillaume Bellec
40
2
0
02 Feb 2024
An Analysis of the Variance of Diffusion-based Speech Enhancement
Bunlong Lay
Timo Gerkmann
DiffM
17
0
0
01 Feb 2024
Proactive Detection of Voice Cloning with Localized Watermarking
Robin San Roman
Pierre Fernandez
Alexandre Défossez
Teddy Furon
Tuan Tran
Hady ElSahar
49
41
0
30 Jan 2024
Spatial-Temporal Activity-Informed Diarization and Separation
Yicheng Hsu
Ssuhan Chen
Mingsian R. Bai
21
0
0
30 Jan 2024
Online speaker diarization of meetings guided by speech separation
Elio Gruttadauria
Mathieu Fontaine
S. Essid
17
4
0
30 Jan 2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings
He Zhao
Hangting Chen
Jianwei Yu
Yuehai Wang
51
0
0
29 Jan 2024
Phoneme-Based Proactive Anti-Eavesdropping with Controlled Recording Privilege
Peng Huang
Yao Wei
Peng Cheng
Zhongjie Ba
Liwang Lu
Feng Lin
Yang Wang
Kui Ren
26
0
0
28 Jan 2024
Improving Design of Input Condition Invariant Speech Enhancement
Wangyou Zhang
Jee-weon Jung
Shinji Watanabe
Yanmin Qian
AAML
26
3
0
25 Jan 2024
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
Samuel Pegg
Kai Li
Xiaolin Hu
32
1
0
25 Jan 2024
Resource-constrained stereo singing voice cancellation
Clara Borrelli
James Rae
Dogac Basaran
Matt McVicar
M. Souden
Matthias Mauch
28
0
0
22 Jan 2024
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
40
1
0
15 Jan 2024
Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
Kenichi Fujita
Hiroshi Sato
Takanori Ashihara
Hiroki Kanagawa
Marc Delcroix
Takafumi Moriya
Yusuke Ijima
36
8
0
10 Jan 2024
DDD: A Perceptually Superior Low-Response-Time DNN-based Declipper
Jayeon Yi
Junghyun Koo
Kyogu Lee
22
2
0
08 Jan 2024
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments
Renana Opochinsky
Mordehay Moradi
Sharon Gannot
15
4
0
07 Jan 2024
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing
Li Li
Shogo Seki
31
2
0
28 Dec 2023
Online Similarity-and-Independence-Aware Beamformer for Low-latency Target Sound Extraction
Atsuo Hiroe
29
0
0
27 Dec 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
Hao Wang
Trung Hieu Nguyen
Kun Zhou
J. Yip
Dianwen Ng
Bin Ma
13
21
0
19 Dec 2023
A Refining Underlying Information Framework for Monaural Speech Enhancement
Rui Cao
Tianrui Wang
Meng Ge
Longbiao Wang
Jianwu Dang
15
1
0
18 Dec 2023
3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications
Shulin He
Jinjiang Liu
Hao Li
Yang-Rui Yang
Fei Chen
Xueliang Zhang
22
1
0
18 Dec 2023
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios
Yuzhu Wang
A. Politis
Tuomas Virtanen
20
3
0
17 Dec 2023
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction
Zhaoxi Mu
Xinyu Yang
Sining Sun
Qing Yang
SSL
23
8
0
16 Dec 2023
A 1.6-mW Sparse Deep Learning Accelerator for Speech Separation
Chih-Chyau Yang
Tian-Sheuan Chang
26
0
0
15 Dec 2023
Previous
1
2
3
4
5
6
...
14
15
16
Next