Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.07454
Cited By
v1
v2
v3 (latest)
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 773 papers shown
Title
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems
Philippe Gonzalez
T. S. Alstrøm
Tobias May
74
9
0
25 Jan 2023
Latent Autoregressive Source Separation
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL
DRL
63
10
0
09 Jan 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
93
12
0
21 Dec 2022
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
91
16
0
21 Dec 2022
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
88
25
0
16 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
72
29
0
15 Dec 2022
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
63
10
0
14 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
106
0
0
14 Dec 2022
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj
Daniel Povey
Sanjeev Khudanpur
69
40
0
10 Dec 2022
Hyperbolic Audio Source Separation
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
73
10
0
09 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
119
22
0
01 Dec 2022
A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem
Andong Li
Guochen Yu
C. Zheng
Wenzhe Liu
Xiaodong Li
91
12
0
30 Nov 2022
Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Vinay Kothapally
Yong-mei Xu
Meng Yu
Shizhong Zhang
Dong Yu
65
12
0
22 Nov 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
102
138
0
22 Nov 2022
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Rodrigo Mira
Buye Xu
Jacob Donley
Anurag Kumar
Stavros Petridis
V. Ithapu
Maja Pantic
55
13
0
20 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
62
6
0
16 Nov 2022
Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Yicheng Hsu
Yonghan Lee
M. Bai
45
3
0
16 Nov 2022
Hybrid Transformers for Music Source Separation
Simon Rouard
Francisco Massa
Alexandre Défossez
78
147
0
15 Nov 2022
Reverberation as Supervision for Speech Separation
R. Aralikatti
Christoph Boeddeker
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
65
7
0
15 Nov 2022
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding
Umberto Cappellazzo
Daniele Falavigna
Alessio Brutti
CLL
61
2
0
15 Nov 2022
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Anastasia Kuznetsova
Aswin Sivaraman
Minje Kim
54
3
0
14 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
54
5
0
14 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Zehao Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
49
16
0
10 Nov 2022
Speech separation with large-scale self-supervised learning
Zhuo Chen
Naoyuki Kanda
Jian Wu
Yu-Huan Wu
Xiaofei Wang
Takuya Yoshioka
Jinyu Li
S. Sivasankaran
Sefik Emre Eskimez
81
15
0
09 Nov 2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement
Shucong Zhang
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
57
5
0
08 Nov 2022
Cold Diffusion for Speech Enhancement
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
96
45
0
04 Nov 2022
Real-Time Target Sound Extraction
Bandhav Veluri
Justin Chan
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
112
33
0
04 Nov 2022
Iterative autoregression: a novel trick to improve your low-latency speech enhancement model
Pavel Andreev
Nicholas Babaev
Azat Saginbaev
Ivan Shchekotov
Aibek Alanov
75
5
0
03 Nov 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
Tomoki Toda
62
6
0
02 Nov 2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting
Zexu Pan
Wupeng Wang
Marvin Borsdorf
Haizhou Li
85
12
0
31 Oct 2022
Denoising neural networks for magnetic resonance spectroscopy
Natalie Klein
Amber J. Day
Harris Mason
M. Malone
Sinead Williamson
59
1
0
31 Oct 2022
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
123
48
0
31 Oct 2022
Magnitude or Phase? A Two Stage Algorithm for Dereverberation
Ayal Schwartz
Sharon Gannot
Shlomo E. Chazan
68
0
0
31 Oct 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
145
19
0
30 Oct 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
46
6
0
28 Oct 2022
Hierarchical speaker representation for target speaker extraction
Shulin He
Huaiwen Zhang
Wei Rao
Kanghao Zhang
Yukai Ju
Yang-Rui Yang
Xueliang Zhang
60
7
0
28 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
50
6
0
28 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
54
2
0
27 Oct 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
89
17
0
27 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft
Stefan Goetze
Thomas Hain
84
11
0
27 Oct 2022
Parallel Gated Neural Network With Attention Mechanism For Speech Enhancement
Jia Cui
S. Bleeck
39
0
0
26 Oct 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
126
674
0
24 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
85
17
0
24 Oct 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serrà
76
9
0
23 Oct 2022
Neural Sound Field Decomposition with Super-resolution of Sound Direction
Qiuqiang Kong
Shilei Liu
Junjie Shi
Xuzhou Ye
Yin Cao
Qiaoxi Zhu
Yong-mei Xu
Yuxuan Wang
48
0
0
22 Oct 2022
Adversarial Permutation Invariant Training for Universal Sound Separation
Emilian Postolache
Jordi Pons
Santiago Pascual
Joan Serrà
VLM
65
7
0
21 Oct 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation
Martin Strauss
Matteo Torcoli
B. Edler
44
5
0
21 Oct 2022
spatial-dccrn: dccrn equipped with frame-level angle feature and hybrid filtering for multi-channel speech enhancement
Shubo Lv
Yihui Fu
Yukai Jv
Linfu Xie
Weixin Zhu
Wei Rao
Yannan Wang
51
10
0
17 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
59
1
0
12 Oct 2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Junjie Li
Meng Ge
Zexu Pan
Longbiao Wang
Jianwu Dang
55
10
0
09 Oct 2022
Previous
1
2
3
...
6
7
8
...
14
15
16
Next