1809.07454
Cited By
v1
v2
v3 (latest)
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 773 papers shown
Title
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Bengt J. Borgström
M. Brandstein
62
2
0
21 Sep 2023
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Shafique Ahmed
Chia-Wei Chen
Wenze Ren
Chin-Jou Li
Ernie Chu
Jun-Cheng Chen
Amir Hussain
H. Wang
Yu Tsao
Jen-Cheng Hou
57
3
0
20 Sep 2023
Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay
Jean-Marie Lemercier
Julius Richter
Timo Gerkmann
DiffM
61
10
0
18 Sep 2023
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
42
0
0
18 Sep 2023
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
76
6
0
15 Sep 2023
DiaCorrect: Error Correction Back-end For Speaker Diarization
Jiangyu Han
Federico Landini
Johan Rohdin
Mireia Díez
Lukáš Burget
Yuhang Cao
Heng Lu
J. Černocký
66
3
0
15 Sep 2023
Two-Step Knowledge Distillation for Tiny Speech Enhancement
Rayan Daod Nathoo
M. Kegler
Marko Stamenovic
59
6
0
15 Sep 2023
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Peter Vieting
Simon Berger
Thilo von Neumann
Christoph Boeddeker
Ralf Schlüter
Reinhold Haeb-Umbach
105
0
0
15 Sep 2023
Complexity Scaling for Speech Denoising
Hangting Chen
Jianwei Yu
Chao Weng
54
2
0
14 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
J. Yip
Dianwen Ng
Bin Ma
Chng Eng Siong
52
0
0
14 Sep 2023
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Qinghua Liu
Meng Ge
Zhizheng Wu
Haizhou Li
73
1
0
13 Sep 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
66
14
0
12 Sep 2023
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
49
7
0
12 Sep 2023
Music Source Separation with Band-Split RoPE Transformer
Wei-Tsung Lu
Ju-Chiang Wang
Qiuqiang Kong
Yun-Ning Hung
102
25
0
05 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
67
6
0
05 Sep 2023
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
41
3
0
04 Sep 2023
Remixing-based Unsupervised Source Separation from Scratch
Kohei Saijo
Tetsuji Ogawa
50
3
0
01 Sep 2023
Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders
Matthew B. Webster
Joonnyong Lee
59
1
0
31 Aug 2023
Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users
P. U. Diehl
Hannes Zilly
Felix Sattler
Y. Singer
Kevin Kepp
...
Paul Meyer-Rachner
A. Pudszuhn
V. Hofmann
M. Vormann
Elias Sprengel
64
3
0
22 Aug 2023
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Hangting Chen
Jianwei Yu
Yi Luo
Rongzhi Gu
Weihua Li
Zhuocheng Lu
Chao Weng
75
7
0
21 Aug 2023
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Hakan Erdogan
Scott Wisdom
Xuankai Chang
Zalan Borsos
Marco Tagliasacchi
Neil Zeghidour
J. Hershey
69
11
0
21 Aug 2023
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation
Kai Li
Run Yang
Fuchun Sun
Xiaolin Hu
90
8
0
16 Aug 2023
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio
Yang Zhang
Krishna C. Puvvada
Vitaly Lavrukhin
Boris Ginsburg
74
16
0
09 Aug 2023
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yi Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
105
52
0
09 Aug 2023
Music De-limiter Networks via Sample-wise Gain Inversion
Chang-Bin Jeon
Kyogu Lee
55
1
0
02 Aug 2023
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu
Weiping Tu
Yuhong Yang
57
9
0
28 Jul 2023
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Yoshiki Masuyama
Xuankai Chang
Wangyou Zhang
Samuele Cornell
Zhongqiu Wang
Nobutaka Ono
Y. Qian
Shinji Watanabe
78
6
0
23 Jul 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
93
13
0
16 Jul 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
60
12
0
06 Jul 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
129
1
0
05 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
67
9
0
28 Jun 2023
Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
48
2
0
19 Jun 2023
Visually-Guided Sound Source Separation with Audio-Visual Predictive Coding
Zengjie Song
Zhaoxiang Zhang
50
1
0
19 Jun 2023
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation
Yoshiaki Bando
Yoshiki Masuyama
Aditya Arie Nugraha
Kazuyoshi Yoshii
BDL
57
4
0
17 Jun 2023
Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement
Liang Wan
Hongqing Liu
Yi Zhou
Jie Ji
61
2
0
15 Jun 2023
Quantifying Spatial Audio Quality Impairment
Karn N. Watcharasupat
Alexander Lerch
60
2
0
13 Jun 2023
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Tomoya Yoshinaga
Keitaro Tanaka
Shigeo Morishima
63
0
0
10 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
49
1
0
09 Jun 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
65
5
0
09 Jun 2023
A Mask Free Neural Network for Monaural Speech Enhancement
Liangqi Liu
Haixing Guan
Jinlong Ma
Wei Dai
Guang-Yi Wang
Shaowei Ding
75
12
0
07 Jun 2023
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
Jianwu Dang
Shiliang Zhang
72
9
0
05 Jun 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
81
2
0
01 Jun 2023
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Pin-Jui Ku
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin-Hui Lee
89
13
0
01 Jun 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
H. Martel
Julius Richter
Kai Li
Xiaolin Hu
Timo Gerkmann
VLM
57
10
0
31 May 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
77
11
0
31 May 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
83
9
0
29 May 2023
CAPTDURE: Captioned Sound Dataset of Single Sources
Yuki Okamoto
Kanta Shimonishi
Keisuke Imoto
Kota Dohi
Shota Horiguchi
Yohei Kawaguchi
52
1
0
28 May 2023
A Neural State-Space Model Approach to Efficient Speech Separation
Chen Chen
Chao-Han Huck Yang
Kai Li
Yuchen Hu
Pin-Jui Ku
Chng Eng Siong
70
11
0
26 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
72
10
0
25 May 2023
Anomalous Sound Detection Based on Sound Separation
Kanta Shimonishi
Kota Dohi
Yohei Kawaguchi
61
5
0
25 May 2023