Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.07454
Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
20 September 2018
Yi Luo
N. Mesgarani
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
50 / 753 papers shown
Title
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio
Yang Zhang
Krishna C. Puvvada
Vitaly Lavrukhin
Boris Ginsburg
35
14
0
09 Aug 2023
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yiitan Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
30
43
0
09 Aug 2023
Music De-limiter Networks via Sample-wise Gain Inversion
Chang-Bin Jeon
Kyogu Lee
16
1
0
02 Aug 2023
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu
Weiping Tu
Yuhong Yang
18
9
0
28 Jul 2023
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Yoshiki Masuyama
Xuankai Chang
Wangyou Zhang
Samuele Cornell
Zhongqiu Wang
Nobutaka Ono
Y. Qian
Shinji Watanabe
33
6
0
23 Jul 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
DiffM
16
9
0
16 Jul 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
37
10
0
06 Jul 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
21
1
0
05 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
11
7
0
28 Jun 2023
Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
23
1
0
19 Jun 2023
Visually-Guided Sound Source Separation with Audio-Visual Predictive Coding
Zengjie Song
Zhaoxiang Zhang
21
1
0
19 Jun 2023
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation
Yoshiaki Bando
Yoshiki Masuyama
Aditya Arie Nugraha
Kazuyoshi Yoshii
BDL
18
4
0
17 Jun 2023
Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement
Liang Wan
Hongqing Liu
Yi Zhou
Jie Ji
25
2
0
15 Jun 2023
Quantifying Spatial Audio Quality Impairment
Karn N. Watcharasupat
Alexander Lerch
20
1
0
13 Jun 2023
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Tomoya Yoshinaga
Keitaro Tanaka
Shigeo Morishima
30
0
0
10 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
22
1
0
09 Jun 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
21
4
0
09 Jun 2023
A Mask Free Neural Network for Monaural Speech Enhancement
Liangqi Liu
Haixing Guan
Jinlong Ma
Wei Dai
Guang-Yi Wang
Shaowei Ding
21
11
0
07 Jun 2023
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
J. Dang
Shiliang Zhang
28
9
0
05 Jun 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
21
1
0
01 Jun 2023
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Pin-Jui Ku
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin-Hui Lee
19
10
0
01 Jun 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
H. Martel
Julius Richter
Kai Li
Xiaolin Hu
Timo Gerkmann
VLM
22
9
0
31 May 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
32
10
0
31 May 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
47
9
0
29 May 2023
CAPTDURE: Captioned Sound Dataset of Single Sources
Yuki Okamoto
Kanta Shimonishi
Keisuke Imoto
Kota Dohi
Shota Horiguchi
Y. Kawaguchi
26
1
0
28 May 2023
A Neural State-Space Model Approach to Efficient Speech Separation
Chen Chen
Chao-Han Huck Yang
Kai Li
Yuchen Hu
Pin-Jui Ku
Chng Eng Siong
34
11
0
26 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
39
10
0
25 May 2023
Anomalous Sound Detection Based on Sound Separation
Kanta Shimonishi
Kota Dohi
Y. Kawaguchi
21
5
0
25 May 2023
TLNets: Transformation Learning Networks for long-range time-series prediction
Wen Wang
Yang Liu
Haoqin Sun
AI4TS
27
3
0
25 May 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
18
0
0
25 May 2023
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Aoi Ito
Shota Horiguchi
SSL
27
2
0
24 May 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Hiroshi Sato
Ryo Masumura
Tsubasa Ochiai
Marc Delcroix
Takafumi Moriya
...
Kentaro Shinayama
Saki Mizuno
Mana Ihori
Tomohiro Tanaka
Nobukatsu Hojo
29
5
0
24 May 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
51
4
0
23 May 2023
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting
Shubo Lv
Xiong Wang
Sining Sun
Long Ma
Linfu Xie
38
5
0
21 May 2023
Unsupervised Multi-channel Separation and Adaptation
Cong Han
K. Wilson
Scott Wisdom
J. Hershey
20
4
0
18 May 2023
Noise-Aware Speech Separation with Contrastive Learning
Zizheng Zhang
Cheng Chen
Hsin-Hung Chen
Xiang Liu
Yuchen Hu
E. Chng
28
5
0
18 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
30
14
0
18 May 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
27
0
0
18 May 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
Jie Zhang
Qingquan Xu
Qiu-shi Zhu
Zhenhua Ling
21
11
0
17 May 2023
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
Yonghong Yan
27
0
0
15 May 2023
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
K. Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
18
17
0
11 May 2023
Diffusion-based Signal Refiner for Speech Separation
M. Hirano
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
DiffM
35
4
0
10 May 2023
AudioSlots: A slot-centric generative model for audio separation
P. Reddy
Scott Wisdom
Klaus Greff
J. Hershey
Thomas Kipf
OCL
VLM
20
6
0
09 May 2023
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Rongjie Huang
Mingze Li
Dongchao Yang
Jiatong Shi
Xuankai Chang
...
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Zhou Zhao
Shinji Watanabe
LM&MA
AuLLM
42
198
0
25 Apr 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
17
10
0
18 Apr 2023
On Data Sampling Strategies for Training Neural Network Speech Separation Models
William Ravenscroft
Stefan Goetze
Thomas Hain
VLM
16
6
0
14 Apr 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
A. Brutti
S. Squartini
23
4
0
21 Mar 2023
The Intel Neuromorphic DNS Challenge
Jonathan Timcheck
S. Shrestha
D. B. Rubin
A. Kupryjanow
Garrick Orchard
Lukasz Pindor
Timothy M. Shea
Mike Davies
36
27
0
16 Mar 2023
Beamformer-Guided Target Speaker Extraction
Mohamed Elminshawi
Srikanth Raj Chetupalli
Emanuel Habets
19
7
0
15 Mar 2023
Target Sound Extraction with Variable Cross-modality Clues
Chenda Li
Yao Qian
Zhuo Chen
Dongmei Wang
Takuya Yoshioka
Shujie Liu
Y. Qian
Michael Zeng
VLM
29
13
0
15 Mar 2023
Previous
1
2
3
4
5
6
...
14
15
16
Next