ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07454
  4. Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
v1v2v3 (latest)

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani
ArXiv (abs)PDFHTML

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
Title
TLNets: Transformation Learning Networks for long-range time-series
  prediction
TLNets: Transformation Learning Networks for long-range time-series prediction
Wen Wang
Yang Liu
Haoqin Sun
AI4TS
72
3
0
25 May 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic
  Dataset with Ground Truths for Speech Separation
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
41
0
0
25 May 2023
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Aoi Ito
Shota Horiguchi
SSL
51
3
0
24 May 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised
  Representation Loss
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Hiroshi Sato
Ryo Masumura
Tsubasa Ochiai
Marc Delcroix
Takafumi Moriya
...
Kentaro Shinayama
Saki Mizuno
Mana Ihori
Tomohiro Tanaka
Nobukatsu Hojo
79
5
0
24 May 2023
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu
Mengfan Fu
Gang Hua
G. Altenbek
Hao Huang
DiffM
82
5
0
23 May 2023
DCCRN-KWS: an audio bias based model for noise robust small-footprint
  keyword spotting
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting
Shubo Lv
Xiong Wang
Sining Sun
Long Ma
Linfu Xie
81
5
0
21 May 2023
Unsupervised Multi-channel Separation and Adaptation
Unsupervised Multi-channel Separation and Adaptation
Cong Han
K. Wilson
Scott Wisdom
J. Hershey
78
4
0
18 May 2023
Noise-Aware Speech Separation with Contrastive Learning
Noise-Aware Speech Separation with Contrastive Learning
Zizheng Zhang
Cheng Chen
Hsin-Hung Chen
Xiang Liu
Yuchen Hu
Eng Siong Chng
75
5
0
18 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive
  Decoders
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
94
16
0
18 May 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
83
0
0
18 May 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with
  Convolutional Cross Attention in Multi-talker Conditions
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
Jie Zhang
Qingquan Xu
Qiu-shi Zhu
Zhenhua Ling
70
12
0
17 May 2023
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech
  Enhancement
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
Yonghong Yan
56
1
0
15 May 2023
Universal Source Separation with Weakly Labelled Data
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
Kai Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
82
22
0
11 May 2023
Diffusion-based Signal Refiner for Speech Separation
Diffusion-based Signal Refiner for Speech Separation
M. Hirano
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
DiffM
93
8
0
10 May 2023
AudioSlots: A slot-centric generative model for audio separation
AudioSlots: A slot-centric generative model for audio separation
P. Reddy
Scott Wisdom
Klaus Greff
J. Hershey
Thomas Kipf
OCLVLM
86
6
0
09 May 2023
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking
  Head
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Rongjie Huang
Mingze Li
Dongchao Yang
Jiatong Shi
Xuankai Chang
...
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Zhou Zhao
Shinji Watanabe
LM&MAAuLLM
104
228
0
25 Apr 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and
  Complexity via Integrated Full- and Sub-Band Modeling
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
52
12
0
18 Apr 2023
On Data Sampling Strategies for Training Neural Network Speech
  Separation Models
On Data Sampling Strategies for Training Neural Network Speech Separation Models
William Ravenscroft
Stefan Goetze
Thomas Hain
VLM
45
6
0
14 Apr 2023
End-to-End Integration of Speech Separation and Voice Activity Detection
  for Low-Latency Diarization of Telephone Conversations
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
73
5
0
21 Mar 2023
The Intel Neuromorphic DNS Challenge
The Intel Neuromorphic DNS Challenge
Jonathan Timcheck
S. Shrestha
D. B. Rubin
A. Kupryjanow
Garrick Orchard
Lukasz Pindor
Timothy M. Shea
Mike Davies
65
28
0
16 Mar 2023
Beamformer-Guided Target Speaker Extraction
Beamformer-Guided Target Speaker Extraction
Mohamed Elminshawi
Srikanth Raj Chetupalli
Emanuel Habets
62
7
0
15 Mar 2023
Target Sound Extraction with Variable Cross-modality Clues
Target Sound Extraction with Variable Cross-modality Clues
Chenda Li
Yao Qian
Zhuo Chen
Dongmei Wang
Takuya Yoshioka
Shujie Liu
Y. Qian
Michael Zeng
VLM
68
14
0
15 Mar 2023
Multi-Channel Masking with Learnable Filterbank for Sound Source
  Separation
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation
Wang Dai
Archontis Politis
Tuomas Virtanen
50
0
0
14 Mar 2023
Towards Real-Time Single-Channel Speech Separation in Noisy and
  Reverberant Environments
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments
Julian Neri
Sebastian Braun
64
1
0
14 Mar 2023
Guided Speech Enhancement Network
Guided Speech Enhancement Network
Yang Yang
Shao-fu Shih
Hakan Erdogan
J. Lin
C. Lee
Yunpeng Li
George Sung
Matthias Grundmann
70
6
0
13 Mar 2023
Online Binaural Speech Separation of Moving Speakers With a Wavesplit
  Network
Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network
Cong Han
N. Mesgarani
61
4
0
13 Mar 2023
A two-stage speaker extraction algorithm under adverse acoustic
  conditions using a single-microphone
A two-stage speaker extraction algorithm under adverse acoustic conditions using a single-microphone
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
84
2
0
13 Mar 2023
Improving the Intent Classification accuracy in Noisy Environment
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
Alessio Brutti
Daniele Falavigna
36
0
0
12 Mar 2023
On Neural Architectures for Deep Learning-based Source Separation of
  Co-Channel OFDM Signals
On Neural Architectures for Deep Learning-based Source Separation of Co-Channel OFDM Signals
Gary C. F. Lee
Amir Weiss
A. Lancho
Yury Polyanskiy
G. Wornell
AI4TS
63
6
0
11 Mar 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation
  Optimized by Discriminative Learning
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
71
5
0
07 Mar 2023
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and
  Reverberant Environments
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments
Zhaoxi Mu
Xinyu Yang
Xiangyuan Yang
Wenjing Zhu
40
5
0
07 Mar 2023
Scaling strategies for on-device low-complexity source separation with
  Conv-Tasnet
Scaling strategies for on-device low-complexity source separation with Conv-Tasnet
Mohamed Nabih Ali
Francesco Paissan
Daniele Falavigna
Alessio Brutti
48
2
0
06 Mar 2023
Hybrid Y-Net Architecture for Singing Voice Separation
Hybrid Y-Net Architecture for Singing Voice Separation
Rashen Fernando
Pamudu Ranasinghe
Udula Ranasinghe
J. Wijayakulasooriya
Pantaleon Perera
47
2
0
05 Mar 2023
Spectrogram Inversion for Audio Source Separation via Consistency,
  Mixing, and Magnitude Constraints
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints
P. Magron
Tuomas Virtanen
48
0
0
03 Mar 2023
Defending against Adversarial Audio via Diffusion Model
Defending against Adversarial Audio via Diffusion Model
Shutong Wu
Jiong Wang
Ming-Yu Liu
Weili Nie
Chaowei Xiao
DiffM
86
26
0
02 Mar 2023
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for
  Improved Dereverberation
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Jean-Marie Lemercier
Julian Tobergte
Timo Gerkmann
60
2
0
01 Mar 2023
Reducing the Prior Mismatch of Stochastic Differential Equations for
  Diffusion-based Speech Enhancement
Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement
Bunlong Lay
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
87
27
0
28 Feb 2023
3D Neural Beamforming for Multi-channel Speech Separation Against
  Location Uncertainty
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Rongzhi Gu
Shi-Xiong Zhang
Dong Yu
23
2
0
27 Feb 2023
DFSNet: A Steerable Neural Beamformer Invariant to Microphone Array
  Configuration for Real-Time, Low-Latency Speech Enhancement
DFSNet: A Steerable Neural Beamformer Invariant to Microphone Array Configuration for Real-Time, Low-Latency Speech Enhancement
A. Kovalyov
Kashyap Patel
Issa Panahi
68
4
0
26 Feb 2023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
97
21
0
23 Feb 2023
Unsupervised Noise adaptation using Data Simulation
Unsupervised Noise adaptation using Data Simulation
Chen Chen
Yuchen Hu
Heqing Zou
Linhui Sun
Chng Eng Siong
84
14
0
23 Feb 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation
  using Gated Single-Head Transformer with Convolution-Augmented Joint
  Self-Attentions
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
Shengkui Zhao
Bin Ma
110
56
0
23 Feb 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for
  End-to-End Noise-Robust Speech Separation
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Yuchen Hu
Chen Chen
Heqing Zou
Xionghu Zhong
Chng Eng Siong
114
16
0
22 Feb 2023
DasFormer: Deep Alternating Spectrogram Transformer for
  Multi/Single-Channel Speech Separation
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Shuo Wang
Xiangyu Kong
Xiulian Peng
H. Movassagh
Vinod Prakash
Yan Lu
55
12
0
21 Feb 2023
A Sidecar Separator Can Convert a Single-Talker Speech Recognition
  System to a Multi-Talker One
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One
Lingwei Meng
Jiawen Kang
Mingyu Cui
Yuejiao Wang
Xixin Wu
Helen M. Meng
76
17
0
20 Feb 2023
Speech Enhancement with Multi-granularity Vector Quantization
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
62
0
0
16 Feb 2023
Local spectral attention for full-band speech enhancement
Local spectral attention for full-band speech enhancement
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
82
0
0
11 Feb 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and
  Separation
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
143
43
0
04 Feb 2023
Neural Target Speech Extraction: An Overview
Neural Target Speech Extraction: An Overview
Kateřina Žmolíková
Marc Delcroix
Tsubasa Ochiai
K. Kinoshita
JanHonza'' vCernocký
Dong Yu
70
95
0
31 Jan 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving
  Source Separation
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
73
16
0
25 Jan 2023
Previous
123...567...141516
Next