ResearchTrend.AI
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 754 papers shown
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
14
5
0
14 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Zehao Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
17
16
0
10 Nov 2022
Speech separation with large-scale self-supervised learning
Zhuo Chen
Naoyuki Kanda
Jian Wu
Yu-Huan Wu
Xiaofei Wang
Takuya Yoshioka
Jinyu Li
S. Sivasankaran
Sefik Emre Eskimez
19
14
0
09 Nov 2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement
Shucong Zhang
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
8
4
0
08 Nov 2022
Cold Diffusion for Speech Enhancement
Hao Yen
François Germain
G. Wichern
Jonathan Le Roux
DiffM
21
40
0
04 Nov 2022
Real-Time Target Sound Extraction
Bandhav Veluri
Justin Chan
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
36
30
0
04 Nov 2022
Iterative autoregression: a novel trick to improve your low-latency speech enhancement model
Pavel Andreev
Nicholas Babaev
Azat Saginbaev
Ivan Shchekotov
Aibek Alanov
24
4
0
03 Nov 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
T. Toda
32
4
0
02 Nov 2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting
Zexu Pan
Wupeng Wang
Marvin Borsdorf
Haizhou Li
14
10
0
31 Oct 2022
Denoising neural networks for magnetic resonance spectroscopy
Natalie Klein
Amber J. Day
Harris Mason
M. Malone
Sinead Williamson
24
1
0
31 Oct 2022
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
29
41
0
31 Oct 2022
Magnitude or Phase? A Two Stage Algorithm for Dereverberation
Ayal Schwartz
Sharon Gannot
Shlomo E. Chazan
11
0
0
31 Oct 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
112
17
0
30 Oct 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
21
6
0
28 Oct 2022
Hierarchical speaker representation for target speaker extraction
Shulin He
Huaiwen Zhang
Wei Rao
Kanghao Zhang
Yukai Ju
Yang-Rui Yang
Xueliang Zhang
34
3
0
28 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
21
6
0
28 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
28
2
0
27 Oct 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
34
15
0
27 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft
Stefan Goetze
Thomas Hain
30
11
0
27 Oct 2022
Parallel Gated Neural Network With Attention Mechanism For Speech Enhancement
Jia Cui
S. Bleeck
16
0
0
26 Oct 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
30
599
0
24 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
28
14
0
24 Oct 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serrà
44
9
0
23 Oct 2022
Neural Sound Field Decomposition with Super-resolution of Sound Direction
Qiuqiang Kong
Shilei Liu
Junjie Shi
Xuzhou Ye
Yin Cao
Qiaoxi Zhu
Yong-mei Xu
Yuxuan Wang
17
0
0
22 Oct 2022
Adversarial Permutation Invariant Training for Universal Sound Separation
Emilian Postolache
Jordi Pons
Santiago Pascual
Joan Serrà
VLM
28
6
0
21 Oct 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation
Martin Strauss
Matteo Torcoli
B. Edler
21
4
0
21 Oct 2022
Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement
Shubo Lv
Yihui Fu
Yukai Jv
Linfu Xie
Weixin Zhu
Wei Rao
Yannan Wang
19
8
0
17 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
32
1
0
12 Oct 2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Junjie Li
Meng Ge
Zexu Pan
Longbiao Wang
J. Dang
18
10
0
09 Oct 2022
Music Source Separation with Band-split RNN
Yi Luo
Jianwei Yu
57
107
0
30 Sep 2022
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
39
4
0
28 Sep 2022
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations
Xucheng Wan
Kai Liu
Z.C. Du
Huan Zhou
16
0
0
24 Sep 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
29
61
0
22 Sep 2022
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement
Jianrong Wang
Xiaomin Li
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
36
0
0
15 Sep 2022
Streaming Target-Speaker ASR with Neural Transducer
Takafumi Moriya
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
T. Shinozaki
28
21
0
09 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
74
97
0
08 Sep 2022
Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
K. Chen
Hao-Wen Dong
Yi Luo
Julian McAuley
Taylor Berg-Kirkpatrick
M. Puckette
Shlomo Dubnov
41
5
0
07 Sep 2022
Automatic music mixing with deep learning and out-of-domain data
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Giorgio Fabbro
Stefan Uhlich
Chihiro Nagashima
Yuki Mitsufuji
29
25
0
24 Aug 2022
Exploiting Temporal Structures of Cyclostationary Signals for Data-Driven Single-Channel Source Separation
Gary C. F. Lee
Amir Weiss
A. Lancho
Jennifer Tang
Yuheng Bu
Yury Polyanskiy
G. Wornell
29
6
0
22 Aug 2022
Analysis of impact of emotions on target speech extraction and speech separation
Jan Švec
Kateřina Žmolíková
M. Kocour
Marc Delcroix
Tsubasa Ochiai
Ladislav Mošner
Jan "Honza" Černocký
25
4
0
15 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
16
180
0
11 Aug 2022
Conv-NILM-Net, a causal and multi-appliance model for energy source separation
Mohamed Alami Chehboune
Jérémie Decock
Rim Kaddah
Jesse Read
19
1
0
03 Aug 2022
Spatial Aware Multi-Task Learning Based Speech Separation
Wei Sun
Mei Wang
L. Qiu
19
3
0
20 Jul 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Yen-Ju Lu
Xuankai Chang
Chenda Li
Wangyou Zhang
Samuele Cornell
...
Robin Scheibler
Zhong-Qiu Wang
Yu Tsao
Y. Qian
Shinji Watanabe
VLM
19
28
0
19 Jul 2022
PodcastMix: A dataset for separating music and speech in podcasts
Nico M. Schmidt
Jordi Pons
M. Miron
25
2
0
15 Jul 2022
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Nabarun Goswami
Tatsuya Harada
20
5
0
13 Jul 2022
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction
Zhongweiyang Xu
Xulin Fan
M. Hasegawa-Johnson
19
2
0
09 Jul 2022
Learning to Separate Voices by Spatial Regions
Alan Xu
Romit Roy Choudhury
39
10
0
09 Jul 2022
Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain
Dejan Marković
Alexandre Défossez
Alexander Richard
18
16
0
30 Jun 2022
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
Ahmad Aloradi
Wolfgang Mack
Mohamed Elminshawi
Emanuel Habets
32
5
0
28 Jun 2022