ResearchTrend.AI
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
Title
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems
Philippe Gonzalez
T. S. Alstrøm
Tobias May
74
9
0
25 Jan 2023
Latent Autoregressive Source Separation
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL, DRL
63
10
0
09 Jan 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
93
12
0
21 Dec 2022
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
91
16
0
21 Dec 2022
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
88
25
0
16 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
72
29
0
15 Dec 2022
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
63
10
0
14 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
106
0
0
14 Dec 2022
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj
Daniel Povey
Sanjeev Khudanpur
69
40
0
10 Dec 2022
Hyperbolic Audio Source Separation
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
73
10
0
09 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
119
22
0
01 Dec 2022
A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem
Andong Li
Guochen Yu
C. Zheng
Wenzhe Liu
Xiaodong Li
91
12
0
30 Nov 2022
Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Vinay Kothapally
Yong-mei Xu
Meng Yu
Shizhong Zhang
Dong Yu
65
12
0
22 Nov 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
102
138
0
22 Nov 2022
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Rodrigo Mira
Buye Xu
Jacob Donley
Anurag Kumar
Stavros Petridis
V. Ithapu
Maja Pantic
55
13
0
20 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
62
6
0
16 Nov 2022
Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Yicheng Hsu
Yonghan Lee
M. Bai
45
3
0
16 Nov 2022
Hybrid Transformers for Music Source Separation
Simon Rouard
Francisco Massa
Alexandre Défossez
78
147
0
15 Nov 2022
Reverberation as Supervision for Speech Separation
R. Aralikatti
Christoph Boeddeker
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
65
7
0
15 Nov 2022
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding
Umberto Cappellazzo
Daniele Falavigna
Alessio Brutti
CLL
61
2
0
15 Nov 2022
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Anastasia Kuznetsova
Aswin Sivaraman
Minje Kim
54
3
0
14 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
54
5
0
14 Nov 2022
Speech Enhancement with Fullband-Subband Cross-Attention Network
Jun Chen
Wei Rao
Zehao Wang
Zhiyong Wu
Yannan Wang
Tao Yu
Shidong Shang
Helen M. Meng
49
16
0
10 Nov 2022
Speech separation with large-scale self-supervised learning
Zhuo Chen
Naoyuki Kanda
Jian Wu
Yu-Huan Wu
Xiaofei Wang
Takuya Yoshioka
Jinyu Li
S. Sivasankaran
Sefik Emre Eskimez
81
15
0
09 Nov 2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement
Shucong Zhang
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
57
5
0
08 Nov 2022
Cold Diffusion for Speech Enhancement
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
96
45
0
04 Nov 2022
Real-Time Target Sound Extraction
Bandhav Veluri
Justin Chan
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
112
33
0
04 Nov 2022
Iterative autoregression: a novel trick to improve your low-latency speech enhancement model
Pavel Andreev
Nicholas Babaev
Azat Saginbaev
Ivan Shchekotov
Aibek Alanov
75
5
0
03 Nov 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
Tomoki Toda
62
6
0
02 Nov 2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting
Zexu Pan
Wupeng Wang
Marvin Borsdorf
Haizhou Li
85
12
0
31 Oct 2022
Denoising neural networks for magnetic resonance spectroscopy
Natalie Klein
Amber J. Day
Harris Mason
M. Malone
Sinead Williamson
59
1
0
31 Oct 2022
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
123
48
0
31 Oct 2022
Magnitude or Phase? A Two Stage Algorithm for Dereverberation
Ayal Schwartz
Sharon Gannot
Shlomo E. Chazan
68
0
0
31 Oct 2022
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu
Mengfan Fu
Yinfeng Yu
Lili Yin
Gang Hua
Hao-Ming Huang
DiffM
145
19
0
30 Oct 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
46
6
0
28 Oct 2022
Hierarchical speaker representation for target speaker extraction
Shulin He
Huaiwen Zhang
Wei Rao
Kanghao Zhang
Yukai Ju
Yang-Rui Yang
Xueliang Zhang
60
7
0
28 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
50
6
0
28 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
54
2
0
27 Oct 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
89
17
0
27 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft
Stefan Goetze
Thomas Hain
84
11
0
27 Oct 2022
Parallel Gated Neural Network With Attention Mechanism For Speech Enhancement
Jia Cui
S. Bleeck
39
0
0
26 Oct 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
126
674
0
24 Oct 2022
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin
Zhiyuan Zhao
Chuanxin Tang
Zhiwei Xiong
Chong Luo
85
17
0
24 Oct 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serrà
76
9
0
23 Oct 2022
Neural Sound Field Decomposition with Super-resolution of Sound Direction
Qiuqiang Kong
Shilei Liu
Junjie Shi
Xuzhou Ye
Yin Cao
Qiaoxi Zhu
Yong-mei Xu
Yuxuan Wang
48
0
0
22 Oct 2022
Adversarial Permutation Invariant Training for Universal Sound Separation
Emilian Postolache
Jordi Pons
Santiago Pascual
Joan Serrà
VLM
65
7
0
21 Oct 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation
Martin Strauss
Matteo Torcoli
B. Edler
44
5
0
21 Oct 2022
Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement
Shubo Lv
Yihui Fu
Yukai Jv
Linfu Xie
Weixin Zhu
Wei Rao
Yannan Wang
51
10
0
17 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
59
1
0
12 Oct 2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Junjie Li
Meng Ge
Zexu Pan
Longbiao Wang
Jianwu Dang
55
10
0
09 Oct 2022