Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 754 papers shown
Separate What You Describe: Language-Queried Audio Source Separation
Xubo Liu
Haohe Liu
Qiuqiang Kong
Xinhao Mei
Jinzheng Zhao
Qiushi Huang
Mark D. Plumbley
Wenwu Wang
42
58
0
28 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
27
3
0
25 Mar 2022
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Takaaki Saeki
Shinnosuke Takamichi
Tomohiko Nakamura
Naoko Tanji
Hiroshi Saruwatari
33
6
0
24 Mar 2022
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement
Jun Chen
Zehao Wang
Deyi Tuo
Zhiyong Wu
Shiyin Kang
Helen Meng
27
107
0
23 Mar 2022
Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Haoyu Li
Yun Liu
Junichi Yamagishi
13
2
0
22 Mar 2022
RoSS: Utilizing Robotic Rotation for Audio Source Separation
Hyungjoo Seo
Sahil Bhandary Karnoor
Romit Roy Choudhury
20
0
0
18 Mar 2022
A Squeeze-and-Excitation and Transformer based Cross-task System for Environmental Sound Recognition
Jisheng Bai
Jianfeng Chen
Mou Wang
Muhammad Saad Ayub
14
9
0
16 Mar 2022
MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Andong Li
C. Zheng
Ziyang Zhang
Xiaodong Li
24
3
0
14 Mar 2022
Improving the transferability of speech separation by meta-learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
35
1
0
11 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma
11
3
0
08 Mar 2022
Single microphone speaker extraction using unified time-frequency Siamese-Unet
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
30
3
0
06 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement
Hu Fang
Tal Peer
S. Wermter
Timo Gerkmann
31
6
0
04 Mar 2022
Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
Jun Xiong
Yu Zhou
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
33
20
0
04 Mar 2022
DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement
Guochen Yu
Yuansheng Guan
Weixin Meng
C. Zheng
Haibo Wang
24
2
0
01 Mar 2022
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Yen-Ju Lu
Samuele Cornell
Xuankai Chang
Wangyou Zhang
Chenda Li
Zhaoheng Ni
Zhong-Qiu Wang
Shinji Watanabe
19
28
0
24 Feb 2022
Benchmarking Generative Latent Variable Models for Speech
Jakob Drachmann Havtorn
Lasse Borgholt
Søren Hauberg
J. Frellsen
Lars Maaløe
26
3
0
22 Feb 2022
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment
E. Guizzo
Christian Marinoni
Marco Pennese
Xinlei Ren
Xiguang Zheng
Chen Zhang
Bruno Masiero
A. Uncini
Danilo Comminiello
14
52
0
21 Feb 2022
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
E. Chng
J. Dang
Haizhou Li
30
21
0
21 Feb 2022
Multi-Channel Speech Denoising for Machine Ears
Cong Han
Emine Merve Kaya
Kyle Hoefer
M. Slaney
S. Carlile
15
2
0
17 Feb 2022
On loss functions and evaluation metrics for music source separation
Enric Gusó
Jordi Pons
Santiago Pascual
Joan Serrà
14
19
0
16 Feb 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
34
35
0
16 Feb 2022
Speech Denoising in the Waveform Domain with Self-Attention
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
21
61
0
15 Feb 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
28
177
0
10 Feb 2022
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge
Jingguang Tian
Xinhui Hu
Xinkang Xu
24
9
0
10 Feb 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training
Ertuğ Karamatlı
S. Kırbız
SSL
36
9
0
08 Feb 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
...
Kong Aik Lee
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
18
28
0
08 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
40
23
0
06 Feb 2022
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Naijun Zheng
Na Li
Xixin Wu
Lingwei Meng
Jiawen Kang
Haibin Wu
Chao Weng
Dan Su
Helen Meng
25
10
0
04 Feb 2022
New Insights on Target Speaker Extraction
Mohamed Elminshawi
Wolfgang Mack
Srikanth Raj Chetupalli
Soumitro Chakrabarty
Emanuel Habets
19
18
0
01 Feb 2022
HGCN: Harmonic gated compensation network for speech enhancement
Tianrui Wang
Weibin Zhu
Yingying Gao
Junlan Feng
Shilei Zhang
33
22
0
30 Jan 2022
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Shinnosuke Takamichi
Wataru Nakata
Naoko Tanji
Hiroshi Saruwatari
AuLLM
30
6
0
26 Jan 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
32
25
0
26 Jan 2022
A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
BDL
DRL
24
4
0
24 Jan 2022
End-to-End Neural Speech Coding for Real-Time Communications
Xue Jiang
Xiulian Peng
Chengyu Zheng
Huaying Xue
Yuan Zhang
Yan Lu
29
27
0
24 Jan 2022
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR
Kazuma Iwamoto
Tsubasa Ochiai
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
S. Araki
S. Katagiri
30
57
0
18 Jan 2022
Fish sounds: towards the evaluation of marine acoustic biodiversity through data-driven audio source separation
Michele Mancusi
Nicola Zonca
Emanuele Rodolà
Silvia Zuffi
21
2
0
13 Jan 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Naoyuki Kamo
Takafumi Moriya
38
26
0
11 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
30
19
0
17 Dec 2021
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
23
25
0
11 Dec 2021
Hybrid Neural Networks for On-device Directional Hearing
Anran Wang
Maruchi Kim
Hao Zhang
Shyamnath Gollakota
16
15
0
11 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
17
9
0
10 Dec 2021
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Yicheng Hsu
Yonghan Lee
M. Bai
22
10
0
10 Dec 2021
Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Yi Li
Yang Sun
K. Horoshenkov
S. M. Naqvi
11
23
0
09 Dec 2021
Noise-robust blind reverberation time estimation using noise-aware time-frequency masking
Kaitong Zheng
C. Zheng
Jinqiu Sang
Yulong Zhang
Xiaodong Li
16
6
0
09 Dec 2021
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
29
14
0
07 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
49
47
0
04 Dec 2021
Environmental Sound Extraction Using Onomatopoeic Words
Yuki Okamoto
Shota Horiguchi
Masaaki Yamamoto
Keisuke Imoto
Y. Kawaguchi
24
9
0
01 Dec 2021
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
36
10
0
29 Nov 2021
Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
Zohra Cheddad
A. Cheddad
11
1
0
21 Nov 2021
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection
Samuele Cornell
T. Balestri
Thibaud Sénéchal
11
5
0
20 Nov 2021