Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
HGCN: Harmonic gated compensation network for speech enhancement
Tianrui Wang
Weibin Zhu
Yingying Gao
Junlan Feng
Shilei Zhang
57
23
0
30 Jan 2022
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Shinnosuke Takamichi
Wataru Nakata
Naoko Tanji
Hiroshi Saruwatari
AuLLM
70
7
0
26 Jan 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
86
27
0
26 Jan 2022
A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
BDLDRL
51
4
0
24 Jan 2022
End-to-End Neural Speech Coding for Real-Time Communications
Xue Jiang
Xiulian Peng
Chengyu Zheng
Huaying Xue
Yuan Zhang
Yan Lu
92
30
0
24 Jan 2022
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR
Kazuma Iwamoto
Tsubasa Ochiai
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
S. Araki
S. Katagiri
88
62
0
18 Jan 2022
Fish sounds: towards the evaluation of marine acoustic biodiversity through data-driven audio source separation
Michele Mancusi
Nicola Zonca
Emanuele Rodolà
Silvia Zuffi
40
2
0
13 Jan 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Naoyuki Kamo
Takafumi Moriya
70
27
0
11 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
105
19
0
17 Dec 2021
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
58
29
0
11 Dec 2021
Hybrid Neural Networks for On-device Directional Hearing
Anran Wang
Maruchi Kim
Hao Zhang
Shyamnath Gollakota
53
16
0
11 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
67
9
0
10 Dec 2021
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Yicheng Hsu
Yonghan Lee
M. Bai
52
10
0
10 Dec 2021
Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Yi Li
Yang Sun
K. Horoshenkov
S. M. Naqvi
48
24
0
09 Dec 2021
Noise-robust blind reverberation time estimation using noise-aware time-frequency masking
Kaitong Zheng
C. Zheng
Jinqiu Sang
Yulong Zhang
Xiaodong Li
62
6
0
09 Dec 2021
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
81
14
0
07 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
91
51
0
04 Dec 2021
Environmental Sound Extraction Using Onomatopoeic Words
Yuki Okamoto
Shota Horiguchi
Masaaki Yamamoto
Keisuke Imoto
Yohei Kawaguchi
69
9
0
01 Dec 2021
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
48
10
0
29 Nov 2021
Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
Zohra Cheddad
A. Cheddad
25
1
0
21 Nov 2021
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection
Samuele Cornell
T. Balestri
Thibaud Sénéchal
53
5
0
20 Nov 2021
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
Naoyuki Kamo
S. Araki
69
8
0
20 Nov 2021
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Sunwoo Kim
Minje Kim
113
5
0
17 Nov 2021
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
V. Trinh
Sebastian Braun
55
19
0
16 Nov 2021
S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement
Shubo Lv
Yihui Fu
Mengtao Xing
Jiayao Sun
Lei Xie
Jun Huang
Yannan Wang
Tao Yu
100
54
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
58
31
0
15 Nov 2021
Time-Frequency Attention for Monaural Speech Enhancement
Qiquan Zhang
Qi Song
Zhaoheng Ni
Aaron Nicolson
Haizhou Li
55
29
0
15 Nov 2021
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Ladislav Mošner
Oldrich Plchot
L. Burget
J. Černocký
58
7
0
11 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
90
50
0
11 Nov 2021
Joint Neural AEC and Beamforming with Double-Talk Detection
Vinay Kothapally
Yong-mei Xu
Meng Yu
Shizhong Zhang
Dong Yu
51
5
0
09 Nov 2021
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
54
7
0
08 Nov 2021
Inter-channel Conv-TasNet for multichannel speech enhancement
Dongheon Lee
Seongrae Kim
Jung-Woo Choi
45
12
0
08 Nov 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
71
6
0
07 Nov 2021
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
104
174
0
05 Nov 2021
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification
J. Málek
Jakub Janský
Zbyněk Koldovský
Tomás Kounovský
Jaroslav Cmejla
J. Zdánský
50
10
0
05 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
58
10
0
02 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
75
15
0
01 Nov 2021
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Jiasong Wu
Qingchun Li
Guanyu Yang
Lei Li
L. Senhadji
H. Shu
50
10
0
30 Oct 2021
Personalized breath based biometric authentication with wearable multimodality
Manh-Ha Bui
Viet-Anh Tran
Cuong Pham
36
10
0
29 Oct 2021
TorchAudio: Building Blocks for Audio and Speech Processing
Yao-Yuan Yang
Moto Hira
Zhaoheng Ni
Anjali Chourdia
Artyom Astafurov
...
Mehrzad Samadi
Shinji Watanabe
Soumith Chintala
Vincent Quenneville-Bélair
Yangyang Shi
106
169
0
28 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
80
8
0
28 Oct 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Y. Qian
93
24
0
27 Oct 2021
REAL-M: Towards Speech Separation on Real Mixtures
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
63
18
0
20 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
91
45
0
20 Oct 2021
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
49
22
0
20 Oct 2021
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
105
0
0
20 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
64
38
0
19 Oct 2021
NN3A: Neural Network supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications
Ziteng Wang
Yueyue Na
Biao Tian
Q. Fu
54
11
0
16 Oct 2021
Toward Degradation-Robust Voice Conversion
Chien-yu Huang
Kai-Wei Chang
Hung-yi Lee
85
9
0
14 Oct 2021
Music Source Separation with Deep Equilibrium Models
Yuichiro Koyama
Naoki Murata
Stefan Uhlich
Giorgio Fabbro
Shusuke Takahashi
Yuki Mitsufuji
61
5
0
13 Oct 2021