ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07454
  4. Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani
ArXivPDFHTML

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 754 papers shown
Title
Switching Independent Vector Analysis and Its Extension to Blind and
  Spatially Guided Convolutional Beamforming Algorithms
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
Naoyuki Kamo
S. Araki
33
8
0
20 Nov 2021
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable
  and Efficient Speech Enhancement
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Sunwoo Kim
Minje Kim
31
4
0
17 Nov 2021
Unsupervised Speech Enhancement with speech recognition embedding and
  disentanglement losses
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
V. Trinh
Sebastian Braun
17
17
0
16 Nov 2021
S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech
  enhancement
S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement
Shubo Lv
Yihui Fu
Mengtao Xing
Jiayao Sun
Lei Xie
Jun Huang
Yannan Wang
Tao Yu
8
54
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
19
31
0
15 Nov 2021
Time-Frequency Attention for Monaural Speech Enhancement
Time-Frequency Attention for Monaural Speech Enhancement
Qiquan Zhang
Qi Song
Zhaoheng Ni
Aaron Nicolson
Haizhou Li
11
27
0
15 Nov 2021
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Ladislav Mošner
Oldrich Plchot
L. Burget
J. Černocký
37
7
0
11 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network
  for simultaneous speech enhancement and dereverberation
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
27
49
0
11 Nov 2021
Joint Neural AEC and Beamforming with Double-Talk Detection
Joint Neural AEC and Beamforming with Double-Talk Detection
Vinay Kothapally
Yong-mei Xu
Meng Yu
Shizhong Zhang
Dong Yu
28
5
0
09 Nov 2021
Learning Filterbanks for End-to-End Acoustic Beamforming
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
38
7
0
08 Nov 2021
Inter-channel Conv-TasNet for multichannel speech enhancement
Inter-channel Conv-TasNet for multichannel speech enhancement
Dongheon Lee
Seongrae Kim
Jung-Woo Choi
16
12
0
08 Nov 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
43
6
0
07 Nov 2021
Hybrid Spectrogram and Waveform Source Separation
Hybrid Spectrogram and Waveform Source Separation
Alexandre Défossez
24
162
0
05 Nov 2021
Target Speech Extraction: Independent Vector Extraction Guided by
  Supervised Speaker Identification
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification
J. Málek
Jakub Janský
Zbyněk Koldovský
Tomás Kounovský
Jaroslav Cmejla
J. Zdánský
25
10
0
05 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with
  Recurrent Neural Networks
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
27
10
0
02 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
SNRi Target Training for Joint Speech Enhancement and Recognition
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
30
14
0
01 Nov 2021
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Self-Supervised Speech Denoising Using Only Noisy Audio Signals
Jiasong Wu
Qingchun Li
Guanyu Yang
Lei Li
L. Senhadji
H. Shu
21
10
0
30 Oct 2021
Personalized breath based biometric authentication with wearable
  multimodality
Personalized breath based biometric authentication with wearable multimodality
Manh-Ha Bui
Viet-Anh Tran
Cuong Pham
15
9
0
29 Oct 2021
TorchAudio: Building Blocks for Audio and Speech Processing
TorchAudio: Building Blocks for Audio and Speech Processing
Yao-Yuan Yang
Moto Hira
Zhaoheng Ni
Anjali Chourdia
Artyom Astafurov
...
Sean Narenthiran
Shinji Watanabe
Soumith Chintala
Vincent Quenneville-Bélair
Yangyang Shi
31
165
0
28 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
27
7
0
28 Oct 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on
  Real and Simulation Conditions
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Y. Qian
36
22
0
27 Oct 2021
REAL-M: Towards Speech Separation on Real Mixtures
REAL-M: Towards Speech Separation on Real Mixtures
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
30
17
0
20 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain
  Multichannel Speech Enhancement
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
19
40
0
20 Oct 2021
Adapting Speech Separation to Real-World Meetings Using Mixture
  Invariant Training
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
22
22
0
20 Oct 2021
Progressive Learning for Stabilizing Label Selection in Speech
  Separation with Mapping-based Method
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
38
0
0
20 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World
  Soundtracks
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
23
37
0
19 Oct 2021
NN3A: Neural Network supported Acoustic Echo Cancellation, Noise
  Suppression and Automatic Gain Control for Real-Time Communications
NN3A: Neural Network supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications
Ziteng Wang
Yueyue Na
Biao Tian
Q. Fu
29
11
0
16 Oct 2021
Toward Degradation-Robust Voice Conversion
Toward Degradation-Robust Voice Conversion
Chien-yu Huang
Kai-Wei Chang
Hung-yi Lee
30
7
0
14 Oct 2021
Music Source Separation with Deep Equilibrium Models
Music Source Separation with Deep Equilibrium Models
Yuichiro Koyama
Naoki Murata
Stefan Uhlich
Giorgio Fabbro
Shusuke Takahashi
Yuki Mitsufuji
31
5
0
13 Oct 2021
All-neural beamformer for continuous speech separation
All-neural beamformer for continuous speech separation
Zhuohuang Zhang
Takuya Yoshioka
Naoyuki Kanda
Zhuo Chen
Xiaofei Wang
Dongmei Wang
Sefik Emre Eskimez
33
15
0
13 Oct 2021
Improving Character Error Rate Is Not Equal to Having Clean Speech:
  Speech Enhancement for ASR Systems with Black-box Acoustic Models
Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models
Ryosuke Sawata
Yosuke Kashiwagi
Shusuke Takahashi
11
6
0
12 Oct 2021
Source Mixing and Separation Robust Audio Steganography
Source Mixing and Separation Robust Audio Steganography
Naoya Takahashi
M. Singh
Yuki Mitsufuji
34
6
0
11 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in
  High-order Latent Domain
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
21
12
0
10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under
  mismatched conditions
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
Ge Zhu
Frank Cwitkowitz
Z. Duan
22
2
0
08 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant
  Sound Source Separation
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation
Ali Aroudi
Stefan Uhlich
M. Font
ViT
27
5
0
08 Oct 2021
An Investigation of the Effectiveness of Phase for Audio Classification
An Investigation of the Effectiveness of Phase for Audio Classification
Shunsuke Hidaka
Kohei Wakamiya
T. Kaburagi
23
4
0
06 Oct 2021
End-to-End Complex-Valued Multidilated Convolutional Neural Network for
  Joint Acoustic Echo Cancellation and Noise Suppression
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression
Karn N. Watcharasupat
Thi Ngoc Tho Nguyen
W. Gan
Shengkui Zhao
Bin Ma
33
12
0
02 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
34
41
0
30 Sep 2021
VoiceFixer: Toward General Speech Restoration with Neural Vocoder
VoiceFixer: Toward General Speech Restoration with Neural Vocoder
Haohe Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
33
57
0
28 Sep 2021
FastMVAE2: On improving and accelerating the fast variational
  autoencoder-based source separation algorithm for determined mixtures
FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Li Li
Hirokazu Kameoka
S. Makino
DRL
35
8
0
28 Sep 2021
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Chao Xie
Yi-Chiao Wu
Patrick Lumban Tobing
Wen-Chin Huang
T. Toda
23
7
0
22 Sep 2021
NORESQA: A Framework for Speech Quality Assessment using Non-Matching
  References
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References
Pranay Manocha
Buye Xu
Anurag Kumar
35
44
0
16 Sep 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music
  Source Separation
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
118
96
0
12 Sep 2021
Incorporating Real-world Noisy Speech in Neural-network-based Speech
  Enhancement Systems
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems
Yangyang Xia
Buye Xu
Anurag Kumar
19
7
0
11 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
37
7
0
09 Sep 2021
A Survey of Sound Source Localization with Deep Learning Methods
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
38
246
0
08 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection
  Fusion Module for Noise-robust ASR
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR
Fu-An Chao
J. Hung
Berlin Chen
10
7
0
26 Aug 2021
Learning Sparse Analytic Filters for Piano Transcription
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
27
2
0
23 Aug 2021
Convolutive Prediction for Monaural Speech Dereverberation and
  Noisy-Reverberant Speaker Separation
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
22
31
0
16 Aug 2021
Convolutive Prediction for Reverberant Speech Separation
Convolutive Prediction for Reverberant Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
28
12
0
16 Aug 2021
Previous
123...91011...141516
Next