Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
All-neural beamformer for continuous speech separation
Zhuohuang Zhang
Takuya Yoshioka
Naoyuki Kanda
Zhuo Chen
Xiaofei Wang
Dongmei Wang
Sefik Emre Eskimez
69
16
0
13 Oct 2021
Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models
Ryosuke Sawata
Yosuke Kashiwagi
Shusuke Takahashi
52
6
0
12 Oct 2021
Source Mixing and Separation Robust Audio Steganography
Naoya Takahashi
M. Singh
Yuki Mitsufuji
56
6
0
11 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
74
12
0
10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
Ge Zhu
Frank Cwitkowitz
Z. Duan
55
2
0
08 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation
Ali Aroudi
Stefan Uhlich
M. Font
ViT
51
5
0
08 Oct 2021
An Investigation of the Effectiveness of Phase for Audio Classification
Shunsuke Hidaka
Kohei Wakamiya
T. Kaburagi
28
4
0
06 Oct 2021
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression
Karn N. Watcharasupat
Thi Ngoc Tho Nguyen
W. Gan
Shengkui Zhao
Bin Ma
75
12
0
02 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
75
44
0
30 Sep 2021
VoiceFixer: Toward General Speech Restoration with Neural Vocoder
Haohe Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
87
58
0
28 Sep 2021
FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Li Li
Hirokazu Kameoka
S. Makino
DRL
77
8
0
28 Sep 2021
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Chao Xie
Yi-Chiao Wu
Patrick Lumban Tobing
Wen-Chin Huang
Tomoki Toda
57
8
0
22 Sep 2021
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References
Pranay Manocha
Buye Xu
Anurag Kumar
98
49
0
16 Sep 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
190
100
0
12 Sep 2021
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems
Yangyang Xia
Buye Xu
Anurag Kumar
40
7
0
11 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
75
8
0
09 Sep 2021
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR
Fu-An Chao
J. Hung
Berlin Chen
34
7
0
26 Aug 2021
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
67
2
0
23 Aug 2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
97
33
0
16 Aug 2021
Convolutive Prediction for Reverberant Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
87
12
0
16 Aug 2021
On The Compensation Between Magnitude and Phase in Speech Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
81
74
0
11 Aug 2021
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
120
36
0
06 Aug 2021
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
H. Sawada
S. Araki
60
19
0
04 Aug 2021
A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations
Debottam Dutta
Purvi Agrawal
Sriram Ganapathy
41
2
0
30 Jul 2021
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
65
23
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
58
6
0
30 Jul 2021
Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Prerak Srivastava
Antoine Deleforge
Emmanuel Vincent
68
17
0
29 Jul 2021
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization
Haici Yang
Shivani Firodiya
Nicholas J. Bryan
Minje Kim
69
7
0
28 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
72
0
0
23 Jul 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Duo Ma
Nana Hou
Van Tung Pham
Haihua Xu
Chng Eng Siong
69
22
0
22 Jul 2021
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Cheng-Hung Hu
Yu-Huai Peng
Junichi Yamagishi
Yu Tsao
Hsin-Min Wang
46
5
0
20 Jul 2021
Joint Echo Cancellation and Noise Suppression based on Cascaded Magnitude and Complex Mask Estimation
Xiaofeng Shu
Yehang Zhu
Yanjie Chen
Li Chen
Haohe Liu
Chuanzeng Huang
Yuxuan Wang
51
11
0
20 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
67
9
0
14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Xiaohuai Le
Hongsheng Chen
Kai-Jyun Chen
Jing Lu
71
83
0
12 Jul 2021
Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Shu-Tong Niu
Jun Du
Lei Sun
Chin-Hui Lee
43
5
0
06 Jul 2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Jian Wu
Zhuo Chen
Sanyuan Chen
Yu-Huan Wu
Takuya Yoshioka
Naoyuki Kanda
Shujie Liu
Jinyu Li
68
17
0
05 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Yohei Kawaguchi
79
38
0
04 Jul 2021
TENET: A Time-reversal Enhancement Network for Noise-robust ASR
Fu-An Chao
Shao-Wei Fan-Jiang
Bi-Cheng Yan
J. Hung
Berlin Chen
60
13
0
04 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
86
41
0
30 Jun 2021
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation
Ori Kabeli
Yossi Adi
Zhenyu Tang
Buye Xu
Anurag Kumar
34
2
0
25 Jun 2021
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition
Zhengxi Liu
Y. Qian
DRL
49
10
0
25 Jun 2021
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling
Andong Li
Wenzhe Liu
Xiaoxue Luo
Guochen Yu
C. Zheng
Xiaodong Li
75
60
0
24 Jun 2021
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair
Shanshan Wang
Gaurav Naithani
Archontis Politis
Tuomas Virtanen
61
10
0
22 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
91
147
0
22 Jun 2021
Multi-accent Speech Separation with One Shot Learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
100
4
0
22 Jun 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
74
68
0
20 Jun 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation
Martin Strauss
Jouni Paulus
Matteo Torcoli
B. Edler
51
9
0
16 Jun 2021
DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement
Shubo Lv
Yanxin Hu
Shimin Zhang
Lei Xie
61
94
0
16 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
56
22
0
15 Jun 2021