ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07454
  4. Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
v1v2v3 (latest)

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani
ArXiv (abs)PDFHTML

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
Title
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network
  Speech Enhancement
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Bengt J. Borgström
M. Brandstein
62
2
0
21 Sep 2023
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Shafique Ahmed
Chia-Wei Chen
Wenze Ren
Chin-Jou Li
Ernie Chu
Jun-Cheng Chen
Amir Hussain
H. Wang
Yu Tsao
Jen-Cheng Hou
57
3
0
20 Sep 2023
Single and Few-step Diffusion for Generative Speech Enhancement
Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay
Jean-Marie Lemercier
Julius Richter
Timo Gerkmann
DiffM
61
10
0
18 Sep 2023
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for
  Multi-channel Noise Reduction
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
42
0
0
18 Sep 2023
Audio-Visual Active Speaker Extraction for Sparsely Overlapped
  Multi-talker Speech
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
76
6
0
15 Sep 2023
DiaCorrect: Error Correction Back-end For Speaker Diarization
DiaCorrect: Error Correction Back-end For Speaker Diarization
Jiangyu Han
Federico Landini
Johan Rohdin
Mireia Díez
Lukás Burget
Yuhang Cao
Heng Lu
J. Černocký
66
3
0
15 Sep 2023
Two-Step Knowledge Distillation for Tiny Speech Enhancement
Two-Step Knowledge Distillation for Tiny Speech Enhancement
Rayan Daod Nathoo
M. Kegler
Marko Stamenovic
59
6
0
15 Sep 2023
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Peter Vieting
Simon Berger
Thilo von Neumann
Christoph Boeddeker
Ralf Schluter
Reinhold Haeb-Umbach
105
0
0
15 Sep 2023
Complexity Scaling for Speech Denoising
Complexity Scaling for Speech Denoising
Hangting Chen
Jianwei Yu
Chao Weng
54
2
0
14 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional
  Speech Mixtures
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
J. Yip
Dianwen Ng
Bin Ma
Chng Eng Siong
52
0
0
14 Sep 2023
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Qinghua Liu
Meng Ge
Zhizheng Wu
Haizhou Li
73
1
0
13 Sep 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement
  Systems in Noisy and Reverberant Environments
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
66
14
0
12 Sep 2023
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
49
7
0
12 Sep 2023
Music Source Separation with Band-Split RoPE Transformer
Music Source Separation with Band-Split RoPE Transformer
Wei-Tsung Lu
Ju-Chiang Wang
Qiuqiang Kong
Yun-Ning Hung
102
25
0
05 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source
  Separation
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Aaron J. Hipple. Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
67
6
0
05 Sep 2023
Single-Channel Speech Enhancement with Deep Complex U-Networks and
  Probabilistic Latent Space Models
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
41
3
0
04 Sep 2023
Remixing-based Unsupervised Source Separation from Scratch
Remixing-based Unsupervised Source Separation from Scratch
Kohei Saijo
Tetsuji Ogawa
50
3
0
01 Sep 2023
Blind Source Separation of Single-Channel Mixtures via Multi-Encoder
  Autoencoders
Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders
Matthew B. Webster
Joonnyong Lee
59
1
0
31 Aug 2023
Deep learning-based denoising streamed from mobile phones improves
  speech-in-noise understanding for hearing aid users
Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users
P. U. Diehl
Hannes Zilly
Felix Sattler
Y. Singer
Kevin Kepp
...
Paul Meyer-Rachner
A. Pudszuhn
V. Hofmann
M. Vormann
Elias Sprengel
64
3
0
22 Aug 2023
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise
  Suppression
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Hangting Chen
Jianwei Yu
Yi Luo
Rongzhi Gu
Weihua Li
Zhuocheng Lu
Chao Weng
75
7
0
21 Aug 2023
TokenSplit: Using Discrete Speech Representations for Direct, Refined,
  and Transcript-Conditioned Speech Separation and Recognition
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Hakan Erdogan
Scott Wisdom
Xuankai Chang
Zalan Borsos
Marco Tagliasacchi
Neil Zeghidour
J. Hershey
69
11
0
21 Aug 2023
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual
  Speech Separation
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation
Kai Li
Run Yang
Fuchun Sun
Xiaolin Hu
90
8
0
16 Aug 2023
Conformer-based Target-Speaker Automatic Speech Recognition for
  Single-Channel Audio
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio
Yang Zhang
Krishna C. Puvvada
Vitaly Lavrukhin
Boris Ginsburg
74
16
0
09 Aug 2023
Separate Anything You Describe
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yiitan Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
105
52
0
09 Aug 2023
Music De-limiter Networks via Sample-wise Gain Inversion
Music De-limiter Networks via Sample-wise Gain Inversion
Chang-Bin Jeon
Kyogu Lee
55
1
0
02 Aug 2023
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient
  Monaural Speech Enhancement
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu
Weiping Tu
Yuhong Yang
57
9
0
28 Jul 2023
Exploring the Integration of Speech Separation and Recognition with
  Self-Supervised Learning Representation
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Yoshiki Masuyama
Xuankai Chang
Wangyou Zhang
Samuele Cornell
Zhongqiu Wang
Nobutaka Ono
Y. Qian
Shinji Watanabe
78
6
0
23 Jul 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
93
13
0
16 Jul 2023
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation
  and Recognition
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Guinan Li
Jiajun Deng
Mengzhe Geng
Zengrui Jin
Tianzi Wang
Shujie Hu
Mingyu Cui
Helen M. Meng
Xunying Liu
60
12
0
06 Jul 2023
Self-supervised learning with diffusion-based multichannel speech
  enhancement for speaker verification under noisy conditions
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
129
1
0
05 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via
  Distance and Speaker Information
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
67
9
0
28 Jun 2023
Algorithms of Sampling-Frequency-Independent Layers for Non-integer
  Strides
Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
48
2
0
19 Jun 2023
Visually-Guided Sound Source Separation with Audio-Visual Predictive
  Coding
Visually-Guided Sound Source Separation with Audio-Visual Predictive Coding
Zengjie Song
Zhaoxiang Zhang
50
1
0
19 Jun 2023
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source
  Separation
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation
Yoshiaki Bando
Yoshiki Masuyama
Aditya Arie Nugraha
Kazuyoshi Yoshii
BDL
57
4
0
17 Jun 2023
Multi-Loss Convolutional Network with Time-Frequency Attention for
  Speech Enhancement
Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement
Liang Wan
Hongqing Liu
Yi Zhou
Jie Ji
61
2
0
15 Jun 2023
Quantifying Spatial Audio Quality Impairment
Quantifying Spatial Audio Quality Impairment
Karn N. Watcharasupat
Alexander Lerch
60
2
0
13 Jun 2023
Audio-Visual Speech Enhancement With Selective Off-Screen Speech
  Extraction
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Tomoya Yoshinaga
Keitaro Tanaka
Shigeo Morishima
63
0
0
10 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated
  Convolution and Channel Attention
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
49
1
0
09 Jun 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive
  Feature Learning in Speech Enhancement
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
65
5
0
09 Jun 2023
A Mask Free Neural Network for Monaural Speech Enhancement
A Mask Free Neural Network for Monaural Speech Enhancement
Liangqi Liu
Haixing Guan
Jinlong Ma
Wei Dai
Guang-Yi Wang
Shaowei Ding
75
12
0
07 Jun 2023
Rethinking the visual cues in audio-visual speaker extraction
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
Jianwu Dang
Shiliang Zhang
72
9
0
05 Jun 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion
  Model
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
81
2
0
01 Jun 2023
A Multi-dimensional Deep Structured State Space Approach to Speech
  Enhancement Using Small-footprint Models
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Pin-Jui Ku
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin-Hui Lee
89
13
0
01 Jun 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight
  Iterative Model
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
H. Martel
Julius Richter
Kai Li
Xiaolin Hu
Timo Gerkmann
VLM
57
10
0
31 May 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging
  Over-determined Training Mixtures
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
77
11
0
31 May 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
83
9
0
29 May 2023
CAPTDURE: Captioned Sound Dataset of Single Sources
CAPTDURE: Captioned Sound Dataset of Single Sources
Yuki Okamoto
Kanta Shimonishi
Keisuke Imoto
Kota Dohi
Shota Horiguchi
Yohei Kawaguchi
52
1
0
28 May 2023
A Neural State-Space Model Approach to Efficient Speech Separation
A Neural State-Space Model Approach to Efficient Speech Separation
Chen Chen
Chao-Han Huck Yang
Kai Li
Yuchen Hu
Pin-Jui Ku
Chng Eng Siong
70
11
0
26 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and
  Diarization with a Sidecar Separator
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
72
10
0
25 May 2023
Anomalous Sound Detection Based on Sound Separation
Anomalous Sound Detection Based on Sound Separation
Kanta Shimonishi
Kota Dohi
Yohei Kawaguchi
61
5
0
25 May 2023
Previous
123456...141516
Next