ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07454
  4. Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani
ArXivPDFHTML

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 757 papers shown
Title
End-to-End Diarization for Variable Number of Speakers with Local-Global
  Networks and Discriminative Speaker Embeddings
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
27
21
0
05 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers
AvaTr: One-Shot Speaker Extraction with Transformers
S. Hu
Md Rifat Arefin
V. Nguyen
Alish Dipani
Xaq Pitkow
A. Tolias
38
4
0
03 May 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion
  Network for Speech Enhancement
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
76
96
0
27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
40
6
0
26 Apr 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation
  Training
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training
Shaked Dovrat
Eliya Nachmani
Lior Wolf
VLM
14
21
0
18 Apr 2021
Multi-Metric Optimization using Generative Adversarial Networks for
  Near-End Speech Intelligibility Enhancement
Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
Haoyu Li
Junichi Yamagishi
27
9
0
17 Apr 2021
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Xiyun Li
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Jiaming Xu
Bo Xu
Dong Yu
22
14
0
17 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration
On the Design of Deep Priors for Unsupervised Audio Restoration
V. Narayanaswamy
Jayaraman J. Thiagarajan
A. Spanias
AI4CE
37
5
0
14 Apr 2021
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
E. Guizzo
R. F. Gramaccioni
Saeid Jamili
Christian Marinoni
Edoardo Massaro
...
Marco Pennese
Sveva Pepe
Enrico Rocchi
A. Uncini
Danilo Comminiello
21
27
0
12 Apr 2021
Learning to Rank Microphones for Distant Speech Recognition
Learning to Rank Microphones for Distant Speech Recognition
Samuele Cornell
Alessio Brutti
M. Matassoni
S. Squartini
25
4
0
06 Apr 2021
Noise Estimation for Generative Diffusion Models
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
38
105
0
06 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data
  Augmentation and Purification
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Aswin Sivaraman
Sunwoo Kim
Minje Kim
11
23
0
05 Apr 2021
Efficient Personalized Speech Enhancement through Self-Supervised
  Learning
Efficient Personalized Speech Enhancement through Self-Supervised Learning
Aswin Sivaraman
Minje Kim
26
19
0
05 Apr 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality
  Assessment
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu
Chunlei Zhang
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
10
30
0
02 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
18
4
0
01 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single
  and Multi-talker Speech
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
34
32
0
30 Mar 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
21
9
0
30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement
On TasNet for Low-Latency Single-Speaker Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
17
2
0
27 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming
Blind Speech Separation and Dereverberation using Neural Beamforming
Lukas Pfeifenberger
Franz Pernkopf
18
5
0
24 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
16
28
0
19 Mar 2021
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural
  Singing Voice Separation
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
40
2
0
07 Mar 2021
Compute and memory efficient universal sound source separation
Compute and memory efficient universal sound source separation
Efthymios Tzinis
Zhepei Wang
Xilin Jiang
Paris Smaragdis
26
40
0
03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for
  Attention Networks Simulating Cocktail Party Effect
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For
  Time-Domain Speech Separation
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
AI4TS
72
49
0
01 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal
  Convolutional Networks
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
27
50
0
24 Feb 2021
Handling Background Noise in Neural Speech Generation
Handling Background Noise in Neural Speech Generation
Tom Denton
Alejandro Luebs
Felicia S. C. Lim
Andrew Storus
Hengchin Yeh
W. Kleijn
Jan Skoglund
13
2
0
23 Feb 2021
Dual-Path Modeling for Long Recording Speech Separation in Meetings
Dual-Path Modeling for Long Recording Speech Separation in Meetings
Chenda Li
Zhuo Chen
Yi Luo
Cong Han
Tianyan Zhou
K. Kinoshita
Marc Delcroix
Shinji Watanabe
Y. Qian
24
10
0
23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on
  Transformer
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
36
21
0
19 Feb 2021
Speech enhancement with weakly labelled data from AudioSet
Speech enhancement with weakly labelled data from AudioSet
Qiuqiang Kong
Haohe Liu
Xingjian Du
Li Chen
Rui Xia
Yuxuan Wang
18
18
0
19 Feb 2021
CatNet: music source separation system with mix-audio augmentation
CatNet: music source separation system with mix-audio augmentation
Xuchen Song
Qiuqiang Kong
Xingjian Du
Yuxuan Wang
31
10
0
19 Feb 2021
Generative Speech Coding with Predictive Variance Regularization
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
29
67
0
18 Feb 2021
Deep Convolutional and Recurrent Networks for Polyphonic Instrument
  Classification from Monophonic Raw Audio Waveforms
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms
Kleanthis Avramidis
Agelos Kratimenos
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
11
8
0
13 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised
  Classifier
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier
Guillaume Carbajal
Julius Richter
Timo Gerkmann
DRL
SSL
13
16
0
12 Feb 2021
Real-time Monaural Speech Enhancement With Short-time Discrete Cosine
  Transform
Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform
Qinglong Li
Fei Gao
Haixing Guan
Kaichi Ma
33
24
0
09 Feb 2021
Speaker and Direction Inferred Dual-channel Speech Separation
Speaker and Direction Inferred Dual-channel Speech Separation
Chenxing Li
Jiaming Xu
N. Mesgarani
Bo Xu
16
8
0
08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker
  Conditioning Mechanism
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
22
13
0
07 Feb 2021
Monaural Speech Enhancement with Complex Convolutional Block Attention
  Module and Joint Time Frequency Losses
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao
Trung Hieu Nguyen
B. Ma
29
41
0
03 Feb 2021
Multimodal Attention Fusion for Target Speaker Extraction
Multimodal Attention Fusion for Target Speaker Extraction
Hiroshi Sato
Tsubasa Ochiai
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
S. Araki
11
27
0
02 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
274
327
0
24 Jan 2021
Towards efficient models for real-time deep noise suppression
Towards efficient models for real-time deep noise suppression
Sebastian Braun
H. Gamper
Chandan K. A. Reddy
I. Tashev
21
104
0
22 Jan 2021
LEAF: A Learnable Frontend for Audio Classification
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
85
144
0
21 Jan 2021
A Joint Diagonalization Based Efficient Approach to Underdetermined
  Blind Audio Source Separation Using the Multichannel Wiener Filter
A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
N. Ito
Rintaro Ikeshita
H. Sawada
Tomohiro Nakatani
6
25
0
21 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive
  Locally Recurrent Networks
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
43
29
0
13 Jan 2021
Neural Network-based Virtual Microphone Estimator
Neural Network-based Virtual Microphone Estimator
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
S. Araki
22
10
0
12 Jan 2021
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
Yong-mei Xu
Z. Zhang
Meng Yu
Shi-Xiong Zhang
Dong Yu
18
1
0
04 Jan 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
16
28
0
24 Dec 2020
The 2020 ESPnet update: new features, broadened applications,
  performance improvements, and future plans
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
47
38
0
23 Dec 2020
Continuous Speech Separation Using Speaker Inventory for Long
  Multi-talker Recording
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
11
8
0
17 Dec 2020
Group Communication with Context Codec for Lightweight Source Separation
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
26
20
0
14 Dec 2020
Towards speech enhancement using a variational U-Net architecture
Towards speech enhancement using a variational U-Net architecture
E. J. Nustede
Jörn Anemüller
17
1
0
07 Dec 2020
Previous
123...111213141516
Next