Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
93
11
0
20 Oct 2020
Phase recovery with Bregman divergences for audio source separation
P. Magron
Pierre-Hugo Vial
Thomas Oberlin
Cédric Févotte
46
1
0
20 Oct 2020
Fast accuracy estimation of deep learning based multi-class musical source separation
A. Mocanu
B. Ricaud
Milos Cernak
22
0
0
19 Oct 2020
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
62
9
0
19 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
55
50
0
15 Oct 2020
The Cone of Silence: Speech Separation by Localization
Teerapat Jenrungrot
V. Jayaram
S. M. Seitz
Ira Kemelmacher-Shlizerman
81
56
0
12 Oct 2020
All for One and One for All: Improving Music Separation by Bridging Networks
Ryosuke Sawata
Stefan Uhlich
Shusuke Takahashi
Yuki Mitsufuji
85
48
0
08 Oct 2020
Adversarial attacks on audio source separation
Naoya Takahashi
S. Inoue
Yuki Mitsufuji
AAML
42
10
0
07 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
58
90
0
04 Oct 2020
Sense and Learn: Self-Supervision for Omnipresent Sensors
Aaqib Saeed
Victor Ungureanu
Beat Gfeller
OOD, SSL
77
41
0
28 Sep 2020
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Hang Chen
Jun Du
Yu Hu
Lirong Dai
Baocai Yin
Chin-Hui Lee
86
20
0
21 Sep 2020
Online Speaker Diarization with Relation Network
Xiang Li
Yucheng Zhao
Chong Luo
Wenjun Zeng
32
2
0
17 Sep 2020
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
Toward Speech Separation in The Pre-Cocktail Party Problem with TasTas
Ziqiang Shi
Jiqing Han
31
0
0
07 Sep 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
100
138
0
03 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
71
29
0
02 Sep 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
141
222
0
28 Aug 2020
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
79
130
0
13 Aug 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
72
84
0
11 Aug 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss
Ziqiang Shi
Rujie Liu
Jiqing Han
38
7
0
06 Aug 2020
Content based singing voice source separation via strong conditioning using aligned phonemes
Gabriel Meseguer-Brocal
Geoffroy Peeters
56
9
0
05 Aug 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
Yanxin Hu
Yun Liu
Shubo Lv
Mengtao Xing
Shimin Zhang
Yihui Fu
Jian Wu
Bihong Zhang
Lei Xie
136
599
0
01 Aug 2020
Efficient Independent Vector Extraction of Dominant Target Speech
Lele Liao
Zhaoyi Gu
Jing Lu
46
0
0
01 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
120
289
0
28 Jul 2020
AutoClip: Adaptive Gradient Clipping for Source Separation Networks
Prem Seetharaman
Gordon Wichern
Bryan Pardo
Jonathan Le Roux
67
34
0
25 Jul 2020
CSLNSpeech: solving extended speech separation problem with the help of Chinese sign language
Jiasong Wu
Xuan Li
Taotao Li
Fanman Meng
Youyong Kong
Guanyu Yang
L. Senhadji
Huazhong Shu
CVBM
59
0
0
21 Jul 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
99
130
0
14 Jul 2020
Improving Sound Event Detection In Domestic Environments Using Sound Separation
Nicolas Turpault
Scott Wisdom
Hakan Erdogan
J. Hershey
Romain Serizel
Eduardo Fonseca
Prem Seetharaman
Justin Salamon
93
49
0
08 Jul 2020
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation
Pyry Pyykkönen
S. I. Mimilakis
Konstantinos Drossos
Tuomas Virtanen
37
4
0
06 Jul 2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks
Jibin Wu
Chenglin Xu
Daquan Zhou
Haizhou Li
Kay Chen Tan
67
117
0
02 Jul 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
32
6
0
01 Jul 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
96
22
0
25 Jun 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation
K. Kinoshita
Thilo von Neumann
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
29
9
0
24 Jun 2020
Real Time Speech Enhancement in the Waveform Domain
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
109
466
0
23 Jun 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
70
27
0
23 Jun 2020
Listen to What You Want: Neural Network-based Universal Sound Selector
Tsubasa Ochiai
Marc Delcroix
Yuma Koizumi
Hiroaki Ito
K. Kinoshita
S. Araki
78
62
0
10 Jun 2020
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
83
41
0
04 Jun 2020
Dilated U-net based approach for multichannel speech enhancement from First-Order Ambisonics recordings
Amélie Bosca
Alexandre Guérin
L. Perotin
Srdan Kitic
39
20
0
02 Jun 2020
Unsupervised Audio Source Separation using Generative Priors
V. Narayanaswamy
Jayaraman J. Thiagarajan
Rushil Anirudh
A. Spanias
58
27
0
28 May 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
Yuichiro Koyama
Oluwafemi Azeez
Bhiksha Raj
44
4
0
23 May 2020
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks
Yuichiro Koyama
Tyler Vuong
Stefan Uhlich
Bhiksha Raj
82
41
0
23 May 2020
Identify Speakers in Cocktail Parties with End-to-End Attention
Junzhe Zhu
M. Hasegawa-Johnson
Leda Sari
22
2
0
22 May 2020
End-to-End Multi-Look Keyword Spotting
Meng Yu
Xuan Ji
Bo Wu
Dan Su
Dong Yu
52
19
0
20 May 2020
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
Yuan-Kuei Wu
Chao-I Tuan
Hung-yi Lee
Yu Tsao
45
4
0
20 May 2020
Jointly optimal denoising, dereverberation, and source separation
Tomohiro Nakatani
Christoph Boeddeker
K. Kinoshita
Rintaro Ikeshita
Marc Delcroix
Reinhold Haeb-Umbach
47
46
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
39
38
0
19 May 2020
Audio-visual Multi-channel Recognition of Overlapped Speech
Jianwei Yu
Bo Wu
R. Yu
Shi-Xiong Zhang
Lianwu Chen
Yong Xu
Meng Yu
Dan Su
Dong Yu
Xunying Liu
Helen Meng
98
19
0
18 May 2020
The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
Ebrahim Beyrami
R. Cheng
...
A. Aazami
Sebastian Braun
Puneet Rana
Sriram Srinivasan
J. Gehrke
119
319
0
16 May 2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement
Aswin Sivaraman
Minje Kim
MoE
59
13
0
16 May 2020