ResearchTrend.AI
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 754 papers shown
Improving RNN Transducer With Target Speaker Extraction and Neural Uncertainty Estimation
Jiatong Shi
Chunlei Zhang
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
25
12
0
26 Nov 2020
Speech Denoising with Auditory Models
Mark R. Saddler
Andrew Francl
J. Feather
Kaizhi Qian
Yang Zhang
Josh H. McDermott
4
6
0
21 Nov 2020
One Shot Learning for Speech Separation
Yuan-Kuei Wu
Kuan-Po Huang
Yu Tsao
Hung-yi Lee
VLM
29
8
0
20 Nov 2020
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
J. Dang
Haizhou Li
6
42
0
19 Nov 2020
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation
Zhaoheng Ni
Yong-mei Xu
Meng Yu
Bo Wu
Shi-Xiong Zhang
Dong Yu
Michael I. Mandel
22
8
0
18 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
19
10
0
17 Nov 2020
Ultra-Lightweight Speech Separation via Group Communication
Yi Luo
Cong Han
N. Mesgarani
VLM
25
30
0
17 Nov 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
17
46
0
11 Nov 2020
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop
M. Marge
C. Espy-Wilson
Roger K. Moore
26
78
0
11 Nov 2020
Informed Source Extraction With Application to Acoustic Echo Reduction
Mohamed Elminshawi
Wolfgang Mack
Emanuel Habets
11
2
0
09 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
39
81
0
07 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
29
29
0
04 Nov 2020
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Yihui Fu
Jian Wu
Yanxin Hu
Mengtao Xing
Lei Xie
28
23
0
04 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
6
84
0
03 Nov 2020
Two Heads Are Better Than One: A Two-Stage Approach for Monaural Noise Reduction in the Complex Domain
Andong Li
C. Zheng
Renhua Peng
Xiaodong Li
17
10
0
03 Nov 2020
What's All the FUSS About Free Universal Sound Separation Data?
Scott Wisdom
Hakan Erdogan
D. Ellis
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
Justin Salamon
Prem Seetharaman
J. Hershey
27
82
0
02 Nov 2020
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement
Xiang Hao
Xiangdong Su
Radu Horaud
Xiaofei Li
25
194
0
29 Oct 2020
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang
Shun-Po Chuang
Da-Rong Liu
Yi-Chen Chen
Gene-Ping Yang
Hung-yi Lee
SSL
41
14
0
29 Oct 2020
Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
Efthymios Tzinis
Dimitrios Bralios
Paris Smaragdis
21
1
0
25 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
45
539
0
25 Oct 2020
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
8
3
0
25 Oct 2020
A Study of Transfer Learning in Music Source Separation
Andreas Bugler
Bryan Pardo
Prem Seetharaman
27
3
0
23 Oct 2020
Speech enhancement aided end-to-end multi-task learning for voice activity detection
Xu Tan
Xiao-Lei Zhang
29
32
0
23 Oct 2020
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Takuya Yoshioka
Shujie Liu
Jinyu Li
29
26
0
23 Oct 2020
Listening to Sounds of Silence for Speech Denoising
Ruilin Xu
Rundi Wu
Y. Ishiwaka
Carl Vondrick
Changxi Zheng
28
32
0
22 Oct 2020
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision
Yun-Ning Hung
Gordon Wichern
Jonathan Le Roux
17
12
0
22 Oct 2020
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Hideyuki Tachibana
26
14
0
22 Oct 2020
DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation
Ali Aroudi
Sebastian Braun
6
7
0
22 Oct 2020
BERT for Joint Multichannel Speech Dereverberation with Spatial-aware Tasks
Yang Jiao
15
0
0
21 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
38
11
0
20 Oct 2020
Phase recovery with Bregman divergences for audio source separation
P. Magron
Pierre-Hugo Vial
Thomas Oberlin
Cédric Févotte
29
1
0
20 Oct 2020
Fast accuracy estimation of deep learning based multi-class musical source separation
A. Mocanu
B. Ricaud
Milos Cernak
14
0
0
19 Oct 2020
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
21
9
0
19 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
23
45
0
15 Oct 2020
The Cone of Silence: Speech Separation by Localization
Teerapat Jenrungrot
V. Jayaram
S. M. Seitz
Ira Kemelmacher-Shlizerman
32
54
0
12 Oct 2020
All for One and One for All: Improving Music Separation by Bridging Networks
Ryosuke Sawata
Stefan Uhlich
Shusuke Takahashi
Yuki Mitsufuji
21
47
0
08 Oct 2020
Adversarial attacks on audio source separation
Naoya Takahashi
S. Inoue
Yuki Mitsufuji
AAML
9
9
0
07 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
33
88
0
04 Oct 2020
Sense and Learn: Self-Supervision for Omnipresent Sensors
Aaqib Saeed
Victor Ungureanu
Beat Gfeller
OOD
SSL
22
39
0
28 Sep 2020
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Hang Chen
Jun Du
Yu Hu
Lirong Dai
Baocai Yin
Chin-Hui Lee
36
19
0
21 Sep 2020
Online Speaker Diarization with Relation Network
Xiang Li
Yucheng Zhao
Chong Luo
Wenjun Zeng
8
2
0
17 Sep 2020
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
19
21
0
07 Sep 2020
Toward Speech Separation in The Pre-Cocktail Party Problem with TasTas
Ziqiang Shi
Jiqing Han
10
0
0
07 Sep 2020
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
Ashutosh Pandey
DeLiang Wang
31
135
0
03 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
28
29
0
02 Sep 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
23
210
0
28 Aug 2020
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
23
126
0
13 Aug 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik
Ritwik Giri
Neerad Phansalkar
J. Valin
Karim Helwani
A. Krishnaswamy
21
83
0
11 Aug 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss
Ziqiang Shi
Rujie Liu
Jiqing Han
16
7
0
06 Aug 2020
Content based singing voice source separation via strong conditioning using aligned phonemes
Gabriel Meseguer-Brocal
Geoffroy Peeters
32
9
0
05 Aug 2020