ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.07454
  4. Cited By
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani
ArXivPDFHTML

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 754 papers shown
Title
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech
  Separation
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation
Jian Luo
Jianzong Wang
Ning Cheng
Edward Xiao
Xulong Zhang
Jing Xiao
ViT
27
12
0
28 Jun 2022
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech
  Enhancement
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Ishan Chatterjee
Maruchi Kim
V. Jayaram
Shyamnath Gollakota
Ira Kemelmacher-Shlizerman
Shwetak N. Patel
S. M. Seitz
21
25
0
27 Jun 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and
  STFT Magnitudes
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
21
18
0
23 Jun 2022
Restoring speech intelligibility for hearing aid users with deep
  learning
Restoring speech intelligibility for hearing aid users with deep learning
P. U. Diehl
Y. Singer
Hannes Zilly
U. Schonfeld
Paul Meyer-Rachner
Mark Berry
Henning Sprekeler
Elias Sprengel
A. Pudszuhn
V. Hofmann
6
18
0
23 Jun 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech
  Segregation Models
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
14
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
43
16
0
19 Jun 2022
GMM based multi-stage Wiener filtering for low SNR speech enhancement
GMM based multi-stage Wiener filtering for low SNR speech enhancement
Wageesha Manamperi
P. Samarasinghe
T. Abhayapala
J. Zhang
16
6
0
19 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
24
7
0
18 Jun 2022
Simultaneous Speech Extraction for Multiple Target Speakers under the
  Meeting Scenarios
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios
Bang Zeng
Weiqing Wang
Yuanyuan Bao
Ming Li
27
0
0
17 Jun 2022
Strategies to Improve Robustness of Target Speech Extraction to
  Enrollment Variations
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoki Makishima
Mana Ihori
Tomohiro Tanaka
Ryo Masumura
6
5
0
16 Jun 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation
  Systems
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li
Xiaolin Hu
Yi Luo
20
15
0
15 Jun 2022
On the Design and Training Strategies for RNN-based Online Neural Speech
  Separation Systems
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems
Kai Li
Yi Luo
29
12
0
15 Jun 2022
LPCSE: Neural Speech Enhancement through Linear Predictive Coding
LPCSE: Neural Speech Enhancement through Linear Predictive Coding
Yang Liu
Na Tang
Xia Chu
Yang Yang
Jun Wang
28
1
0
14 Jun 2022
Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate
  Model Predictive Trajectory Tracking
Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate Model Predictive Trajectory Tracking
Alessandro Saviolo
Guanrui Li
Giuseppe Loianno
23
49
0
07 Jun 2022
Sampling Frequency Independent Dialogue Separation
Sampling Frequency Independent Dialogue Separation
Jouni Paulus
Matteo Torcoli
22
12
0
05 Jun 2022
Joint Training of Speech Enhancement and Self-supervised Model for
  Noise-robust ASR
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-shi Zhu
Jie Zhang
Zitian Zhang
Lirong Dai
43
15
0
26 May 2022
SepIt: Approaching a Single Channel Speech Separation Bound
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
43
27
0
24 May 2022
Streaming Noise Context Aware Enhancement For Automatic Speech
  Recognition in Multi-Talker Environments
Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments
Joseph Peter Caroselli
A. Narayanan
Yiteng Huang
11
1
0
17 May 2022
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for
  Monaural Speech Dereverberation
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation
William Ravenscroft
Stefan Goetze
Thomas Hain
29
6
0
17 May 2022
A deep representation learning speech enhancement method using
  $β$-VAE
A deep representation learning speech enhancement method using βββ-VAE
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
24
2
0
11 May 2022
Speaker Reinforcement Using Target Source Extraction for Robust
  Automatic Speech Recognition
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
24
11
0
09 May 2022
Mask-based Neural Beamforming for Moving Speakers with
  Self-Attention-based Tracking
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
S. Araki
6
20
0
07 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural
  Speech Enhancement
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Andong Li
Shan You
Guochen Yu
C. Zheng
Xiaodong Li
30
26
0
30 Apr 2022
Cleanformer: A multichannel array configuration-invariant neural
  enhancement frontend for ASR in smart speakers
Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers
Joseph Peter Caroselli
A. Narayanan
N. Howard
Tom O'Malley
28
4
0
25 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of
  Unsupervised Speech Separation
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
28
6
0
23 Apr 2022
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency
Zhong-Qiu Wang
G. Wichern
Shinji Watanabe
Jonathan Le Roux
25
36
0
21 Apr 2022
Music Source Separation with Generative Flow
Music Source Separation with Generative Flow
Ge Zhu
Jordan Darefsky
Fei Jiang
A. Selitskiy
Z. Duan
21
6
0
19 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker
  Extraction
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Zifeng Zhao
Rongzhi Gu
Dongchao Yang
Jinchuan Tian
Yuexian Zou
33
2
0
15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation
  System
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
27
20
0
14 Apr 2022
Receptive Field Analysis of Temporal Convolutional Networks for Monaural
  Speech Dereverberation
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation
William Ravenscroft
Stefan Goetze
Thomas Hain
11
8
0
13 Apr 2022
Listen only to me! How well can target speech extraction handle false
  alarms?
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Kateřina Žmolíková
Hiroshi Sato
Tomohiro Nakatani
34
15
0
11 Apr 2022
SoundBeam: Target sound extraction conditioned on sound-class labels and
  enrollment clues for increased performance and continuous learning
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
Yasunori Ohishi
S. Araki
VLM
22
32
0
08 Apr 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition using
  Joint Adversarial Fine-tuning with Denoiser
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
Sonal Joshi
Saurabh Kataria
Yiwen Shao
Piotr Żelasko
Jesus Villalba
Sanjeev Khudanpur
Najim Dehak
AAML
33
4
0
08 Apr 2022
AdvEst: Adversarial Perturbation Estimation to Classify and Detect
  Adversarial Attacks against Speaker Identification
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification
Sonal Joshi
Saurabh Kataria
Jesus Villalba
Najim Dehak
AAML
38
7
0
08 Apr 2022
Audio-visual multi-channel speech separation, dereverberation and
  recognition
Audio-visual multi-channel speech separation, dereverberation and recognition
Guinan Li
Jianwei Yu
Jiajun Deng
Xunying Liu
Helen Meng
19
7
0
05 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and
  Approaches
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Zifeng Zhao
Dongchao Yang
Rongzhi Gu
Haoran Zhang
Yuexian Zou
23
16
0
04 Apr 2022
tPLCnet: Real-time Deep Packet Loss Concealment in the Time Domain Using
  a Short Temporal Context
tPLCnet: Real-time Deep Packet Loss Concealment in the Time Domain Using a Short Temporal Context
Nils L. Westhausen
B. Meyer
21
7
0
04 Apr 2022
Improving Target Sound Extraction with Timestamp Information
Improving Target Sound Extraction with Timestamp Information
Helin Wang
Dongchao Yang
Chao Weng
Jianwei Yu
Yuexian Zou
25
8
0
02 Apr 2022
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement
  Network (E3Net) and Knowledge Distillation
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation
Manthan Thakker
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
14
28
0
02 Apr 2022
End-to-End Integration of Speech Recognition, Speech Enhancement, and
  Self-Supervised Learning Representation
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Xuankai Chang
Takashi Maekaku
Yuya Fujita
Shinji Watanabe
VLM
51
45
0
01 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech
  Separation for Flexible Number of Speakers
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
39
32
0
31 Mar 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain
  Target Speaker Extraction
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
21
17
0
31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue
Speaker Extraction with Co-Speech Gestures Cue
Zexu Pan
Xinyuan Qian
Haizhou Li
SLR
21
27
0
31 Mar 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition
  in Multi-party Meetings
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
22
13
0
31 Mar 2022
Joint domain adaptation and speech bandwidth extension using time-domain
  GANs for speaker verification
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Saurabh Kataria
Jesús Villalba
Laureano Moro Velázquez
Najim Dehak
19
3
0
30 Mar 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Tal Peer
Timo Gerkmann
22
21
0
30 Mar 2022
Coarse-to-Fine Recursive Speech Separation for Unknown Number of
  Speakers
Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Zhenhao Jin
Xiang Hao
Xiangdong Su
19
4
0
30 Mar 2022
Disentangling the Impacts of Language and Channel Variability on Speech
  Separation Networks
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Fan Wang
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
29
4
0
30 Mar 2022
Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for
  Real-Time Full-Band Speech Enhancement
Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Guochen Yu
Andong Li
Wenzhe Liu
C. Zheng
Yutian Wang
Haibo Wang
30
4
0
30 Mar 2022
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level
  and Utterance-Level Acoustic Representation Learning
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
Takaaki Saeki
Kentaro Tachibana
Ryuichi Yamamoto
15
10
0
29 Mar 2022
Previous
123...789...141516
Next