ResearchTrend.AI
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 753 papers shown
Learning Source Disentanglement in Neural Audio Codec
Xiaoyu Bie
Xubo Liu
Gaël Richard
29
1
0
17 Sep 2024
Ultra-Low Latency Speech Enhancement - A Comprehensive Study
Haibin Wu
Sebastian Braun
28
0
0
16 Sep 2024
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Ju Liu
VLM
77
1
0
14 Sep 2024
Biomimetic Frontend for Differentiable Audio Processing
Ruolan Leslie Famularo
D. Zotkin
S. Shamma
R. Duraiswami
AI4TS
36
0
0
13 Sep 2024
TSELM: Target Speaker Extraction using Discrete Tokens and Language Models
Beilong Tang
Bang Zeng
Ming Li
35
2
0
12 Sep 2024
DENSE: Dynamic Embedding Causal Target Speech Extraction
Yiwen Wang
Zeyu Yuan
Xihong Wu
46
0
0
10 Sep 2024
Mel-RoFormer for Vocal Separation and Vocal Melody Transcription
Ju-Chiang Wang
Fan Zhang
Jitong Chen
26
1
0
07 Sep 2024
NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention
Dashanka De Silva
Siqi Cai
Saurav Pahuja
Tanja Schultz
Haizhou Li
33
0
0
04 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
37
2
0
04 Sep 2024
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
18
0
0
02 Sep 2024
LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Zengrui Jin
Yifan Yang
Mohan Shi
Wei Kang
Xiaoyu Yang
...
Lingwei Meng
Long Lin
Yong Xu
Shi-Xiong Zhang
Daniel Povey
28
2
0
01 Sep 2024
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
K. Chen
Jiaqi Su
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Zeyu Jin
40
1
0
28 Aug 2024
Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
34
2
0
26 Aug 2024
Efficient Area-based and Speaker-Agnostic Source Separation
Martin Strauss
Okan Kopuklu
26
3
0
19 Aug 2024
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
Tao Sun
Sander Bohté
23
2
0
14 Aug 2024
BSS-CFFMA: Cross-Domain Feature Fusion and Multi-Attention Speech Enhancement Network based on Self-Supervised Embedding
Alimjan Mattursun
Liejun Wang
Yinfeng Yu
30
2
0
13 Aug 2024
Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder
Leonardo Berti
DRL
32
0
0
12 Aug 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
G. Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
37
7
0
06 Aug 2024
Enhanced Reverberation as Supervision for Unsupervised Speech Separation
Kohei Saijo
G. Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
31
1
0
06 Aug 2024
RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues
Tianrui Pan
Jie Liu
Bohan Wang
Jie Tang
Gangshan Wu
40
2
0
27 Jul 2024
Robustness of Speech Separation Models for Similar-pitch Speakers
Bunlong Lay
Sebastian Zaczek
Kristina Tesch
Timo Gerkmann
21
0
0
22 Jul 2024
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis
Xilin Jiang
Yinghao Aaron Li
Adrian Nicolas Florea
Cong Han
N. Mesgarani
Mamba
46
9
0
13 Jul 2024
A review of graph neural network applications in mechanics-related domains
Yingxue Zhao
Haoran Li
Haosu Zhou
H. Attar
Tobias Pfaff
Nan Li
AI4CE
37
5
0
10 Jul 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
32
1
0
09 Jul 2024
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Xubo Liu
Wenbo Wang
Shuhan Qi
Kejia Zhang
Jianyuan Sun
Wenwu Wang
30
4
0
06 Jul 2024
All Neural Low-latency Directional Speech Extraction
Ashutosh Pandey
Sanha Lee
Juan Azcarreta
Daniel D. E. Wong
Buye Xu
32
2
0
05 Jul 2024
Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model
Yu-Kuan Fu
Cheng-Kuang Lee
Hsiu-Hsuan Wang
Hung-yi Lee
24
0
0
02 Jul 2024
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Hiroshi Sato
Takafumi Moriya
Masato Mimura
Shota Horiguchi
Tsubasa Ochiai
Takanori Ashihara
Atsushi Ando
Kentaro Shinayama
Marc Delcroix
37
1
0
01 Jul 2024
Papez: Resource-Efficient Speech Separation with Auditory Working Memory
Hyunseok Oh
Juheon Yi
Youngki Lee
19
2
0
01 Jul 2024
Open-Source Conversational AI with SpeechBrain 1.0
Mirco Ravanelli
Titouan Parcollet
Adel Moumen
Sylvain de Langen
Cem Subakan
...
Salima Mdhaffar
G. Laperriere
Mickael Rouvier
Renato De Mori
Yannick Esteve
VLM
47
10
0
29 Jun 2024
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech Enhancement
Zhongshu Hou
Tong Lei
Qinwen Hu
Zhanzhong Cao
Ming Tang
Jing Lu
32
0
0
24 Jun 2024
Improved Remixing Process for Domain Adaptation-Based Speech Enhancement by Mitigating Data Imbalance in Signal-to-Noise Ratio
Li Li
Shogo Seki
33
0
0
20 Jun 2024
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
Chenda Li
Samuele Cornell
Shinji Watanabe
Yanmin Qian
DiffM
34
2
0
19 Jun 2024
Universal Score-based Speech Enhancement with High Content Preservation
Robin Scheibler
Yusuke Fujita
Yuma Shirahata
Tatsuya Komatsu
DiffM
37
10
0
18 Jun 2024
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling
Vahid Ahmadi Kalkhorani
Cheng Yu
Anurag Kumar
Ke Tan
Buye Xu
DeLiang Wang
34
0
0
17 Jun 2024
SMRU: Split-and-Merge Recurrent-based UNet for Acoustic Echo Cancellation and Noise Suppression
Zhihang Sun
Andong Li
Rilin Chen
Hao Zhang
Meng Yu
Yi Zhou
Dong Yu
66
0
0
17 Jun 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Guinan Li
Jiajun Deng
Youjun Chen
Mengzhe Geng
Shujie Hu
...
Zengrui Jin
Tianzi Wang
Xurong Xie
Helen Meng
Xunying Liu
VLM
34
0
0
14 Jun 2024
TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information
Yiwen Wang
Xihong Wu
46
2
0
13 Jun 2024
Target Speaker Extraction with Curriculum Learning
Yun Liu
Xuechen Liu
Xiaoxiao Miao
Junichi Yamagishi
23
3
0
12 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
31
3
0
11 Jun 2024
RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention
Mingshuai Liu
Zhuangqi Chen
Xiaopeng Yan
Yuanjun Lv
Xianjun Xia
Chuanzeng Huang
Yijian Xiao
Lei Xie
46
2
0
11 Jun 2024
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms
Seung-bin Kim
Chan-yeong Lim
Jungwoo Heo
Ju-ho Kim
Hyun-Seo Shin
Kyo-Won Koo
Ha-Jin Yu
52
0
0
11 Jun 2024
Unsupervised Improved MVDR Beamforming for Sound Enhancement
Jacob Kealey
John Hershey
François Grondin
16
0
0
10 Jun 2024
Towards Signal Processing In Large Language Models
Prateek Verma
Mert Pilanci
42
3
0
10 Jun 2024
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Julius Richter
Yi-Chiao Wu
Steven Krenn
Simon Welker
Bunlong Lay
Shinji Watanabe
Alexander Richard
Timo Gerkmann
38
17
0
10 Jun 2024
Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge
Thanapat Trachu
Chawan Piansaddhayanon
E. Chuangsuwanich
34
2
0
10 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
37
7
0
07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
Wangyou Zhang
Kohei Saijo
Jee-weon Jung
Chenda Li
Shinji Watanabe
Yanmin Qian
32
4
0
06 Jun 2024
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Danilo de Oliveira
Simon Welker
Julius Richter
Timo Gerkmann
36
5
0
05 Jun 2024
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
Brandon Colelough
Andrew Zheng
24
1
0
30 May 2024