Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

20 September 2018
Yi Luo
N. Mesgarani

Papers citing "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

50 / 773 papers shown
Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
Yunkee Chae
Kyogu Lee
10
0
0
19 Jun 2025
SpeechRefiner: Towards Perceptual Quality Refinement for Front-End Algorithms
Sirui Li
Shuai Wang
Zhijun Liu
Zhongjie Jiang
Yannan Wang
Haizhou Li
20
0
0
16 Jun 2025
Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing
Thanapat Trachu
Thanathai Lertpetchpun
Ekapol Chuangsuwanich
28
0
0
13 Jun 2025
A Review on Score-based Generative Models for Audio Applications
Ge Zhu
Yutong Wen
Zhiyao Duan
DiffM
MedIm
34
0
0
10 Jun 2025
A Fast and Lightweight Model for Causal Audio-Visual Speech Separation
Wendi Sang
Kai Li
Runxuan Yang
Jianqiang Huang
Xiaolin Hu
10
0
0
07 Jun 2025
Adaptive Differential Denoising for Respiratory Sounds Classification
Gaoyang Dong
Zhicheng Zhang
Ping Sun
Minghui Zhang
54
0
0
03 Jun 2025
DnR-nonverbal: Cinematic Audio Source Separation Dataset Containing Non-Verbal Sounds
Takuya Hasumi
Yusuke Fujita
51
0
0
03 Jun 2025
Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency
Bunlong Lay
Rostislav Makarov
Timo Gerkmann
56
0
0
03 Jun 2025
M3ANet: Multi-scale and Multi-Modal Alignment Network for Brain-Assisted Target Speaker Extraction
Cunhang Fan
Ying Chen
Jian Zhou
Zexu Pan
Jingjing Zhang
Youdian Gao
Xiaoke Yang
Zhengqi Wen
Zhao Lv
27
0
0
31 May 2025
ZeroSep: Separate Anything in Audio with Zero Training
Chao Huang
Yuesheng Ma
J. Huang
Susan Liang
Yunlong Tang
Jing Bi
Wenqiang Liu
Nima Mesgarani
Chenliang Xu
DiffM
VLM
59
0
0
29 May 2025
MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
Yunkee Chae
Kyogu Lee
61
0
0
29 May 2025
Music Source Restoration
Yongyi Zang
Zheqi Dai
Mark D. Plumbley
Qiuqiang Kong
15
0
0
27 May 2025
Plug-and-Play Co-Occurring Face Attention for Robust Audio-Visual Speaker Extraction
Zexu Pan
Shengkui Zhao
Tingting Wang
Kun Zhou
Yukun Ma
Chong Zhang
B. Ma
35
0
0
27 May 2025
Text-Queried Audio Source Separation via Hierarchical Modeling
Xinlei Yin
Xiulian Peng
Xue Jiang
Zhiwei Xiong
Yan Lu
38
0
0
27 May 2025
Room Impulse Response as a Prompt for Acoustic Echo Cancellation
Fei Zhao
Shulin He
Xueliang Zhang
12
0
0
26 May 2025
Source Separation of Small Classical Ensembles: Challenges and Opportunities
Gerardo Roa Dabike
Trevor J. Cox
Jon P. Barker
Michael A. Akeroyd
Scott Bannister
...
Jennifer Firth
S. Graetzer
Alinka Greasley
Rebecca R. Vos
W. Whitmer
55
0
0
23 May 2025
Source Separation by Flow Matching
Robin Scheibler
John R. Hershey
Arnaud Doucet
Henry Li
82
0
0
22 May 2025
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers
Yuzhu Wang
Archontis Politis
Konstantinos Drossos
Tuomas Virtanen
42
0
0
22 May 2025
Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Hao Shi
Xugang Lu
Kazuki Shimada
Tatsuya Kawahara
DiffM
47
0
0
20 May 2025
Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation
Guo Chen
Kai Li
Runxuan Yang
Xiaolin Hu
AI4TS
84
0
0
19 May 2025
Learning to Highlight Audio by Watching Movies
Chao Huang
Ruohan Gao
J. M. F. Tsang
Jan Kurcius
Cagdas Bilen
Chenliang Xu
Anurag Kumar
Sanjeel Parekh
VGen
95
1
0
17 May 2025
ISAC: An Invertible and Stable Auditory Filter Bank with Customizable Kernels for ML Integration
Daniel Haider
Felix Perfler
Péter Balázs
Clara Hollomey
Nicki Holighaus
111
0
0
12 May 2025
Listen to Extract: Onset-Prompted Target Speaker Extraction
Pengjie Shen
Kangrui Chen
Shulin He
Pengru Chen
Shuqi Yuan
He Kong
Xueliang Zhang
Zehao Wang
96
0
0
08 May 2025
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation
Zhaoxi Mu
Xinyu Yang
Gang Wang
AuLLM
KELM
VLM
148
1
0
06 May 2025
MaskClip: Detachable Clip-on Piezoelectric Sensing of Mask Surface Vibrations for Real-time Noise-Robust Speech Input
Hirotaka Hiraki
Jun Rekimoto
57
0
0
04 May 2025
Passive Underwater Acoustic Signal Separation based on Feature Decoupling Dual-path Network
Yucheng Liu
Longyu Jiang
103
0
0
11 Apr 2025
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models
Beilong Tang
Bang Zeng
Ming Li
AI4TS
77
0
0
10 Apr 2025
EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling
Hao Yin
Shi Guo
Xu Jia
Xudong Xu
Lu Zhang
Si Liu
Dong Wang
Huchuan Lu
Tianfan Xue
73
0
0
03 Apr 2025
Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation
Wupeng Wang
Zexu Pan
Xianrui Li
Shuai Wang
Haizhou Li
AI4TS
75
0
0
03 Apr 2025
UniSep: Universal Target Audio Separation with Language Models at Scale
Yun Wang
Hangting Chen
Dongchao Yang
Weiqin Li
Dan Luo
Guangzhi Li
Shan Yang
Zhiyong Wu
Helen Meng
Xixin Wu
VLM
84
1
0
31 Mar 2025
Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch Boosting
Alimjan Mattursun
Liejun Wang
Yinfeng Yu
Chunyang Ma
112
0
0
27 Mar 2025
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices
Ci-Hao Wu
Tian-Sheuan Chang
113
1
0
27 Mar 2025
Wireless Hearables With Programmable Speech AI Accelerators
Malek Itani
Tuochao Chen
Arun Raghavan
Gavriel Kohlberg
Shyamnath Gollakota
AuLLM
84
0
0
24 Mar 2025
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition
Yufeng Yang
H. Taherian
Vahid Ahmadi Kalkhorani
DeLiang Wang
65
0
0
23 Mar 2025
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks
Ekaterina Dmitrieva
Maksim Kaledin
131
0
0
21 Mar 2025
Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation
Wupeng Wang
Zexu Pan
Jingru Lin
Shuai Wang
Haizhou Li
110
0
0
16 Mar 2025
Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction
Minsu Kim
Rodrigo Mira
Honglie Chen
Stavros Petridis
Maja Pantic
115
0
0
13 Mar 2025
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Boyi Kang
Xinfa Zhu
Zihan Zhang
Zhen Ye
Mingshuai Liu
...
Jun Chen
Longshuai Xiao
Chao Weng
Wei Xue
Lei Xie
AuLLM
165
3
0
01 Mar 2025
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
Nian Shao
Rui Zhou
Pengyu Wang
Xian Li
Ying Fang
Yujie Yang
Xiaofei Li
119
0
0
27 Feb 2025
Artifact-free Sound Quality in DNN-based Closed-loop Systems for Audio Processing
Chuan Wen
Guy Torfs
Sarah Verhulst
105
0
0
17 Feb 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Jinwei Dong
Xinsheng Wang
Qirong Mao
143
1
0
28 Jan 2025
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement
Jae-Sung Bae
Anastasia Kuznetsova
Dinesh Manocha
John Hershey
Trausti Kristjansson
Minje Kim
134
0
0
23 Jan 2025
30+ Years of Source Separation Research: Achievements and Future Challenges
S. Araki
N. Ito
Reinhold Haeb-Umbach
Gordon Wichern
Zhong-Qiu Wang
Yuki Mitsufuji
AI4TS
78
2
0
21 Jan 2025
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
David Perera
Victor Letzelter
Théo Mariotte
Adrien Cortés
Mickaël Chen
S. Essid
Gaël Richard
167
4
0
20 Jan 2025
Beyond Speaker Identity: Text Guided Target Speech Extraction
Mingyue Huo
Abhinav Jain
Cong Phuoc Huynh
Fanjie Kong
Pichao Wang
Zhu Liu
Vimal Bhat
76
1
0
17 Jan 2025
USED: Universal Speaker Extraction and Diarization
Junyi Ao
Mehmet Sinan Yildirim
Ruijie Tao
Mengyao Ge
Shuai Wang
Yan-min Qian
Haizhou Li
101
6
0
17 Jan 2025
Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Reinhold Haeb-Umbach
Tomohiro Nakatani
Marc Delcroix
Christoph Boeddeker
Tsubasa Ochiai
107
1
0
13 Jan 2025
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Gaël Richard
Xavier Alameda-Pineda
129
3
0
10 Jan 2025
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching
Yi Yuan
Xubo Liu
Haohe Liu
Mark D. Plumbley
Wenwu Wang
140
9
0
10 Jan 2025
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
86
5
0
31 Dec 2024