ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.02508
  4. Cited By
SDR - half-baked or well done?

SDR - half-baked or well done?

6 November 2018
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
ArXivPDFHTML

Papers citing "SDR - half-baked or well done?"

50 / 614 papers shown
Title
Addressing Feature Imbalance in Sound Source Separation
Addressing Feature Imbalance in Sound Source Separation
Jaechang Kim
Jeongyeon Hwang
Soheun Yi
Jaewoong Cho
Jungseul Ok
27
0
0
11 Sep 2023
Causal Signal-Based DCCRN with Overlapped-Frame Prediction for Online
  Speech Enhancement
Causal Signal-Based DCCRN with Overlapped-Frame Prediction for Online Speech Enhancement
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
31
2
0
07 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source
  Separation
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Aaron J. Hipple. Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
32
5
0
05 Sep 2023
Single-Channel Speech Enhancement with Deep Complex U-Networks and
  Probabilistic Latent Space Models
Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models
E. J. Nustede
Jörn Anemüller
29
3
0
04 Sep 2023
Remixing-based Unsupervised Source Separation from Scratch
Remixing-based Unsupervised Source Separation from Scratch
Kohei Saijo
Tetsuji Ogawa
18
3
0
01 Sep 2023
ReZero: Region-customizable Sound Extraction
ReZero: Region-customizable Sound Extraction
Rongzhi Gu
Yi Luo
38
13
0
31 Aug 2023
General Purpose Audio Effect Removal
General Purpose Audio Effect Removal
Matthew Rice
C. Steinmetz
Georgy Fazekas
Joshua D. Reiss
27
8
0
30 Aug 2023
Dual-path Transformer Based Neural Beamformer for Target Speech
  Extraction
Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Aoqi Guo
Sichong Qian
Baoxiang Li
Dazhi Gao
36
1
0
30 Aug 2023
Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy
  Challenge Baseline B1
Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1
Ünal Ege Gaznepoglu
Nils Peters
37
0
0
22 Aug 2023
TokenSplit: Using Discrete Speech Representations for Direct, Refined,
  and Transcript-Conditioned Speech Separation and Recognition
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Hakan Erdogan
Scott Wisdom
Xuankai Chang
Zalan Borsos
Marco Tagliasacchi
Neil Zeghidour
J. Hershey
21
9
0
21 Aug 2023
Explicit Estimation of Magnitude and Phase Spectra in Parallel for
  High-Quality Speech Enhancement
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
30
9
0
17 Aug 2023
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual
  Speech Separation
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation
Kai Li
Run Yang
Fuchun Sun
Xiaolin Hu
32
6
0
16 Aug 2023
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing
  Track
The Sound Demixing Challenge 2023 \unicodex2013\unicode{x2013}\unicodex2013 Cinematic Demixing Track
Stefan Uhlich
Giorgio Fabbro
M. Hirano
Shusuke Takahashi
Gordon Wichern
...
R. Solovyev
A. Stempkovskiy
T. Habruseva
M. Sukhovei
Yuki Mitsufuji
50
11
0
14 Aug 2023
Conformer-based Target-Speaker Automatic Speech Recognition for
  Single-Channel Audio
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio
Yang Zhang
Krishna C. Puvvada
Vitaly Lavrukhin
Boris Ginsburg
38
14
0
09 Aug 2023
Separate Anything You Describe
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yiitan Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
38
43
0
09 Aug 2023
Target Speech Extraction with Conditional Diffusion Model
Target Speech Extraction with Conditional Diffusion Model
Naoyuki Kamo
Marc Delcroix
Tomohiro Nakatan
DiffM
31
19
0
08 Aug 2023
Music De-limiter Networks via Sample-wise Gain Inversion
Music De-limiter Networks via Sample-wise Gain Inversion
Chang-Bin Jeon
Kyogu Lee
18
1
0
02 Aug 2023
SpatialNet: Extensively Learning Spatial Information for Multichannel
  Joint Speech Separation, Denoising and Dereverberation
SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Changsheng Quan
Xiaofei Li
18
36
0
31 Jul 2023
The Effect of Spoken Language on Speech Enhancement using
  Self-Supervised Speech Representation Loss Functions
The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions
George Close
Thomas Hain
Stefan Goetze
32
8
0
27 Jul 2023
Complete and separate: Conditional separation with missing target source
  attribute completion
Complete and separate: Conditional separation with missing target source attribute completion
Dimitrios Bralios
Efthymios Tzinis
Paris Smaragdis
37
0
0
27 Jul 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu
Cheng Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
DiffM
21
10
0
16 Jul 2023
Learning Spatial Features from Audio-Visual Correspondence in Egocentric
  Videos
Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
SSL
EgoV
41
4
0
10 Jul 2023
The CHiME-7 UDASE task: Unsupervised domain adaptation for
  conversational speech enhancement
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
Simon Leglaive
Léonie Borne
Efthymios Tzinis
M. Sadeghi
Matthieu Fraticelli
Scott Wisdom
Manuel Pariente
Daniel Pressnitzer
J. Hershey
32
17
0
07 Jul 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via
  Distance and Speaker Information
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
19
8
0
28 Jun 2023
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Matt Le
Apoorv Vyas
Bowen Shi
Brian Karrer
Leda Sari
...
Mary Williamson
Vimal Manohar
Yossi Adi
Jay Mahadeokar
Wei-Ning Hsu
AuLLM
30
270
0
23 Jun 2023
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration
  Model
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
Jean-Marie Lemercier
J. Thiemann
Raphael Koning
Timo Gerkmann
DiffM
36
1
0
22 Jun 2023
Variance-Preserving-Based Interpolation Diffusion Models for Speech
  Enhancement
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Zilu Guo
Jun Du
Chin-Hui Lee
Yu Gao
Wen-bo Zhang
DiffM
34
10
0
14 Jun 2023
Unsupervised speech enhancement with deep dynamical generative speech
  and noise models
Unsupervised speech enhancement with deep dynamical generative speech and noise models
Xiaoyu Lin
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
29
3
0
13 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
56
290
0
11 Jun 2023
Audio-Visual Speech Enhancement With Selective Off-Screen Speech
  Extraction
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Tomoya Yoshinaga
Keitaro Tanaka
Shigeo Morishima
38
0
0
10 Jun 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated
  Convolution and Channel Attention
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Junyu Wang
30
1
0
09 Jun 2023
On the Behavior of Intrusive and Non-intrusive Speech Enhancement
  Metrics in Predictive and Generative Settings
On the Behavior of Intrusive and Non-intrusive Speech Enhancement Metrics in Predictive and Generative Settings
Danilo de Oliveira
Julius Richter
Jean-Marie Lemercier
Tal Peer
Timo Gerkmann
48
5
0
05 Jun 2023
Rethinking the visual cues in audio-visual speaker extraction
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
J. Dang
Shiliang Zhang
35
9
0
05 Jun 2023
Audio-Visual Speech Enhancement with Score-Based Generative Models
Audio-Visual Speech Enhancement with Score-Based Generative Models
Julius Richter
Simone Frintrop
Timo Gerkmann
DiffM
34
10
0
02 Jun 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
37
7
0
02 Jun 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion
  Model
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
26
1
0
01 Jun 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight
  Iterative Model
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
H. Martel
Julius Richter
Kai Li
Xiaolin Hu
Timo Gerkmann
VLM
22
9
0
31 May 2023
UNSSOR: Unsupervised Neural Speech Separation by Leveraging
  Over-determined Training Mixtures
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
38
10
0
31 May 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic
  Dataset with Ground Truths for Speech Separation
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
24
0
0
25 May 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised
  Representation Loss
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Hiroshi Sato
Ryo Masumura
Tsubasa Ochiai
Marc Delcroix
Takafumi Moriya
...
Kentaro Shinayama
Saki Mizuno
Mana Ihori
Tomohiro Tanaka
Nobukatsu Hojo
42
5
0
24 May 2023
Direction Specific Ambisonics Source Separation with End-To-End Deep
  Learning
Direction Specific Ambisonics Source Separation with End-To-End Deep Learning
Francesc Lluís
Nils Meyer-Kahlen
V. Chatziioannou
A. Hofmann
22
5
0
19 May 2023
Unsupervised Multi-channel Separation and Adaptation
Unsupervised Multi-channel Separation and Adaptation
Cong Han
K. Wilson
Scott Wisdom
J. Hershey
26
4
0
18 May 2023
Noise-Aware Speech Separation with Contrastive Learning
Noise-Aware Speech Separation with Contrastive Learning
Zizheng Zhang
Cheng Chen
Hsin-Hung Chen
Xiang Liu
Yuchen Hu
Eng Siong Chng
31
5
0
18 May 2023
Diffusion-Based Speech Enhancement with Joint Generative and Predictive
  Decoders
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi
Kazuki Shimada
M. Hirano
Takashi Shibuya
Yuichiro Koyama
Zhi-Wei Zhong
Shusuke Takahashi
Tatsuya Kawahara
Yuki Mitsufuji
DiffM
40
15
0
18 May 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with
  Convolutional Cross Attention in Multi-talker Conditions
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
Jie Zhang
Qingquan Xu
Qiu-shi Zhu
Zhenhua Ling
27
11
0
17 May 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
37
2
0
15 May 2023
Diffusion-based Signal Refiner for Speech Separation
Diffusion-based Signal Refiner for Speech Separation
M. Hirano
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
DiffM
41
4
0
10 May 2023
AudioSlots: A slot-centric generative model for audio separation
AudioSlots: A slot-centric generative model for audio separation
P. Reddy
Scott Wisdom
Klaus Greff
J. Hershey
Thomas Kipf
OCL
VLM
30
6
0
09 May 2023
A multimodal dynamical variational autoencoder for audiovisual speech
  representation learning
A multimodal dynamical variational autoencoder for audiovisual speech representation learning
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
38
11
0
05 May 2023
Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic
  Howling Suppression
Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression
Huatian Zhang
Meng Yu
Yuzhong Wu
Tao Yu
Dong Yu
38
3
0
04 May 2023
Previous
123456...111213
Next