ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.04756
  4. Cited By
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A
  Comprehensive Evaluation
v1v2 (latest)

DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation

9 August 2022
Da-Yi Wu
Wen-Yi Hsiao
Fu-Rong Yang
Oscar D. Friedman
Warren Jackson
Scott Bruzenak
Yi-Wen Liu
Yi-Hsuan Yang
    DiffM
ArXiv (abs)PDFHTML

Papers citing "DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation"

37 / 37 papers shown
Title
Designing Neural Synthesizers for Low-Latency Interaction
Designing Neural Synthesizers for Low-Latency Interaction
Franco Caspe
Jordie Shier
Mark Sandler
C. Saitis
Andrew Mcpherson
426
0
0
14 Mar 2025
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech
  Synthesis
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
Yi Ren
Zhou Zhao
DiffM
69
172
0
21 Apr 2022
Streamable Neural Audio Synthesis With Non-Causal Convolutions
Streamable Neural Audio Synthesis With Non-Causal Convolutions
Antoine Caillon
P. Esling
83
12
0
14 Apr 2022
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality
  Speech Synthesis
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
DiffM
94
97
0
25 Mar 2022
Improving Adversarial Waveform Generation based Singing Voice Conversion
  with Harmonic Signals
Improving Adversarial Waveform Generation based Singing Voice Conversion with Harmonic Signals
Haohan Guo
Zhiping Zhou
Fanbo Meng
Kai-Chun Liu
85
16
0
25 Jan 2022
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for
  Singing Voice Synthesis
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Yu Wang
Xinsheng Wang
Pengcheng Zhu
Jie Wu
Hanzhao Li
Heyang Xue
Yongmao Zhang
Lei Xie
Mengxiao Bi
92
103
0
19 Jan 2022
Differentiable Wavetable Synthesis
Differentiable Wavetable Synthesis
Siyuan Shan
Lamtharn Hantrakul
Jitong Chen
Matt Avent
David Trevelyan
95
20
0
19 Nov 2021
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice
  Generation
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang
Chenye Cui
Feiyang Chen
Yi Ren
Jinglin Liu
Zhou Zhao
Baoxing Huai
N. Yuan
GAN
157
63
0
14 Oct 2021
Automatic DJ Transitions with Differentiable Audio Effects and
  Generative Adversarial Networks
Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks
Bo-Yu Chen
Wei-Han Hsu
Wei-Hsiang Liao
Marco A. Martínez-Ramírez
Yuki Mitsufuji
Yi-Hsuan Yang
GAN
72
8
0
13 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using
  Mel-spectrograms
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
57
5
0
08 Oct 2021
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System
Yukiya Hono
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
38
39
0
05 Aug 2021
Neural Waveshaping Synthesis
Neural Waveshaping Synthesis
B. Hayes
C. Saitis
Gyorgy Fazekas
81
28
0
11 Jul 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Gal Greshler
Tamar Rott Shaham
T. Michaeli
97
25
0
11 Jun 2021
Differentiable Signal Processing With Black-Box Audio Effects
Differentiable Signal Processing With Black-Box Audio Effects
Marco A. Martínez-Ramírez
Oliver Wang
Paris Smaragdis
Nicholas J. Bryan
51
31
0
11 May 2021
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu
Chengxi Li
Yi Ren
Feiyang Chen
Zhou Zhao
DiffM
144
269
0
06 May 2021
Lightweight and interpretable neural modeling of an audio distortion
  effect using hyperconditioned differentiable biquads
Lightweight and interpretable neural modeling of an audio distortion effect using hyperconditioned differentiable biquads
S. Nercessian
Andy M. Sarroff
K. Werner
38
29
0
15 Mar 2021
Real-time Timbre Transfer and Sound Synthesis using DDSP
Real-time Timbre Transfer and Sound Synthesis using DDSP
Francesco Ganis
Erik Frej Knudesn
Soren V. K. Lyster
Robin Otterbein
David Sudholt
Cumhur Erkut
42
10
0
12 Mar 2021
Latent Space Explorations of Singing Voice Synthesis using DDSP
Latent Space Explorations of Singing Voice Synthesis using DDSP
J. Alonso
Cumhur Erkut
138
12
0
12 Mar 2021
PeriodNet: A non-autoregressive waveform generation model with a
  structure separating periodic and aperiodic components
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components
Yukiya Hono
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
62
16
0
15 Feb 2021
Automatic multitrack mixing with a differentiable mixing console of
  neural audio effects
Automatic multitrack mixing with a differentiable mixing console of neural audio effects
C. Steinmetz
Jordi Pons
Santiago Pascual
Joan Serrà
111
50
0
20 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High
  Fidelity Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,952
0
12 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffMBDL
164
1,468
0
21 Sep 2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei Chen
Xu Tan
Jian Luan
Tao Qin
Tie-Yan Liu
VLM
87
93
0
03 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffMBDL
113
793
0
02 Sep 2020
Neural Granular Sound Synthesis
Neural Granular Sound Synthesis
Adrien Bitton
P. Esling
Tatsuya Harada
52
7
0
04 Aug 2020
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Yi Ren
Xu Tan
Tao Qin
Jian Luan
Zhou Zhao
Tie-Yan Liu
87
73
0
09 Jul 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,160
0
16 May 2020
DDSP: Differentiable Digital Signal Processing
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
181
381
0
14 Jan 2020
Parallel WaveGAN: A fast waveform generation model based on generative
  adversarial networks with multi-resolution spectrogram
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
62
820
0
25 Oct 2019
Sequence-to-sequence Singing Synthesis Using the Feed-forward
  Transformer
Sequence-to-sequence Singing Synthesis Using the Feed-forward Transformer
Merlijn Blaauw
J. Bonada
51
55
0
22 Oct 2019
MelGAN: Generative Adversarial Networks for Conditional Waveform
  Synthesis
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
Kundan Kumar
Rithesh Kumar
T. Boissière
L. Gestin
Wei Zhen Teoh
Jose M. R. Sotelo
A. D. Brébisson
Yoshua Bengio
Aaron Courville
GAN
168
958
0
08 Oct 2019
Adversarially Trained End-to-end Korean Singing Voice Synthesis System
Adversarially Trained End-to-end Korean Singing Voice Synthesis System
Juheon Lee
Hyeong-Seok Choi
Chang-Bin Jeon
Junghyun Koo
Kyogu Lee
63
78
0
06 Aug 2019
Neural source-filter waveform models for statistical parametric speech
  synthesis
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
87
118
0
27 Apr 2019
Fréchet Audio Distance: A Metric for Evaluating Music Enhancement
  Algorithms
Fréchet Audio Distance: A Metric for Evaluating Music Enhancement Algorithms
Kevin Kilgour
Mauricio Zuluaga
Dominik Roblek
Matthew Sharifi
83
199
0
20 Dec 2018
Efficient Neural Audio Synthesis
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
94
870
0
23 Feb 2018
CREPE: A Convolutional Representation for Pitch Estimation
CREPE: A Convolutional Representation for Pitch Estimation
Jong Wook Kim
Justin Salamon
P. Li
J. P. Bello
69
385
0
17 Feb 2018
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,421
0
12 Sep 2016
1