Hierarchical Diffusion Models for Singing Voice Neural Vocoder

14 October 2022
Naoya Takahashi
Mayank Kumar Singh
Yuki Mitsufuji
    DiffM

Papers citing "Hierarchical Diffusion Models for Singing Voice Neural Vocoder"

12 / 12 papers shown
SilentCipher: Deep Audio Watermarking
Mayank Kumar Singh
Naoya Takahashi
Weihsiang Liao
Yuki Mitsufuji
38
7
0
06 Jun 2024
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
Yukiya Hono
Kei Hashimoto
Yoshihiko Nankaku
Keiichi Tokuda
DiffM
27
2
0
22 Feb 2024
FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder
Tan Dat Nguyen
Ji-Hoon Kim
Youngjoon Jang
Jaehun Kim
Joon Son Chung
DiffM
34
5
0
18 Jan 2024
Reconstruction of Sound Field through Diffusion Models
F. Miotello
Luca Comanducci
Mirco Pezzoli
Alberto Bernardini
Fabio Antonacci
Augusto Sarti
DiffM
27
7
0
14 Dec 2023
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Krishna Subramani
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
11
3
0
25 Sep 2023
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Takashi Shibuya
Yuhta Takida
Yuki Mitsufuji
16
11
0
06 Sep 2023
Enhancing Semantic Communication with Deep Generative Models -- An ICASSP Special Session Overview
Eleonora Grassucci
Yuki Mitsufuji
Ping Zhang
Danilo Comminiello
29
3
0
05 Sep 2023
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Robin San Roman
Yossi Adi
Antoine Deleforge
Romain Serizel
Gabriel Synnaeve
Alexandre Défossez
DiffM
19
21
0
02 Aug 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
16
25
0
07 Jul 2023
Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables
Chin-Yun Yu
Gyorgy Fazekas
28
7
0
29 Jun 2023
Iteratively Improving Speech Recognition and Voice Conversion
Mayank Singh
Naoya Takahashi
Ono Naoyuki
13
4
0
24 May 2023
Robust One-Shot Singing Voice Conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DiffM
15
8
0
20 Oct 2022