ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.12477
46
0
v1v2 (latest)

ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning

19 September 2024
Daewoong Kim
Hao-Wen Dong
Dasaem Jeong
ArXiv (abs)PDFHTML
Abstract

Modeling the natural contour of fundamental frequency (F0) plays a critical role in music audio synthesis. However, transcribing and managing multiple F0 contours in polyphonic music is challenging, and explicit F0 contour modeling has not yet been explored for polyphonic instrumental synthesis. In this paper, we present ViolinDiff, a two-stage diffusion-based synthesis framework. For a given violin MIDI file, the first stage estimates the F0 contour as pitch bend information, and the second stage generates mel spectrogram incorporating these expressive details. The quantitative metrics and listening test results show that the proposed model generates more realistic violin sounds than the model without explicit pitch bend modeling. Audio samples are available online: daewoung.github.io/ViolinDiff-Demo.

View on arXiv
@article{kim2025_2409.12477,
  title={ ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning },
  author={ Daewoong Kim and Hao-Wen Dong and Dasaem Jeong },
  journal={arXiv preprint arXiv:2409.12477},
  year={ 2025 }
}
Comments on this paper