ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.05148
  4. Cited By
DiffRoll: Diffusion-based Generative Music Transcription with
  Unsupervised Pretraining Capability
v1v2 (latest)

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability

11 October 2022
K. Cheuk
Ryosuke Sawata
Toshimitsu Uesaka
Naoki Murata
Naoya Takahashi
Shusuke Takahashi
Dorien Herremans
Yuki Mitsufuji
    DiffM
ArXiv (abs)PDFHTML

Papers citing "DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability"

18 / 18 papers shown
Title
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
Qijun Gan
Song Wang
Shengtao Wu
Jianke Zhu
237
1
0
13 Jun 2024
HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano
  Transcription
HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano Transcription
Weixing Wei
P. Li
Yi Yu
Wei Li
67
14
0
30 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with
  Self-Conditioning
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
94
308
0
08 Aug 2022
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
193
3,898
0
26 Jul 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
64
51
0
11 Jun 2022
Unaligned Supervision For Automatic Music Transcription in The Wild
Unaligned Supervision For Automatic Music Transcription in The Wild
Ben Maman
Amit H. Bermano
72
29
0
28 Apr 2022
Denoising Diffusion Restoration Models
Denoising Diffusion Restoration Models
Bahjat Kawar
Michael Elad
Stefano Ermon
Jiaming Song
DiffM
278
842
0
27 Jan 2022
Symbolic Music Generation with Diffusion Models
Symbolic Music Generation with Diffusion Models
Gautam Mittal
Jesse Engel
Curtis Hawthorne
Ian Simon
MGenDiffM
95
193
0
30 Mar 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
418
4,987
0
24 Feb 2021
Denoising Diffusion Implicit Models
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
286
7,454
0
06 Oct 2020
High-resolution Piano Transcription with Pedals by Regressing Onset and
  Offset Times
High-resolution Piano Transcription with Pedals by Regressing Onset and Offset Times
Qiuqiang Kong
Bochen Li
Xuchen Song
Yuan Wan
Yuxuan Wang
374
112
0
05 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffMBDL
155
1,466
0
21 Sep 2020
The impact of Audio input representations on neural network based music
  transcription
The impact of Audio input representations on neural network based music transcription
K. Cheuk
Kat R. Agres
Dorien Herremans
414
17
0
25 Jan 2020
GANSynth: Adversarial Neural Audio Synthesis
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
99
392
0
23 Feb 2019
Enabling Factorized Piano Music Modeling and Generation with the MAESTRO
  Dataset
Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset
Curtis Hawthorne
Andriy Stasyuk
Adam Roberts
Ian Simon
Cheng-Zhi Anna Huang
Sander Dieleman
Erich Elsen
Jesse Engel
Douglas Eck
416
452
0
29 Oct 2018
Onsets and Frames: Dual-Objective Piano Transcription
Onsets and Frames: Dual-Objective Piano Transcription
Curtis Hawthorne
Erich Elsen
Jialin Song
Adam Roberts
Ian Simon
Colin Raffel
Jesse Engel
Sageev Oore
Douglas Eck
176
280
0
30 Oct 2017
MuseGAN: Multi-track Sequential Generative Adversarial Networks for
  Symbolic Music Generation and Accompaniment
MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment
Hao-Wen Dong
Wen-Yi Hsiao
Li-Chia Yang
Yi-Hsuan Yang
MGenGAN
129
547
0
19 Sep 2017
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDaDiffM
306
7,005
0
12 Mar 2015
1