The challenge of realistic music generation: modelling raw audio at scale

26 June 2018
Sander Dieleman
Aaron van den Oord
Karen Simonyan

Papers citing "The challenge of realistic music generation: modelling raw audio at scale"

35 / 35 papers shown
Push-Grasp Policy Learning Using Equivariant Models and Grasp Score Optimization
Boce Hu
Heng Tian
Dian Wang
Haojie Huang
Xupeng Zhu
Robin Walters
Robert W. Platt
39
0
0
03 Apr 2025
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Xuying Zhang
Yutong Liu
Yangguang Li
Renrui Zhang
Y. Liu
...
Wanli Ouyang
Zhiwei Xiong
Peng Gao
Qibin Hou
Ming-Ming Cheng
127
3
0
13 Mar 2025
The GigaMIDI Dataset with Features for Expressive Music Performance Detection
Keon Ju M. Lee
J. Ens
Sara Adkins
Pedro Sarmento
M. Barthet
Philippe Pasquier
44
0
0
24 Feb 2025
Hookpad Aria: A Copilot for Songwriters
Chris Donahue
Shih-Lun Wu
Yewon Kim
Dave Carlton
Ryan Miyakawa
John Thickstun
53
1
0
12 Feb 2025
MusicScore: A Dataset for Music Score Modeling and Generation
Yuheng Lin
Zheqi Dai
Qiuqiang Kong
VLM
37
2
0
17 Jun 2024
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
Alexander H. Liu
Heng-Jui Chang
Michael Auli
Wei-Ning Hsu
James R. Glass
27
25
0
17 May 2023
SingSong: Generating musical accompaniments from singing
Chris Donahue
Antoine Caillon
Adam Roberts
Ethan Manilow
P. Esling
...
Mauro Verzetti
Ian Simon
Olivier Pietquin
Neil Zeghidour
Jesse Engel
34
52
0
30 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
21
5
0
16 Jan 2023
Generative Models for Improved Naturalness, Intelligibility, and Voicing of Whispered Speech
Dominik Wagner
Sebastian P. Bayerl
H. A. C. Maruri
Tobias Bocklet
21
7
0
04 Dec 2022
Learning Hierarchical Metrical Structure Beyond Measures
Junyan Jiang
Daniel Y. Chin
Yixiao Zhang
Gus Xia
39
4
0
21 Sep 2022
DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks
J. Nistal
Cyran Aouameur
Ithan Velarde
Stefan Lattner
GAN
37
4
0
29 Jun 2022
Dual Learning Music Composition and Dance Choreography
Shuang Wu
Zhenguang Liu
Shijian Lu
Li Cheng
21
8
0
28 Jan 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
32
657
0
17 Nov 2021
PixelPyramids: Exact Inference Models from Lossless Image Pyramids
Shweta Mahajan
Stefan Roth
TPM
12
2
0
17 Oct 2021
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
Haozhe Ji
Minlie Huang
15
23
0
12 Oct 2021
Musical Speech: A Transformer-based Composition Tool
Jason d'Eon
Sri Harsha Dumpala
Chandramouli Shama Sastry
Daniel Oore
Sageev Oore
18
1
0
02 Aug 2021
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
81
86
0
12 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
722
0
07 Jul 2021
A Generative Model for Raw Audio Using Transformer Architectures
Prateek Verma
C. Chafe
22
28
0
30 Jun 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters
Pritish Chandna
António Ramires
Xavier Serra
Emilia Gómez
19
4
0
21 May 2021
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
24
66
0
02 Mar 2021
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
24
18
0
11 Jul 2020
Unsupervised Cross-lingual Representation Learning for Speech Recognition
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
70
754
0
24 Jun 2020
Perceiving Music Quality with GANs
Agrin Hilmkil
Carl Thomé
Anders Arpteg
18
3
0
11 Jun 2020
Deep generative models for musical audio synthesis
M. Huzaifah
L. Wyse
27
20
0
10 Jun 2020
Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization
Jen-Yu Liu
Yu-Hua Chen
Yin-Cheng Yeh
Yi-Hsuan Yang
GAN
32
35
0
18 May 2020
Transferring neural speech waveform synthesizers to musical instrument sounds generation
Yi Zhao
Xin Wang
Lauri Juvela
Junichi Yamagishi
21
16
0
27 Oct 2019
Quant GANs: Deep Generation of Financial Time Series
Magnus Wiese
R. Knobloch
R. Korn
Peter Kretschmer
GAN
AI4TS
AIFin
22
273
0
15 Jul 2019
LakhNES: Improving multi-instrumental music generation with cross-domain pre-training
Chris Donahue
H. H. Mao
Yiting Li
G. Cottrell
Julian McAuley
30
116
0
10 Jul 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
16
1,767
0
02 Jun 2019
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
11
1,848
0
23 Apr 2019
Hierarchical Autoregressive Image Models with Auxiliary Decoders
J. Fauw
Sander Dieleman
Karen Simonyan
GAN
30
37
0
06 Mar 2019
Iris-GAN: Learning to Generate Realistic Iris Images Using Convolutional GAN
Shervin Minaee
AmirAli Abdolrashidi
GAN
VLM
21
30
0
12 Dec 2018
A Universal Music Translation Network
Noam Mor
Lior Wolf
Adam Polyak
Yaniv Taigman
9
110
0
21 May 2018
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
251
2,550
0
25 Jan 2016