ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00341
  4. Cited By
Jukebox: A Generative Model for Music

Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXiv (abs)PDFHTMLGithub (7986★)

Papers citing "Jukebox: A Generative Model for Music"

50 / 473 papers shown
Title
AccoMontage: Accompaniment Arrangement via Phrase Selection and Style
  Transfer
AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer
Jingwei Zhao
Gus Xia
65
26
0
25 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
115
162
0
19 Aug 2021
A Benchmarking Initiative for Audio-Domain Music Generation Using the
  Freesound Loop Dataset
A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset
Tun-Min Hung
Bo-Yu Chen
Yen-Tung Yeh
Yi-Hsuan Yang
57
12
0
03 Aug 2021
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio
  Synthesis with GANs
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs
J. Nistal
Stefan Lattner
G. Richard
78
9
0
03 Aug 2021
Musical Speech: A Transformer-based Composition Tool
Musical Speech: A Transformer-based Composition Tool
Jason dÉon
Sri Harsha Dumpala
Chandramouli Shama Sastry
Daniel Oore
Sageev Oore
57
1
0
02 Aug 2021
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
Zhaofeng Shi
57
7
0
01 Aug 2021
DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models
DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models
Pedro Sarmento
Adarsh Kumar
CJ Carr
Zack Zukowski
M. Barthet
Yi-Hsuan Yang
73
33
0
30 Jul 2021
Dance2Music: Automatic Dance-driven Music Generation
Dance2Music: Automatic Dance-driven Music Generation
Gunjan Aggarwal
Devi Parikh
MGen
125
23
0
13 Jul 2021
Codified audio language modeling learns useful representations for music
  information retrieval
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
146
91
0
12 Jul 2021
PocketVAE: A Two-step Model for Groove Generation and Control
PocketVAE: A Two-step Model for Groove Generation and Control
Kyungyun Lee
Wonil Kim
Juhan Nam
44
1
0
11 Jul 2021
BumbleBee: A Transformer for Music
BumbleBee: A Transformer for Music
L. Fenaux
Maria Juliana Quintero
114
2
0
07 Jul 2021
Evaluating Large Language Models Trained on Code
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELMALM
294
5,701
0
07 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
120
806
0
07 Jul 2021
A Generative Model for Raw Audio Using Transformer Architectures
A Generative Model for Raw Audio Using Transformer Architectures
Prateek Verma
C. Chafe
79
29
0
30 Jun 2021
Transflower: probabilistic autoregressive dance generation with
  multimodal attention
Transflower: probabilistic autoregressive dance generation with multimodal attention
Guillermo Valle Pérez
G. Henter
Jonas Beskow
A. Holzapfel
Pierre-Yves Oudeyer
Simon Alexanderson
128
43
0
25 Jun 2021
Divergence Frontiers for Generative Models: Sample Complexity,
  Quantization Effects, and Frontier Integrals
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals
Lang Liu
Krishna Pillutla
Sean Welleck
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
MQ
89
14
0
15 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
138
121
0
12 Jun 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Gal Greshler
Tamar Rott Shaham
T. Michaeli
102
25
0
11 Jun 2021
Score-based Generative Modeling in Latent Space
Score-based Generative Modeling in Latent Space
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
116
688
0
10 Jun 2021
Generative Models as a Data Source for Multiview Representation Learning
Generative Models as a Data Source for Multiview Representation Learning
Ali Jahanian
Xavier Puig
Yonglong Tian
Phillip Isola
99
129
0
09 Jun 2021
Deep Neural Networks and End-to-End Learning for Audio Compression
Deep Neural Networks and End-to-End Learning for Audio Compression
Daniela N. Rim
I. Jang
Heeyoul Choi
55
9
0
25 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin
  Dynamics
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
451
8,015
0
11 May 2021
Which transformer architecture fits my data? A vocabulary bottleneck in
  self-attention
Which transformer architecture fits my data? A vocabulary bottleneck in self-attention
Noam Wies
Yoav Levine
Daniel Jannai
Amnon Shashua
92
20
0
09 May 2021
Computer-Aided Design as Language
Computer-Aided Design as Language
Yaroslav Ganin
Sergey Bartunov
Yujia Li
E. Keller
Stefano Saliceti
3DV
156
95
0
06 May 2021
One Billion Audio Sounds from GPU-enabled Modular Synthesis
One Billion Audio Sounds from GPU-enabled Modular Synthesis
Joseph P. Turian
Jordie Shier
George Tzanetakis
K. McNally
Max Henry
103
22
0
27 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViTVGen
325
513
0
20 Apr 2021
Geometry-Free View Synthesis: Transformers and no 3D Priors
Geometry-Free View Synthesis: Transformers and no 3D Priors
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
114
95
0
15 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
61
11
0
15 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
129
43
0
06 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised
  Representations
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
141
318
0
01 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
66
4
0
01 Apr 2021
Symbolic Music Generation with Diffusion Models
Symbolic Music Generation with Diffusion Models
Gautam Mittal
Jesse Engel
Curtis Hawthorne
Ian Simon
MGenDiffM
99
194
0
30 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
81
16
0
22 Mar 2021
Variable-rate discrete representation learning
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDLDRL
82
24
0
10 Mar 2021
Generating Images with Sparse Representations
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
93
211
0
05 Mar 2021
Predicting Video with VQVAE
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
128
69
0
02 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.1K
30,111
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
430
5,019
0
24 Feb 2021
Towards Causal Representation Learning
Towards Causal Representation Learning
Bernhard Schölkopf
Francesco Locatello
Stefan Bauer
Nan Rosemary Ke
Nal Kalchbrenner
Anirudh Goyal
Yoshua Bengio
OODCMLAI4CE
160
323
0
22 Feb 2021
Using Deep LSD to build operators in GANs latent space with meaning in
  real space
Using Deep LSD to build operators in GANs latent space with meaning in real space
J. Q. Toledo-Marín
J. Glazier
GAN
122
3
0
09 Feb 2021
Understanding the Tradeoffs in Client-side Privacy for Downstream Speech
  Tasks
Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks
Peter Wu
Paul Pu Liang
Jiatong Shi
Ruslan Salakhutdinov
Shinji Watanabe
Louis-Philippe Morency
65
9
0
22 Jan 2021
MP3net: coherent, minute-long music generation from raw audio with a
  simple convolutional GAN
MP3net: coherent, minute-long music generation from raw audio with a simple convolutional GAN
Korneel van den Broek
MGen
71
7
0
12 Jan 2021
Generative Deep Learning for Virtuosic Classical Music: Generative
  Adversarial Networks as Renowned Composers
Generative Deep Learning for Virtuosic Classical Music: Generative Adversarial Networks as Renowned Composers
Daniel Szelogowski
MGenGAN
23
0
0
01 Jan 2021
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at
  Pitch
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
49
31
0
08 Dec 2020
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body
  Movements
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements
Kun Su
Xiulong Liu
Eli Shlizerman
91
29
0
07 Dec 2020
MTCRNN: A multi-scale RNN for directed audio texture synthesis
MTCRNN: A multi-scale RNN for directed audio texture synthesis
M. Huzaifah
L. Wyse
78
2
0
25 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them
  on Images
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDLVLM
192
353
0
20 Nov 2020
A Comprehensive Survey on Deep Music Generation: Multi-level
  Representations, Algorithms, Evaluations, and Future Directions
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions
Shulei Ji
Jing Luo
Xinyu Yang
MGen
61
126
0
13 Nov 2020
Scaling Laws for Autoregressive Generative Modeling
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
143
434
0
28 Oct 2020
Previous
123...1089
Next