ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00341
  4. Cited By
Jukebox: A Generative Model for Music

Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXiv (abs)PDFHTMLGithub (7986★)

Papers citing "Jukebox: A Generative Model for Music"

50 / 473 papers shown
Title
State of the Art on Diffusion Models for Visual Computing
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
130
111
0
11 Oct 2023
LLark: A Multimodal Instruction-Following Language Model for Music
LLark: A Multimodal Instruction-Following Language Model for Music
Josh Gardner
Simon Durand
Daniel Stoller
Rachel M. Bittner
AuLLM
85
16
0
11 Oct 2023
EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational
  Autoencoders
EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders
Gulcin Baykal
M. Kandemir
Gözde B. Ünal
67
13
0
09 Oct 2023
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and
  Human-Computer Interaction with Integrated AI Generative Models
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models
Gabriele Tolomei
Cesare Campagnano
Fabrizio Silvestri
Giovanni Trappolini
81
4
0
07 Oct 2023
Soft Convex Quantization: Revisiting Vector Quantization with Convex
  Optimization
Soft Convex Quantization: Revisiting Vector Quantization with Convex Optimization
Tanmay Gautam
Reid Pryzant
Ziyi Yang
Chenguang Zhu
Somayeh Sojoudi
MQ
58
4
0
04 Oct 2023
Scaling Up Music Information Retrieval Training with Semi-Supervised
  Learning
Scaling Up Music Information Retrieval Training with Semi-Supervised Learning
Yun-Ning Hung
Ju-Chiang Wang
Minz Won
Duc Le
67
0
0
02 Oct 2023
Music- and Lyrics-driven Dance Synthesis
Music- and Lyrics-driven Dance Synthesis
Wenjie Yin
Qingyuan Yao
Yi Yu
Hang Yin
Danica Kragic
Mårten Björkman
DiffM
41
0
0
30 Sep 2023
XVO: Generalized Visual Odometry via Cross-Modal Self-Training
XVO: Generalized Visual Odometry via Cross-Modal Self-Training
Tohida Rehman
Ronit Mandal
Jimuyang Zhang
Debarshi Kumar Sanyal
SSL
134
21
0
28 Sep 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Albert Mohwald
109
17
0
28 Sep 2023
Finite Scalar Quantization: VQ-VAE Made Simple
Finite Scalar Quantization: VQ-VAE Made Simple
Fabian Mentzer
David C. Minnen
E. Agustsson
Michael Tschannen
111
190
0
27 Sep 2023
ID.8: Co-Creating Visual Stories with Generative AI
ID.8: Co-Creating Visual Stories with Generative AI
Victor Nikhil Antony
Chien-Ming Huang
107
27
0
25 Sep 2023
AI (r)evolution -- where are we heading? Thoughts about the future of
  music and sound technologies in the era of deep learning
AI (r)evolution -- where are we heading? Thoughts about the future of music and sound technologies in the era of deep learning
Giovanni Bindi
Nils Demerlé
Rodrigo Diaz
David Genova
Aliénor Golvet
...
Yixiao Zhang
Axel Roebel
Nick Bryan-Kinns
Jean-Louis Giavitto
M. Barthet
34
0
0
20 Sep 2023
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
Yihao Zhi
Xiaodong Cun
Xuelin Chen
Xi Shen
Wen Guo
Shaoli Huang
Shenghua Gao
68
28
0
17 Sep 2023
MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling
Bo Han
Yongkang Wong
Mohan Kankanhalli
Weidong Geng
DiffM
61
6
0
06 Sep 2023
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in
  Music Audio Signals
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in Music Audio Signals
Yiming Wu
CoGeDRL
115
0
0
06 Sep 2023
Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks
  and Zero-Curl Regularization
Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks and Zero-Curl Regularization
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
103
2
0
04 Sep 2023
MAGMA: Music Aligned Generative Motion Autodecoder
MAGMA: Music Aligned Generative Motion Autodecoder
Sohan Anisetty
Amit Raj
James Hays
59
0
0
03 Sep 2023
Priority-Centric Human Motion Generation in Discrete Latent Space
Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong
Kehong Gong
Dongze Lian
Michael Bi Mi
Xinchao Wang
DiffM
119
55
0
28 Aug 2023
A Comprehensive Survey for Evaluation Methodologies of AI-Generated
  Music
A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music
Zeyu Xiong
Weitao Wang
Jing Yu
Yue Lin
Ziyan Wang
MGen
83
7
0
26 Aug 2023
Sparks of Large Audio Models: A Survey and Outlook
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Min Zhang
Björn W. Schuller
LM&MAAuLLM
202
39
0
24 Aug 2023
A Survey of AI Music Generation Tools and Models
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
110
18
0
24 Aug 2023
Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Xiyu Wang
Baijiong Lin
Daochang Liu
Chang Xu
DiffM
99
3
0
23 Aug 2023
Example-Based Framework for Perceptually Guided Audio Texture Generation
Example-Based Framework for Perceptually Guided Audio Texture Generation
Purnima Kamath
Chitralekha Gupta
L. Wyse
Suranga Nanayakkara
48
4
0
23 Aug 2023
Music Understanding LLaMA: Advancing Text-to-Music Generation with
  Question Answering and Captioning
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Shansong Liu
Atin Sakkeer Hussain
Chenshuo Sun
Yin Shan
MLLM
86
55
0
22 Aug 2023
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Zhaohui Li
Haitao Wang
Xinghua Jiang
117
1
0
14 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGenVLMDiffM
167
42
0
09 Aug 2023
Generative AI for Medical Imaging: extending the MONAI Framework
Generative AI for Medical Imaging: extending the MONAI Framework
W. H. Pinaya
M. Graham
E. Kerfoot
Petru-Daniel Tudosiu
J. Dafflon
...
Andrew Feng
Marc Modat
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
SyDaMedIm
105
72
0
27 Jul 2023
Online Clustered Codebook
Online Clustered Codebook
Chuanxia Zheng
Andrea Vedaldi
96
34
0
27 Jul 2023
IteraTTA: An interface for exploring both text prompts and audio priors
  in generating music with text-to-audio models
IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Hiromu Yakura
Masataka Goto
79
2
0
24 Jul 2023
Brain2Music: Reconstructing Music from Human Brain Activity
Brain2Music: Reconstructing Music from Human Brain Activity
Timo I. Denk
Yu Takagi
Takuya Matsuyama
A. Agostinelli
Tomoya Nakai
Christian Frank
Shinji Nishimoto
86
14
0
20 Jul 2023
Polyffusion: A Diffusion Model for Polyphonic Score Generation with
  Internal and External Controls
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
Lejun Min
Junyan Jiang
Gus Xia
Jingwei Zhao
DiffM
97
22
0
19 Jul 2023
On the Effectiveness of Speech Self-supervised Learning for Music
On the Effectiveness of Speech Self-supervised Learning for Music
Yi Ma
Ruibin Yuan
Yizhi Li
Ge Zhang
Xingran Chen
...
Ruibo Liu
Gus Xia
Roger Dannenberg
Yi-Ting Guo
Jie Fu
67
10
0
11 Jul 2023
VampNet: Music Generation via Masked Acoustic Token Modeling
VampNet: Music Generation via Masked Acoustic Token Modeling
Hugo Flores Garcia
Prem Seetharaman
Rithesh Kumar
Bryan Pardo
MGen
93
68
0
10 Jul 2023
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
S. Mohamadi
Ghulam Mujtaba
Ngan Le
Gianfranco Doretto
Don Adjeroh
LM&MAAI4MH
113
21
0
09 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic
  Literature Review
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
86
32
0
07 Jul 2023
Unsupervised 3D out-of-distribution detection with latent diffusion
  models
Unsupervised 3D out-of-distribution detection with latent diffusion models
M. Graham
W. H. Pinaya
P. Wright
Petru-Daniel Tudosiu
Y. Mah
...
H. Jäger
D. Werring
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffMMedIm
82
11
0
07 Jul 2023
Hierarchical Neural Coding for Controllable CAD Model Generation
Hierarchical Neural Coding for Controllable CAD Model Generation
Xiang Xu
P. Jayaraman
Joseph G. Lambourne
Karl D. D. Willis
Yasutaka Furukawa
99
43
0
30 Jun 2023
Audio Embeddings as Teachers for Music Classification
Audio Embeddings as Teachers for Music Classification
Yiwei Ding
Alexander Lerch
65
5
0
30 Jun 2023
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by
  Whispering to ChatGPT
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Le Zhuo
Ruibin Yuan
Jiahao Pan
Yi Ma
Yizhi Li
...
Chenghua Lin
Emmanouil Benetos
Wenhu Chen
Wei Xue
Yi-Ting Guo
109
18
0
29 Jun 2023
Fauno: The Italian Large Language Model that will leave you senza
  parole!
Fauno: The Italian Large Language Model that will leave you senza parole!
Andrea Bacciu
Giovanni Trappolini
Andrea Santilli
Emanuele Rodolà
Fabrizio Silvestri
61
18
0
26 Jun 2023
DISCO-10M: A Large-Scale Music Dataset
DISCO-10M: A Large-Scale Music Dataset
Luca A. Lanzendörfer
Florian Grötschla
Emil Funke
Roger Wattenhofer
76
14
0
23 Jun 2023
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic
  Singing Voice Understanding Tasks: Three Case Studies
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case Studies
Yuya Yamamoto
45
2
0
22 Jun 2023
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Ruibin Yuan
Yi Ma
Yizhi Li
Ge Zhang
Xingran Chen
...
Si Liu
Shi Wang
Ruibo Liu
Yi-Ting Guo
Jie Fu
157
34
0
18 Jun 2023
The pop song generator: designing an online course to teach
  collaborative, creative AI
The pop song generator: designing an online course to teach collaborative, creative AI
M. Yee-King
A. Fiorucci
M. dÍnverno
34
0
0
15 Jun 2023
Unbiased Learning of Deep Generative Models with Structured Discrete
  Representations
Unbiased Learning of Deep Generative Models with Structured Discrete Representations
H. Bendekgey
Gabriel Hope
Erik B. Sudderth
OCLBDLDRL
58
1
0
14 Jun 2023
Better Generalization with Semantic IDs: A Case Study in Ranking for
  Recommendations
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations
Anima Singh
Trung Vu
Nikhil Mehta
Raghunandan H. Keshavan
M. Sathiamoorthy
...
Lukasz Heldt
Li Wei
Devansh Tandon
Ed H. Chi
Xinyang Yi
86
24
0
13 Jun 2023
Tokenization with Factorized Subword Encoding
Tokenization with Factorized Subword Encoding
David Samuel
Lilja Øvrelid
67
2
0
13 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
126
338
0
11 Jun 2023
Simple and Controllable Music Generation
Simple and Controllable Music Generation
Jade Copet
Felix Kreuk
Itai Gat
Tal Remez
David Kant
Gabriel Synnaeve
Yossi Adi
Alexandre Défossez
MGen
149
377
0
08 Jun 2023
Coupled Variational Autoencoder
Coupled Variational Autoencoder
Xiaoran Hao
Patrick Shafto
BDLDRL
74
4
0
05 Jun 2023
Previous
123456...8910
Next