ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00341
  4. Cited By
Jukebox: A Generative Model for Music

Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXivPDFHTML

Papers citing "Jukebox: A Generative Model for Music"

50 / 461 papers shown
Title
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in
  Music Audio Signals
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in Music Audio Signals
Yiming Wu
CoGe
DRL
32
0
0
06 Sep 2023
Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks
  and Zero-Curl Regularization
Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks and Zero-Curl Regularization
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
42
2
0
04 Sep 2023
MAGMA: Music Aligned Generative Motion Autodecoder
MAGMA: Music Aligned Generative Motion Autodecoder
Sohan Anisetty
Amit Raj
James Hays
26
0
0
03 Sep 2023
Priority-Centric Human Motion Generation in Discrete Latent Space
Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong
Kehong Gong
Dongze Lian
Michael Bi Mi
Xinchao Wang
DiffM
37
51
0
28 Aug 2023
A Comprehensive Survey for Evaluation Methodologies of AI-Generated
  Music
A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music
Zeyu Xiong
Weitao Wang
Jing Yu
Yue Lin
Ziyan Wang
MGen
33
6
0
26 Aug 2023
Sparks of Large Audio Models: A Survey and Outlook
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Min Zhang
Björn W. Schuller
LM&MA
AuLLM
35
38
0
24 Aug 2023
A Survey of AI Music Generation Tools and Models
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
35
14
0
24 Aug 2023
Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Xiyu Wang
Baijiong Lin
Daochang Liu
Chang Xu
DiffM
37
3
0
23 Aug 2023
Example-Based Framework for Perceptually Guided Audio Texture Generation
Example-Based Framework for Perceptually Guided Audio Texture Generation
Purnima Kamath
Chitralekha Gupta
L. Wyse
Suranga Nanayakkara
24
4
0
23 Aug 2023
Music Understanding LLaMA: Advancing Text-to-Music Generation with
  Question Answering and Captioning
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Shansong Liu
Atin Sakkeer Hussain
Chenshuo Sun
Yin Shan
MLLM
32
46
0
22 Aug 2023
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Zhaohui Li
Haitao Wang
Xinghua Jiang
40
1
0
14 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGen
VLM
DiffM
72
37
0
09 Aug 2023
Generative AI for Medical Imaging: extending the MONAI Framework
Generative AI for Medical Imaging: extending the MONAI Framework
W. H. Pinaya
M. Graham
E. Kerfoot
Petru-Daniel Tudosiu
J. Dafflon
...
Andrew Feng
Marc Modat
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
SyDa
MedIm
48
64
0
27 Jul 2023
Online Clustered Codebook
Online Clustered Codebook
Chuanxia Zheng
Andrea Vedaldi
37
26
0
27 Jul 2023
IteraTTA: An interface for exploring both text prompts and audio priors
  in generating music with text-to-audio models
IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Hiromu Yakura
Masataka Goto
27
2
0
24 Jul 2023
Brain2Music: Reconstructing Music from Human Brain Activity
Brain2Music: Reconstructing Music from Human Brain Activity
Timo I. Denk
Yu Takagi
Takuya Matsuyama
A. Agostinelli
Tomoya Nakai
Christian Frank
Shinji Nishimoto
31
13
0
20 Jul 2023
Polyffusion: A Diffusion Model for Polyphonic Score Generation with
  Internal and External Controls
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
Lejun Min
Junyan Jiang
Gus Xia
Jingwei Zhao
DiffM
18
21
0
19 Jul 2023
On the Effectiveness of Speech Self-supervised Learning for Music
On the Effectiveness of Speech Self-supervised Learning for Music
Yi Ma
Ruibin Yuan
Yizhi Li
Ge Zhang
Xingran Chen
...
Ruibo Liu
Gus Xia
Roger Dannenberg
Yi-Ting Guo
Jie Fu
28
10
0
11 Jul 2023
VampNet: Music Generation via Masked Acoustic Token Modeling
VampNet: Music Generation via Masked Acoustic Token Modeling
Hugo Flores Garcia
Prem Seetharaman
Rithesh Kumar
Bryan Pardo
MGen
48
64
0
10 Jul 2023
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
S. Mohamadi
G. Mujtaba
Ngan Le
Gianfranco Doretto
Don Adjeroh
LM&MA
AI4MH
31
21
0
09 Jul 2023
The Ethical Implications of Generative Audio Models: A Systematic
  Literature Review
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
29
25
0
07 Jul 2023
Unsupervised 3D out-of-distribution detection with latent diffusion
  models
Unsupervised 3D out-of-distribution detection with latent diffusion models
M. Graham
W. H. Pinaya
P. Wright
Petru-Daniel Tudosiu
Y. Mah
...
H. Jäger
D. Werring
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffM
MedIm
25
9
0
07 Jul 2023
Hierarchical Neural Coding for Controllable CAD Model Generation
Hierarchical Neural Coding for Controllable CAD Model Generation
Xiang Xu
P. Jayaraman
Joseph G. Lambourne
Karl D. D. Willis
Yasutaka Furukawa
16
40
0
30 Jun 2023
Audio Embeddings as Teachers for Music Classification
Audio Embeddings as Teachers for Music Classification
Yiwei Ding
Alexander Lerch
33
5
0
30 Jun 2023
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by
  Whispering to ChatGPT
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Le Zhuo
Ruibin Yuan
Jiahao Pan
Yi Ma
Yizhi Li
...
Chenghua Lin
Emmanouil Benetos
Wenhu Chen
Wei Xue
Yi-Ting Guo
38
16
0
29 Jun 2023
Fauno: The Italian Large Language Model that will leave you senza
  parole!
Fauno: The Italian Large Language Model that will leave you senza parole!
Andrea Bacciu
Giovanni Trappolini
Andrea Santilli
Emanuele Rodolà
Fabrizio Silvestri
24
18
0
26 Jun 2023
DISCO-10M: A Large-Scale Music Dataset
DISCO-10M: A Large-Scale Music Dataset
Luca A. Lanzendörfer
Florian Grötschla
Emil Funke
Roger Wattenhofer
25
12
0
23 Jun 2023
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic
  Singing Voice Understanding Tasks: Three Case Studies
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case Studies
Yuya Yamamoto
33
2
0
22 Jun 2023
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Ruibin Yuan
Yi Ma
Yizhi Li
Ge Zhang
Xingran Chen
...
Si Liu
Shi Wang
Ruibo Liu
Yi-Ting Guo
Jie Fu
91
26
0
18 Jun 2023
The pop song generator: designing an online course to teach
  collaborative, creative AI
The pop song generator: designing an online course to teach collaborative, creative AI
M. Yee-King
A. Fiorucci
M. dÍnverno
20
0
0
15 Jun 2023
Unbiased Learning of Deep Generative Models with Structured Discrete
  Representations
Unbiased Learning of Deep Generative Models with Structured Discrete Representations
H. Bendekgey
Gabriel Hope
Erik B. Sudderth
OCL
BDL
DRL
30
1
0
14 Jun 2023
Better Generalization with Semantic IDs: A Case Study in Ranking for
  Recommendations
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations
Anima Singh
Trung Vu
Nikhil Mehta
Raghunandan H. Keshavan
M. Sathiamoorthy
...
Lukasz Heldt
Li Wei
Devansh Tandon
Ed H. Chi
Xinyang Yi
26
19
0
13 Jun 2023
Tokenization with Factorized Subword Encoding
Tokenization with Factorized Subword Encoding
David Samuel
Lilja Øvrelid
41
1
0
13 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
56
288
0
11 Jun 2023
Simple and Controllable Music Generation
Simple and Controllable Music Generation
Jade Copet
Felix Kreuk
Itai Gat
Tal Remez
David Kant
Gabriel Synnaeve
Yossi Adi
Alexandre Défossez
MGen
47
343
0
08 Jun 2023
Coupled Variational Autoencoder
Coupled Variational Autoencoder
Xiaoran Hao
Patrick Shafto
BDL
DRL
34
4
0
05 Jun 2023
MERT: Acoustic Music Understanding Model with Large-Scale
  Self-supervised Training
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Yizhi Li
Ruibin Yuan
Ge Zhang
Yi Ma
Xingran Chen
...
Yemin Shi
Wen-Fen Huang
Zili Wang
Yi-Ting Guo
Jie Fu
30
109
0
31 May 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From
  Pre-trained Diffusion Models
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo
Tianyang Hu
Shifeng Zhang
Jiacheng Sun
Zhenguo Li
Zhihua Zhang
35
107
0
29 May 2023
Disentanglement via Latent Quantization
Disentanglement via Latent Quantization
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
31
25
0
28 May 2023
Efficient Neural Music Generation
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM
MGen
34
49
0
25 May 2023
Spoken Question Answering and Speech Continuation Using
  Spectrogram-Powered LLM
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani
Alon Levkovitch
Roy Hirsch
Julián Salazar
Chulayutsh Asawaroengchai
Soroosh Mariooryad
Ehud Rivlin
RJ Skerry-Ryan
Michelle Tadmor Ramanovich
AuLLM
34
31
0
24 May 2023
Not All Image Regions Matter: Masked Vector Quantization for
  Autoregressive Image Generation
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen
DiffM
68
21
0
23 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation
  with Dynamic Vector Quantization
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
38
35
0
19 May 2023
MIDI-Draw: Sketching to Control Melody Generation
MIDI-Draw: Sketching to Control Melody Generation
Tashi Namgyal
Peter A. Flach
Raúl Santos-Rodríguez
20
2
0
19 May 2023
Straightening Out the Straight-Through Estimator: Overcoming
  Optimization Challenges in Vector Quantized Networks
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh
Brian Cheung
Pulkit Agrawal
Phillip Isola
MQ
36
48
0
15 May 2023
Integrating Generative Artificial Intelligence in Intelligent Vehicle
  Systems
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems
Lukas Stappen
J. Dillmann
S. Striegel
Hans-Jörg Vögel
Nicolas Flores-Herr
Björn W. Schuller
32
9
0
15 May 2023
Generative Pre-trained Transformer: A Comprehensive Review on Enabling
  Technologies, Potential Applications, Emerging Challenges, and Future
  Directions
Generative Pre-trained Transformer: A Comprehensive Review on Enabling Technologies, Potential Applications, Emerging Challenges, and Future Directions
Gokul Yenduri
M. Ramalingam
G. C. Selvi
Y. Supriya
Gautam Srivastava
...
Rutvij H. Jhaveri
B. Prabadevi
Weizheng Wang
Athanasios V. Vasilakos
Thippa Reddy Gadekallu
AI4CE
LM&MA
28
167
0
11 May 2023
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
Kun Su
Judith Yue Li
Qingqing Huang
Dima Kuzmin
Joonseok Lee
...
Fei Sha
A. Jansen
Yu Wang
Mauro Verzetti
Timo I. Denk
VGen
39
12
0
11 May 2023
Shap-E: Generating Conditional 3D Implicit Functions
Shap-E: Generating Conditional 3D Implicit Functions
Heewoo Jun
Alex Nichol
DiffM
203
311
0
03 May 2023
Long-Term Rhythmic Video Soundtracker
Long-Term Rhythmic Video Soundtracker
Jiashuo Yu
Yaohui Wang
Xinyuan Chen
Xiao Sun
Yu Qiao
DiffM
64
14
0
02 May 2023
Previous
123456...8910
Next