Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
VLM
ArXiv (abs) | PDF | HTML | Github (7986★)

Papers citing "Jukebox: A Generative Model for Music"

50 / 473 papers shown
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Yizhi Li
Ruibin Yuan
Ge Zhang
Yi Ma
Xingran Chen
...
Yemin Shi
Wen-Fen Huang
Zili Wang
Yi-Ting Guo
Jie Fu
121
130
0
31 May 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo
Tianyang Hu
Shifeng Zhang
Jiacheng Sun
Zhenguo Li
Zhihua Zhang
124
138
0
29 May 2023
Disentanglement via Latent Quantization
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
163
27
0
28 May 2023
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM, MGen
95
56
0
25 May 2023
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani
Alon Levkovitch
Roy Hirsch
Julián Salazar
Chulayutsh Asawaroengchai
Soroosh Mariooryad
Ehud Rivlin
RJ Skerry-Ryan
Michelle Tadmor Ramanovich
AuLLM
118
45
0
24 May 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen, DiffM
127
24
0
23 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
132
41
0
19 May 2023
MIDI-Draw: Sketching to Control Melody Generation
Tashi Namgyal
Peter A. Flach
Raúl Santos-Rodríguez
126
2
0
19 May 2023
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh
Brian Cheung
Pulkit Agrawal
Phillip Isola
MQ
60
55
0
15 May 2023
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems
Lukas Stappen
J. Dillmann
S. Striegel
Hans-Jörg Vögel
Nicolas Flores-Herr
Björn W. Schuller
69
9
0
15 May 2023
Generative Pre-trained Transformer: A Comprehensive Review on Enabling Technologies, Potential Applications, Emerging Challenges, and Future Directions
Gokul Yenduri
M. Ramalingam
G. C. Selvi
Y. Supriya
Gautam Srivastava
...
Rutvij H. Jhaveri
B. Prabadevi
Weizheng Wang
Athanasios V. Vasilakos
Thippa Reddy Gadekallu
AI4CE, LM&MA
90
189
0
11 May 2023
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
Kun Su
Judith Yue Li
Qingqing Huang
Dima Kuzmin
Joonseok Lee
...
Fei Sha
A. Jansen
Yu Wang
Mauro Verzetti
Timo I. Denk
VGen
86
14
0
11 May 2023
Shap-E: Generating Conditional 3D Implicit Functions
Heewoo Jun
Alex Nichol
DiffM
292
322
0
03 May 2023
Long-Term Rhythmic Video Soundtracker
Jiashuo Yu
Yaohui Wang
Xinyuan Chen
Xiao Sun
Yu Qiao
DiffM
105
13
0
02 May 2023
A Two-part Transformer Network for Controllable Motion Synthesis
Shuaiying Hou
Hongyu Tao
Hujun Bao
Weiwei Xu
ViT
80
6
0
25 Apr 2023
Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection
Ashay Patel
Petru-Daniel Tudosiu
W. H. Pinaya
G. Cook
Vicky Goh
Sebastien Ourselin
M. Jorge Cardoso
OOD, ViT, MedIm
94
11
0
14 Apr 2023
ChatGPT is all you need to decolonize sub-Saharan Vocational Education
Isidora Chara Tourni
G. Grigorakis
Isidoros Marougkas
Konstantinos M. Dafnis
Vassiliki‐Panagiota Tassopoulou
23
0
0
11 Apr 2023
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
65
2
0
10 Apr 2023
SC-VAE: Sparse Coding-based Variational Autoencoder with Learned ISTA
Pan Xiao
Peijie Qiu
Sungmin Ha
Abdalla Bani
Shuang Zhou
Aristeidis Sotiras
DRL
50
4
0
29 Mar 2023
GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents
Tenglong Ao
Zeyi Zhang
Libin Liu
DiffM, VGen
144
152
0
26 Mar 2023
Generative AI for Cyber Threat-Hunting in 6G-enabled IoT Networks
M. Ferrag
Merouane Debbah
Muna Al-Hawawreh
34
37
0
21 Mar 2023
IRGen: Generative Modeling for Image Retrieval
Yidan Zhang
Ting Zhang
Dong Chen
Yujing Wang
Qi Chen
...
Qi Zhang
Fan Yang
Mao Yang
Q. Liao
B. Guo
3DV, VLM
139
15
0
17 Mar 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin
Hao Li
Ze-Long Cheng
Kehan Li
Xiang Ji
Chang-rui Liu
Li-ming Yuan
Jie Chen
DiffM, VGen
93
58
0
17 Mar 2023
Vector Quantized Time Series Generation with a Bidirectional Prior Model
Daesoo Lee
Sara Malacarne
Erlend Aune
BDL
88
29
0
08 Mar 2023
Neural Vector Fields: Implicit Representation by Explicit Learning
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
AI4CE
101
18
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
108
554
0
07 Mar 2023
A General Framework for Learning Procedural Audio Models of Environmental Sounds
Danzel Serrano
M. Cartwright
DiffM, DRL
63
1
0
04 Mar 2023
Co-Speech Gesture Synthesis using Discrete Gesture Token Learning
Shuhong Lu
Youngwoo Yoon
Andrew W. Feng
SLR
88
12
0
04 Mar 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
79
1
0
15 Feb 2023
Video Probabilistic Diffusion Models in Projected Latent Space
Sihyun Yu
Kihyuk Sohn
Subin Kim
Jinwoo Shin
VGen, DiffM
115
172
0
15 Feb 2023
GFlowNet-EM for learning compositional latent variable models
J. E. Hu
Nikolay Malkin
Moksh Jain
Katie Everett
Alexandros Graikos
Yoshua Bengio
CoGe
103
41
0
13 Feb 2023
Vector Quantized Wasserstein Auto-Encoder
Tung-Long Vuong
Trung Le
He Zhao
Chuanxia Zheng
Mehrtash Harandi
Jianfei Cai
Dinh Q. Phung
DRL
70
20
0
12 Feb 2023
Noise2Music: Text-conditioned Music Generation with Diffusion Models
Qingqing Huang
Daniel S. Park
Tao Wang
Timo I. Denk
Andy Ly
...
Jesse Engel
Quoc V. Le
William Chan
Zhifeng Chen
Wei Han
MGen, DiffM
115
202
0
08 Feb 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
155
43
0
04 Feb 2023
QR-CLIP: Introducing Explicit Open-World Knowledge for Location and Time Reasoning
Weimin Shi
Mingchen Zhuge
D. Gao
Zhong Zhou
Ming-Ming Cheng
Deng-Ping Fan
LRM, VLM
88
0
0
02 Feb 2023
ArchiSound: Audio Generation with Diffusion
Flavio Schneider
72
25
0
30 Jan 2023
CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy
Mengyun Qiao
Shuo Wang
Huaqi Qiu
A. de Marvao
D. O’Regan
Daniel Rueckert
Wenjia Bai
MedIm
78
14
0
30 Jan 2023
SingSong: Generating musical accompaniments from singing
Chris Donahue
Antoine Caillon
Adam Roberts
Ethan Manilow
P. Esling
...
Mauro Verzetti
Ian Simon
Olivier Pietquin
Neil Zeghidour
Jesse Engel
110
55
0
30 Jan 2023
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Flavio Schneider
Ojasv Kamal
Zhijing Jin
Bernhard Schölkopf
MGen
125
84
0
27 Jan 2023
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
154
451
0
26 Jan 2023
Dance2MIDI: Dance-driven multi-instruments music generation
Bo Han
Yuheng Li
Yixuan Shen
Yi Ren
Feilin Han
138
5
0
22 Jan 2023
Self-Supervised Learning for Data Scarcity in a Fatigue Damage Prognostic Problem
A. Akrim
C. Gogu
R. Vingerhoeds
M. Salaün
AI4CE
102
25
0
20 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
85
7
0
16 Jan 2023
T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations
Jianrong Zhang
Yangsong Zhang
Xiaodong Cun
Shaoli Huang
Yong Zhang
Hongwei Zhao
Hongtao Lu
Xiaodong Shen
149
358
0
15 Jan 2023
Rock Guitar Tablature Generation via Natural Language Processing
Josue Casco-Rodriguez
73
1
0
12 Jan 2023
ChatGPT is not all you need. A State of the Art Review of large Generative AI models
Roberto Gozalo-Brizuela
E.C. Garrido-Merchán
91
268
0
11 Jan 2023
Latent Autoregressive Source Separation
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL, DRL
63
10
0
09 Jan 2023
Generating music with sentiment using Transformer-GANs
Pedro Neves
José Fornari
J. Florindo
MGen
57
23
0
21 Dec 2022
MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning
Yizhi Li
Ruibin Yuan
Ge Zhang
Yi Ma
Chenghua Lin
...
Haoyu He
Emmanouil Benetos
Norbert Gyenge
Ruibo Liu
Jie Fu
SSL
90
21
0
05 Dec 2022
Melody transcription via generative pre-training
Chris Donahue
John Thickstun
Percy Liang
68
18
0
04 Dec 2022