ResearchTrend.AI
Jukebox: A Generative Model for Music


30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXiv (abs) · PDF · HTML · GitHub (7986★)

Papers citing "Jukebox: A Generative Model for Music"

50 / 473 papers shown
Estimating Musical Surprisal in Audio
Mathias Rose Bjare
Giorgia Cantisani
Stefan Lattner
Gerhard Widmer
80
0
0
13 Jan 2025
ARES: Auxiliary Range Expansion for Outlier Synthesis
Eui-Soo Jung
Hae-Hun Seo
Hyun-Woo Jung
Je-Geon Oh
Yoon-Yeong Kim
OODD
126
0
0
11 Jan 2025
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
88
5
0
31 Dec 2024
Hierarchical Vector Quantization for Unsupervised Action Segmentation
Federico Spurio
Emad Bahrami
Gianpiero Francesca
Juergen Gall
122
0
0
23 Dec 2024
When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization
Vivek Ramanujan
Kushal Tirumala
Armen Aghajanyan
Luke Zettlemoyer
Ali Farhadi
DiffM
124
3
0
20 Dec 2024
Dataset Augmentation by Mixing Visual Concepts
Abdullah Al Rahat
Hemanth Venkateswara
DiffM
116
0
0
19 Dec 2024
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
Chenyu Yang
Shuai Wang
Hangting Chen
Jianwei Yu
Wei Tan
Rongzhi Gu
Yongjun Xu
Yizhi Zhou
Haina Zhu
Haoyang Li
KELM
425
2
0
18 Dec 2024
Tuning Music Education: AI-Powered Personalization in Learning Music
Mayank Sanganeria
Rohan Gala
173
0
0
18 Dec 2024
Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise
Tornike Karchkhadze
Keren Shao
Shlomo Dubnov
104
0
0
12 Dec 2024
Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures
Alain Riou
Antonin Gagnere
Gaëtan Hadjeres
Stefan Lattner
Geoffroy Peeters
134
0
0
29 Nov 2024
Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation
Marco Pasini
J. Nistal
Stefan Lattner
George Fazekas
122
3
0
27 Nov 2024
Mixed-State Quantum Denoising Diffusion Probabilistic Model
Gino Kwun
Bingzhi Zhang
Quntao Zhuang
DiffM
169
2
0
26 Nov 2024
Representation Collapsing Problems in Vector Quantization
Wenhao Zhao
Qiran Zou
Rushi Shah
Dianbo Liu
109
2
0
25 Nov 2024
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
Luis Vilaca
Yi Yu
Paula Vinan
195
0
0
24 Nov 2024
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space
Armani Rodriguez
S. Kokalj-Filipovic
101
1
0
22 Nov 2024
Exploratory Study Of Human-AI Interaction For Hindustani Music
N. Shikarpur
Cheng-Zhi Anna Huang
162
0
0
21 Nov 2024
VQ-ACE: Efficient Policy Search for Dexterous Robotic Manipulation via Action Chunking Embedding
Chenyu Yang
Davide Liconti
Robert K. Katzschmann
108
2
0
05 Nov 2024
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence
Fuming You
Minghui Fang
Li Tang
Rongjie Huang
Yongqi Wang
Zhou Zhao
81
2
0
04 Nov 2024
Sing-On-Your-Beat: Simple Text-Controllable Accompaniment Generations
Quoc-Huy Trinh
Minh-Van Nguyen
Trong-Hieu Nguyen-Mau
Khoa Tran
Thanh Do
59
0
0
03 Nov 2024
Music Foundation Model as Generic Booster for Music Downstream Tasks
Weihsiang Liao
Yuhta Takida
Yukara Ikemiya
Zhi-Wei Zhong
Chieh-Hsin Lai
...
Stefan Uhlich
Taketo Akama
Woosung Choi
Yuichiro Koyama
Yuki Mitsufuji
237
1
0
02 Nov 2024
Emotion-Guided Image to Music Generation
Souraja Kundu
Saket Singh
Yuji Iwahori
54
3
0
29 Oct 2024
Fourier Head: Helping Large Language Models Learn Complex Probability Distributions
Nate Gillman
Daksh Aggarwal
Michael Freeman
Saurabh Singh
Chen Sun
AI4TS
110
4
0
29 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks
Farshad Jafari
Farzad Didehvar
Amin Gheibi
26
0
0
23 Oct 2024
SeisLM: a Foundation Model for Seismic Waveforms
Tianlin Liu
Jannes Münchmeyer
Laura Laurenti
C. Marone
Maarten V. de Hoop
Ivan Dokmanić
VLM
129
6
0
21 Oct 2024
OpenMU: Your Swiss Army Knife for Music Understanding
Mengjie Zhao
Zhi-Wei Zhong
Zhuoyuan Mao
Shiqi Yang
Wei-Hsiang Liao
Shusuke Takahashi
Hiromi Wakaki
Yuki Mitsufuji
OSLM
103
8
0
21 Oct 2024
SNAC: Multi-Scale Neural Audio Codec
Hubert Siuzdak
Florian Grötschla
Luca A. Lanzendörfer
49
19
0
18 Oct 2024
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization
Ruiqi Li
Siqi Zheng
Xize Cheng
Ziang Zhang
Shengpeng Ji
Zhou Zhao
VGen
123
9
0
16 Oct 2024
Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior
Mingyuan Yan
Jiawei Wu
Rushi Shah
Dianbo Liu
52
0
0
14 Oct 2024
Restructuring Vector Quantization with the Rotation Trick
Christopher Fifty
Ronald G. Junkins
Dennis Duan
Aniketh Iyer
Jerry W. Liu
Ehsan Amid
Sebastian Thrun
Christopher Ré
LLMSV
173
13
0
08 Oct 2024
Do Music Generation Models Encode Music Theory?
Megan Wei
Michael Freeman
Chris Donahue
Chen Sun
MGen
68
6
0
01 Oct 2024
Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces
Lilac Atassi
84
0
0
01 Oct 2024
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
Kun Su
Xiulong Liu
Eli Shlizerman
VGen
163
7
0
27 Sep 2024
EgoLM: Multi-Modal Language Model of Egocentric Motions
Fangzhou Hong
Vladimir Guzov
Hyo Jin Kim
Yuting Ye
Richard Newcombe
Ziwei Liu
Lingni Ma
81
4
0
26 Sep 2024
A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios
Christian Ganhor
Marta Moscati
Anna Hausberger
Shah Nawaz
Markus Schedl
65
2
0
26 Sep 2024
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis
Zixuan Wang
Jiayi Li
Xiaoyu Qin
Shikun Sun
Songtao Zhou
Jia Jia
Jiebo Luo
VGen
56
0
0
23 Sep 2024
Disentanglement with Factor Quantized Variational Autoencoders
Gulcin Baykal
M. Kandemir
Gözde B. Ünal
CoGeDRL
84
0
0
23 Sep 2024
AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model
Kazuma Komiya
Yoshihisa Fukuhara
58
0
0
21 Sep 2024
Learning Source Disentanglement in Neural Audio Codec
Xiaoyu Bie
Xubo Liu
Gaël Richard
108
2
0
17 Sep 2024
Prevailing Research Areas for Music AI in the Era of Foundation Models
Megan Wei
M. Modrzejewski
Aswin Sivaraman
Dorien Herremans
MedIm
94
2
0
14 Sep 2024
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Ye Bai
Haonan Chen
Jitong Chen
Zhuo Chen
Yi Deng
...
Hang Zhao
Ziyi Zhao
Dejian Zhong
Shicen Zhou
Pei Zou
DiffM
104
8
0
13 Sep 2024
An End-to-End Approach for Chord-Conditioned Song Generation
Shuochen Gao
Shun Lei
Fan Zhuo
Hangyu Liu
Feng Liu
Boshi Tang
Qiaochu Huang
Shiyin Kang
Zhiyong Wu
64
4
0
10 Sep 2024
Multi-Source Music Generation with Latent Diffusion
Zhongweiyang Xu
Debottam Dutta
Yu-Lin Wei
Romit Roy Choudhury
DiffM
124
2
0
10 Sep 2024
SongCreator: Lyrics-based Universal Song Generation
Shun Lei
Yixuan Zhou
Boshi Tang
Max W. Y. Lam
Feng Liu
Hangyu Liu
Jingcheng Wu
Shiyin Kang
Zhiyong Wu
Helen Meng
101
8
0
09 Sep 2024
Mel-RoFormer for Vocal Separation and Vocal Melody Transcription
Ju-Chiang Wang
Fan Zhang
Jitong Chen
63
2
0
07 Sep 2024
Applications and Advances of Artificial Intelligence in Music Generation: A Review
Yanxu Chen
Linshu Huang
Tian Gou
MGen
74
4
0
03 Sep 2024
Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers
Sohan Anisetty
James Hays
77
0
0
03 Sep 2024
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Shengpeng Ji
Ziyue Jiang
Xize Cheng
Yifu Chen
Minghui Fang
...
Rongjie Huang
Yidi Jiang
Qian Chen
Zhou Zhao
VLM
149
45
0
29 Aug 2024
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation
Alain Riou
Stefan Lattner
Gaëtan Hadjeres
Michael Anslow
Geoffroy Peeters
73
2
0
05 Aug 2024
Generating High-quality Symbolic Music Using Fine-grained Discriminators
Zhedong Zhang
Liang-Sheng Li
Jiehua Zhang
Zhenghui Hu
Hongkui Wang
Chenggang Yan
Jian Yang
Yuankai Qi
84
3
0
03 Aug 2024
Combining audio control and style transfer using latent diffusion
Andreas Maier
Yuliya Burankova
Anne Hartebrodt
David B. Blumenthal
DiffM
70
3
0
31 Jul 2024