ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00341
  4. Cited By
Jukebox: A Generative Model for Music

Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXiv (abs)PDFHTMLGithub (7986★)

Papers citing "Jukebox: A Generative Model for Music"

50 / 473 papers shown
Title
Between the AI and Me: Analysing Listeners' Perspectives on AI- and
  Human-Composed Progressive Metal Music
Between the AI and Me: Analysing Listeners' Perspectives on AI- and Human-Composed Progressive Metal Music
Yan Yang
Dongxu Li
Mathieu Barthet
66
2
0
31 Jul 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous
  Control
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
Atharva Mete
Haotian Xue
Albert Wilcox
Yongxin Chen
Animesh Garg
SSL
144
22
0
22 Jul 2024
Explainability Paths for Sustained Artistic Practice with AI
Explainability Paths for Sustained Artistic Practice with AI
Austin Tecks
Thomas Peschlow
Gabriel Vigliensoni
65
2
0
21 Jul 2024
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music
  Generation
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
Yun-Han Lan
Wen-Yi Hsiao
Hao-Chung Cheng
Yi-Hsuan Yang
90
9
0
21 Jul 2024
Stable Audio Open
Stable Audio Open
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
268
53
0
19 Jul 2024
PASTA: Controllable Part-Aware Shape Generation with Autoregressive
  Transformers
PASTA: Controllable Part-Aware Shape Generation with Autoregressive Transformers
Songlin Li
Despoina Paschalidou
Leonidas Guibas
101
2
0
18 Jul 2024
Latent Spaces Enable Transformer-Based Dose Prediction in Complex
  Radiotherapy Plans
Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans
E. Wang
Ryan Au
Pencilla Lang
Sarah Mattonen
MedIm
55
0
0
11 Jul 2024
WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds
WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds
Peizhuo Li
Sebastian Starke
Yuting Ye
Olga Sorkine-Hornung
101
6
0
11 Jul 2024
Exploring Real-Time Music-to-Image Systems for Creative Inspiration in
  Music Creation
Exploring Real-Time Music-to-Image Systems for Creative Inspiration in Music Creation
Meng Yang
Maria Teresa Llano
Jon McCormack
109
0
0
08 Jul 2024
Balance of Number of Embedding and their Dimensions in Vector
  Quantization
Balance of Number of Embedding and their Dimensions in Vector Quantization
Hang Chen
Sankepally Sainath Reddy
Ziwei Chen
Dianbo Liu
93
2
0
06 Jul 2024
A Framework for AI assisted Musical Devices
A Framework for AI assisted Musical Devices
Miguel Civit
Luis Muñoz-Saavedra
Francisco Cuadrado
Charles Tijus
Maria J. Escalona
39
1
0
03 Jul 2024
MuDiT & MuSiT: Alignment with Colloquial Expression in
  Description-to-Song Generation
MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation
Zihao Wang
Haoxuan Liu
Jiaxing Yu
Tao Zhang
Yan Liu
Kai Zhang
137
1
0
03 Jul 2024
Towards Training Music Taggers on Synthetic Data
Towards Training Music Taggers on Synthetic Data
N. Kroher
Steven Manangu
A. Pikrakis
66
1
0
02 Jul 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
181
0
0
27 Jun 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for
  Efficient Audio Synthesis and Beyond
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Marco Comunità
Zhi-Wei Zhong
Akira Takahashi
Shiqi Yang
Mengjie Zhao
Koichi Saito
Yukara Ikemiya
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
116
6
0
25 Jun 2024
The Music Maestro or The Musically Challenged, A Massive Music
  Evaluation Benchmark for Large Language Models
The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models
Jiajia Li
Lu Yang
Mingni Tang
Cong Chen
Zuchao Li
Ping Wang
Hai Zhao
LM&MA
86
6
0
22 Jun 2024
LARP: Language Audio Relational Pre-training for Cold-Start Playlist
  Continuation
LARP: Language Audio Relational Pre-training for Cold-Start Playlist Continuation
Rebecca Salganik
Xiaohao Liu
Yunshan Ma
Jian Kang
Tat-Seng Chua
CLL
100
2
0
20 Jun 2024
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of
  99%
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Lei Zhu
Fangyun Wei
Yanye Lu
Dong Chen
VLM
97
40
0
17 Jun 2024
Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in
  the Wild
Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild
Lingni Ma
Yuting Ye
Fangzhou Hong
Vladimir Guzov
Yifeng Jiang
...
C. Karen Liu
Ziwei Liu
Jakob Engel
R. D. Nardi
Richard Newcombe
94
25
0
14 Jun 2024
ToneUnit: A Speech Discretization Approach for Tonal Language Speech
  Synthesis
ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis
Dehua Tao
Daxin Tan
Y. Yeung
Xiao Chen
Tan Lee
84
3
0
13 Jun 2024
Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion
  Models
Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models
J. Nistal
Marco Pasini
Cyran Aouameur
M. Grachten
Stefan Lattner
DiffM
103
19
0
12 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
99
3
0
11 Jun 2024
MeLFusion: Synthesizing Music from Image and Language Cues using
  Diffusion Models
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
Sanjoy Chowdhury
Sayan Nag
K. J. Joseph
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
89
8
0
07 Jun 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Zeyue Tian
Zhaoyang Liu
Ruibin Yuan
Jiahao Pan
Xiaoqiang Huang
Xu Tan
Xu Tan
Qifeng Chen
Yu Guo
VGen
275
17
0
06 Jun 2024
An Independence-promoting Loss for Music Generation with Language Models
An Independence-promoting Loss for Music Generation with Language Models
Jean-Marie Lemercier
Simon Rouard
Jade Copet
Yossi Adi
Alexandre Défossez
159
1
0
04 Jun 2024
A Survey of Deep Learning Audio Generation Methods
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLMMedIm
109
2
0
31 May 2024
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Jiaben Chen
Xin Yan
Yihang Chen
Siyuan Cen
Qinwei Ma
Haoyu Zhen
Kaizhi Qian
Lie Lu
Chuang Gan
70
0
0
30 May 2024
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music
  Generation
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation
Cheng-i Wang
Julian McAuley
Taylor Berg-Kirkpatrick
Nicholas J. Bryan
120
12
0
30 May 2024
M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion
  Comprehension and Generation
M3^33GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation
Mingshuang Luo
Ruibing Hou
Hong Chang
Zimo Liu
Yaowei Wang
Shiguang Shan
85
10
0
25 May 2024
SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic
  Injection with Large-Scale Pre-Training Diffusion Models
SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic Injection with Large-Scale Pre-Training Diffusion Models
Qingrong Cheng
Xu Li
Xinghui Fu
DiffM
85
2
0
22 May 2024
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded
  Diffusion Models
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models
Ziyu Wang
Lejun Min
Gus Xia
DiffM
63
11
0
16 May 2024
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species
  Genomic Sequence Modeling
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
Siyuan Li
Zedong Wang
Zicheng Liu
Di Wu
Cheng Tan
Jiangbin Zheng
Yufei Huang
Stan Z. Li
79
8
0
13 May 2024
SonifyAR: Context-Aware Sound Generation in Augmented Reality
SonifyAR: Context-Aware Sound Generation in Augmented Reality
Xia Su
Jon E. Froehlich
Eunyee Koh
Chang Xiao
58
3
0
11 May 2024
Detecting music deepfakes is easy but actually hard
Detecting music deepfakes is easy but actually hard
Darius Afchar
Gabriel Meseguer-Brocal
Romain Hennequin
103
9
0
07 May 2024
Collage: Light-Weight Low-Precision Strategy for LLM Training
Collage: Light-Weight Low-Precision Strategy for LLM Training
Tao Yu
Gaurav Gupta
Karthick Gopalswamy
Amith R. Mamidala
Hao Zhou
Jeffrey Huynh
Youngsuk Park
Ron Diamant
Anoop Deoras
Jun Huan
MQ
99
3
0
06 May 2024
POPDG: Popular 3D Dance Generation with PopDanceSet
POPDG: Popular 3D Dance Generation with PopDanceSet
Zhenye Luo
Min Ren
Xuecai Hu
Yongzhen Huang
Li Yao
90
9
0
06 May 2024
SATO: Stable Text-to-Motion Framework
SATO: Stable Text-to-Motion Framework
Wenshuo Chen
Hongru Xiao
Erhang Zhang
Lijie Hu
Lei Wang
Mengyuan Liu
Chong Chen
100
9
0
02 May 2024
Sparse multi-view hand-object reconstruction for unseen environments
Sparse multi-view hand-object reconstruction for unseen environments
Yik Lung Pang
Changjae Oh
Andrea Cavallaro
86
2
0
02 May 2024
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized
  Transformers
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
Yuzhe Gu
Enmao Diao
106
4
0
30 Apr 2024
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Qixin Deng
Qikai Yang
Ruibin Yuan
Yipeng Huang
Yi Wang
...
Emmanouil Benetos
Wenwu Wang
Guangyu Xia
Wei Xue
Yi-Ting Guo
LLMAG
99
36
0
28 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
72
0
0
19 Apr 2024
MIDGET: Music Conditioned 3D Dance Generation
MIDGET: Music Conditioned 3D Dance Generation
Jinwu Wang
Wei Mao
Miaomiao Liu
74
0
0
18 Apr 2024
Large Language Models: From Notes to Musical Form
Large Language Models: From Notes to Musical Form
Lilac Atassi
90
0
0
18 Apr 2024
A Data-Driven Representation for Sign Language Production
A Data-Driven Representation for Sign Language Production
Harry Walsh
Abolfazl Ravanshad
Mariam Rahmani
Richard Bowden
SLR
78
5
0
17 Apr 2024
Long-form music generation with latent diffusion
Long-form music generation with latent diffusion
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
MGenDiffM
124
45
0
16 Apr 2024
Foundational GPT Model for MEG
Foundational GPT Model for MEG
Richard Csaky
M. Es
Oiwi Parker Jones
M. Woolrich
72
2
0
14 Apr 2024
Contextual Chart Generation for Cyber Deception
Contextual Chart Generation for Cyber Deception
David D. Nguyen
David Liebowitz
Surya Nepal
S. Kanhere
Sharif Abuadbba
99
0
0
07 Apr 2024
AI Royalties -- an IP Framework to Compensate Artists & IP Holders for
  AI-Generated Content
AI Royalties -- an IP Framework to Compensate Artists & IP Holders for AI-Generated Content
Pablo Ducru
Jonathan Raiman
Ronaldo Lemos
Clay Garner
George He
Hanna Balcha
Gabriel Souto
Sergio Branco
Celina Bottino
140
3
0
05 Apr 2024
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Junghyun Koo
Gordon Wichern
François Germain
Sameer Khurana
Jonathan Le Roux
103
5
0
02 Apr 2024
A Novel Audio Representation for Music Genre Identification in MIR
A Novel Audio Representation for Music Genre Identification in MIR
Navin Kamuni
Mayank Jindal
Arpita Soni
Sukender Reddy Mallreddy
Sharath Chandra Macha
VLM
69
7
0
01 Apr 2024
Previous
123456...8910
Next