Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.13535
Cited By
MusFlow: Multimodal Music Generation via Conditional Flow Matching
18 April 2025
Jiahao Song
Yuzhao Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MusFlow: Multimodal Music Generation via Conditional Flow Matching"
19 / 19 papers shown
Title
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens
Zhihao Du
Qian Chen
Shiliang Zhang
Kai Hu
Heng Lu
...
Siqi Zheng
Yue Gu
Ziyang Ma
Zhifu Gao
Zhijie Yan
DiffM
76
140
0
07 Jul 2024
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
86
17
0
24 Aug 2023
Improving and generalizing flow-based generative models with minibatch optimal transport
Alexander Tong
Kilian Fatras
Nikolay Malkin
G. Huguet
Yanlei Zhang
Jarrid Rector-Brooks
Guy Wolf
Yoshua Bengio
OOD
DiffM
OT
114
304
0
01 Feb 2023
Video Background Music Generation: Dataset, Method and Evaluation
Le Zhuo
Zhaokai Wang
Baisen Wang
Yue Liao
Chenxi Bao
Stanley Peng
Miao Lu
Xiaobo Li
Fei Fang
Si Liu
VGen
56
31
0
21 Nov 2022
A Review of Intelligent Music Generation Systems
Lei Wang
Ziyi Zhao
Han Liu
Junwei Pang
Yi-qiang Qin
Qidi Wu
MGen
49
35
0
16 Nov 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
209
1,043
0
07 Sep 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
555
4,409
0
28 Jan 2022
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di
Jiang
Sihan Liu
Zhaokai Wang
Leyan Zhu
Zexin He
Hongming Liu
Shuicheng Yan
76
93
0
16 Nov 2021
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
123
849
0
22 Jun 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
169
1,584
0
18 Apr 2021
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers
Wenhui Wang
Hangbo Bao
Shaohan Huang
Li Dong
Furu Wei
MQ
96
269
0
31 Dec 2020
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
353
6,566
0
26 Nov 2020
A Survey on Contrastive Self-supervised Learning
Ashish Jaiswal
Ashwin Ramesh Babu
Mohammad Zaki Zadeh
Debapriya Banerjee
F. Makedon
SSL
129
1,398
0
31 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,947
0
12 Oct 2020
Neural Ordinary Differential Equations
T. Chen
Yulia Rubanova
J. Bettencourt
David Duvenaud
AI4CE
437
5,157
0
19 Jun 2018
Computational Optimal Transport
Gabriel Peyré
Marco Cuturi
OT
229
2,150
0
01 Mar 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
FMA: A Dataset For Music Analysis
M. Defferrard
Kirell Benzi
P. Vandergheynst
Xavier Bresson
113
440
0
06 Dec 2016
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances
Marco Cuturi
OT
220
4,288
0
04 Jun 2013
1