Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.00968
Cited By
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
2 November 2023
Jaeyong Kang
Soujanya Poria
Dorien Herremans
MGen
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model"
21 / 21 papers shown
Title
MusFlow: Multimodal Music Generation via Conditional Flow Matching
Jiahao Song
Yuzhao Wang
37
0
0
18 Apr 2025
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
W. Dong
Changsheng Xu
69
0
0
17 Apr 2025
Generative AI for Film Creation: A Survey of Recent Advances
Ruihan Zhang
Borou Yu
Jiajian Min
Yetong Xin
Zheng Wei
...
Sijia Jiang
Peiwen Huang
Na Chen
Xuanxuan Liu
Anyi Rao
VGen
70
0
0
11 Apr 2025
Extending Visual Dynamics for Video-to-Music Generation
Xiaohao Liu
Teng Tu
Yunshan Ma
Tat-Seng Chua
VGen
64
0
0
10 Apr 2025
Mozualization: Crafting Music and Visual Representation with Multimodal AI
Wanfang Xu
Lixiang Zhao
Haiwen Song
Xinheng Song
Zhaolin Lu
Yu Liu
Min Chen
Eng Gee Lim
Lingyun Yu
VGen
19
0
0
05 Apr 2025
Vision-to-Music Generation: A Survey
Zhaokai Wang
Chenxi Bao
Le Zhuo
Jingrui Han
Yang Yue
Yihong Tang
Victor Shea-Jay Huang
Yue Liao
EGVM
VGen
76
1
0
27 Mar 2025
Cross-Modal Learning for Music-to-Music-Video Description Generation
Zhuoyuan Mao
Mengjie Zhao
Qiyu Wu
Zhi-Wei Zhong
Wei-Hsiang Liao
Hiromi Wakaki
Yuki Mitsufuji
DiffM
VGen
87
0
0
14 Mar 2025
AudioX: Diffusion Transformer for Anything-to-Audio Generation
Zeyue Tian
Yizhu Jin
Zhaoyang Liu
Ruibin Yuan
Xu Tan
Qifeng Chen
Wei Xue
Yu Guo
67
3
0
13 Mar 2025
FilmComposer: LLM-Driven Music Production for Silent Film Clips
Zhifeng Xie
Qile He
Youjia Zhu
Qiwei He
Mengtian Li
VGen
103
2
0
11 Mar 2025
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
Xuzhao Li
Zeliang Zhang
Chenliang Xu
VGen
93
7
0
08 Jan 2025
JEMA: A Joint Embedding Framework for Scalable Co-Learning with Multimodal Alignment
Joao Sousa
Roya Darabi
A. A. Sousa
Frank Brueckner
Luís Paulo Reis
Ana Reis
40
2
0
31 Oct 2024
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization
Ruiqi Li
Siqi Zheng
Xize Cheng
Ziang Zhang
Shengpeng Ji
Zhou Zhao
VGen
71
7
0
16 Oct 2024
SONIQUE: Video Background Music Generation Using Unpaired Audio-Visual Data
Liqian Zhang
Magdalena Fuentes
DiffM
VGen
39
3
0
04 Oct 2024
Prevailing Research Areas for Music AI in the Era of Foundation Models
Megan Wei
M. Modrzejewski
Aswin Sivaraman
Dorien Herremans
MedIm
43
1
0
14 Sep 2024
VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos
Yan-Bo Lin
Yu Tian
L. Yang
Gedas Bertasius
Heng Wang
VGen
34
7
0
11 Sep 2024
BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features
Jing Luo
Xinyu Yang
Dorien Herremans
34
3
0
15 Jul 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Zeyue Tian
Zhaoyang Liu
Ruibin Yuan
Jiahao Pan
Xiaoqiang Huang
Xu Tan
Xu Tan
Qifeng Chen
Yu Guo
VGen
104
16
0
06 Jun 2024
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Dinh-Viet-Toan Le
Louis Bigo
Mikaela Keller
Dorien Herremans
MedIm
32
9
0
27 Feb 2024
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
K. Cheuk
Ryosuke Sawata
Toshimitsu Uesaka
Naoki Murata
Naoya Takahashi
Shusuke Takahashi
Dorien Herremans
Yuki Mitsufuji
DiffM
45
16
0
11 Oct 2022
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data
K. Cheuk
Dorien Herremans
Li Su
58
32
0
11 Jul 2021
Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework
D. Makris
Kat R. Agres
Dorien Herremans
57
27
0
27 Apr 2021
1