ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12415
  4. Cited By
MuLan: A Joint Embedding of Music Audio and Natural Language

MuLan: A Joint Embedding of Music Audio and Natural Language

26 August 2022
Qingqing Huang
A. Jansen
Joonseok Lee
Ravi Ganti
Judith Yue Li
D. Ellis
ArXivPDFHTML

Papers citing "MuLan: A Joint Embedding of Music Audio and Natural Language"

20 / 20 papers shown
Title
GlobalMood: A cross-cultural benchmark for music emotion recognition
GlobalMood: A cross-cultural benchmark for music emotion recognition
Harin Lee
Elif Celen
Peter M. C. Harrison
Manuel Anglada-Tort
Pol van Rijn
Minsu Park
Marc Schönwiesner
Nori Jacoby
32
0
0
14 May 2025
NOTA: Multimodal Music Notation Understanding for Visual Large Language Model
NOTA: Multimodal Music Notation Understanding for Visual Large Language Model
Mingni Tang
Jiajia Li
Lu Yang
Zhiqiang Zhang
Jinghao Tian
Z. Li
L. Zhang
P. Wang
56
0
0
17 Feb 2025
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
Ruben Ciranni
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Giorgio Fabbro
Emanuele Rodolà
Luca Cosmo
74
7
0
10 Jan 2025
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)
S. Oota
Zijiao Chen
Manish Gupta
R. Bapi
G. Jobard
F. Alexandre
X. Hinaut
3DV
AI4CE
49
11
0
31 Dec 2024
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
Chenyu Yang
Shuai Wang
Hangting Chen
Jianwei Yu
Wei Tan
Rongzhi Gu
Y. Xu
Yizhi Zhou
Haina Zhu
H. Li
KELM
168
1
0
18 Dec 2024
Art2Mus: Bridging Visual Arts and Music through Cross-Modal Generation
Art2Mus: Bridging Visual Arts and Music through Cross-Modal Generation
Ivan Rinaldi
Nicola Fanelli
Giovanna Castellano
G. Vessio
31
2
0
07 Oct 2024
MusicLIME: Explainable Multimodal Music Understanding
MusicLIME: Explainable Multimodal Music Understanding
Theodoros Sotirou
Vassilis Lyberatos
Orfeas Menis-Mastromichalakis
Giorgos Stamou
28
2
0
16 Sep 2024
PAGURI: a user experience study of creative interaction with
  text-to-music models
PAGURI: a user experience study of creative interaction with text-to-music models
Francesca Ronchini
Luca Comanducci
Gabriele Perego
Fabio Antonacci
35
3
0
05 Jul 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
40
1
0
09 Jun 2024
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving
  Zero-Shot Voice Editing
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing
Philip Anastassiou
Zhenyu Tang
Kainan Peng
Dongya Jia
Jiaxin Li
Ming Tu
Yuping Wang
Yuxuan Wang
Mingbo Ma
42
4
0
10 Apr 2024
Correlation of Fréchet Audio Distance With Human Perception of
  Environmental Audio Is Embedding Dependant
Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Modan Tailleur
Junwon Lee
Mathieu Lagrange
Keunwoo Choi
Laurie M. Heller
Keisuke Imoto
Yuki Okamoto
22
10
0
26 Mar 2024
Content-based Controls For Music Large Language Modeling
Content-based Controls For Music Large Language Modeling
Liwei Lin
Gus Xia
Junyan Jiang
Yixiao Zhang
18
14
0
26 Oct 2023
MuseChat: A Conversational Music Recommendation System for Videos
MuseChat: A Conversational Music Recommendation System for Videos
Zhikang Dong
Bin Chen
Xiulong Liu
Paweł Polak
Peng Zhang
LRM
39
26
0
10 Oct 2023
Efficient Supervised Training of Audio Transformers for Music
  Representation Learning
Efficient Supervised Training of Audio Transformers for Music Representation Learning
Pablo Alonso-Jiménez
Xavier Serra
Dmitry Bogdanov
ViT
35
3
0
28 Sep 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
73
31
0
27 Aug 2023
Pengi: An Audio Language Model for Audio Tasks
Pengi: An Audio Language Model for Audio Tasks
Soham Deshmukh
Benjamin Elizalde
Rita Singh
Huaming Wang
MLLM
AuLLM
34
157
0
19 May 2023
Multi-Source Contrastive Learning from Musical Audio
Multi-Source Contrastive Learning from Musical Audio
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
24
6
0
14 Feb 2023
TimbreCLIP: Connecting Timbre to Text and Images
TimbreCLIP: Connecting Timbre to Text and Images
Nicolas Jonason
Bob L. T. Sturm
CLIP
25
4
0
21 Nov 2022
Codified audio language modeling learns useful representations for music
  information retrieval
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
78
86
0
12 Jul 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
298
3,700
0
11 Feb 2021
1