Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.08386
Cited By
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
15 June 2020
Xavier Favory
K. Drossos
Tuomas Virtanen
Xavier Serra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations"
20 / 20 papers shown
Title
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
Ruben Ciranni
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Giorgio Fabbro
Emanuele Rodolà
Luca Cosmo
76
7
0
10 Jan 2025
Efficient Supervised Training of Audio Transformers for Music Representation Learning
Pablo Alonso-Jiménez
Xavier Serra
Dmitry Bogdanov
ViT
35
3
0
28 Sep 2023
Weakly-supervised Automated Audio Captioning via text only training
Theodoros Kouzelis
Vassilis Katsouros
CLIP
40
6
0
21 Sep 2023
Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity
Pablo Alonso-Jiménez
Xavier Favory
Hadrien Foroughmand
Grigoris Bourdalas
Xavier Serra
T. Lidy
Dmitry Bogdanov
42
6
0
24 Apr 2023
Improving Self-Supervised Learning for Audio Representations by Feature Diversity and Decorrelation
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
SSL
42
3
0
07 Mar 2023
Multi-Source Contrastive Learning from Musical Audio
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
34
6
0
14 Feb 2023
Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning
Huimin Wu
Chenyang Lei
Xiao Sun
Pengju Wang
Qifeng Chen
Kwang-Ting Cheng
Stephen Lin
Zhirong Wu
MQ
38
5
0
19 Dec 2022
An empirical study of weakly supervised audio tagging embeddings for general audio representations
Heinrich Dinkel
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
43
1
0
30 Sep 2022
MuLan: A Joint Embedding of Music Audio and Natural Language
Qingqing Huang
A. Jansen
Joonseok Lee
Ravi Ganti
Judith Yue Li
D. Ellis
30
131
0
26 Aug 2022
Contrastive Audio-Language Learning for Music
Ilaria Manco
Emmanouil Benetos
Elio Quinton
Gyorgy Fazekas
27
44
0
25 Aug 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
36
53
0
15 Apr 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
42
106
0
02 Mar 2022
Learning music audio representations via weak language supervision
Ilaria Manco
Emmanouil Benetos
Elio Quinton
Gyorgy Fazekas
22
33
0
08 Dec 2021
Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning
Benno Weck
Xavier Favory
K. Drossos
Xavier Serra
23
8
0
14 Oct 2021
One Billion Audio Sounds from GPU-enabled Modular Synthesis
Joseph P. Turian
Jordie Shier
George Tzanetakis
K. McNally
Max Henry
21
22
0
27 Apr 2021
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
38
175
0
11 Mar 2021
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Yuma Koizumi
Yasunori Ohishi
Daisuke Niizumi
Daiki Takeuchi
Masahiro Yasuda
30
40
0
14 Dec 2020
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags
Xavier Favory
K. Drossos
Tuomas Virtanen
Xavier Serra
32
15
0
27 Oct 2020
Contrastive Representation Learning: A Framework and Review
Phúc H. Lê Khắc
Graham Healy
Alan F. Smeaton
SSL
AI4TS
186
687
0
10 Oct 2020
FSD50K: An Open Dataset of Human-Labeled Sound Events
Eduardo Fonseca
Xavier Favory
Jordi Pons
F. Font
Xavier Serra
26
438
0
01 Oct 2020
1