Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.01279
Cited By
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
5 April 2017
Jesse Engel
Cinjon Resnick
Adam Roberts
Sander Dieleman
Douglas Eck
Karen Simonyan
Mohammad Norouzi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders"
50 / 127 papers shown
Title
DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis
Zeeshan Ahmad
Shudi Bao
Meng Chen
20
0
0
14 May 2025
Designing Neural Synthesizers for Low-Latency Interaction
Franco Caspe
Jordie Shier
Mark Sandler
C. Saitis
Andrew Mcpherson
189
0
0
14 Mar 2025
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument
Kyungsu Kim
Junghyun Koo
Sungho Lee
Haesun Joung
Kyogu Lee
58
0
0
13 Feb 2025
Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures
Alain Riou
Antonin Gagnere
Gaëtan Hadjeres
Stefan Lattner
Geoffroy Peeters
91
0
0
29 Nov 2024
Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation
Riyansha Singh
Parinita Nema
V. Kurmi
CLL
40
1
0
27 Jul 2024
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models
S. Nercessian
Johannes Imort
Ninon Devis
Frederik Blang
38
1
0
22 Jul 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
40
1
0
09 Jun 2024
Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning
Alain Riou
Stefan Lattner
Gaëtan Hadjeres
Geoffroy Peeters
43
2
0
14 May 2024
AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition
Kin Wai Lau
Yasar Abbas Ur Rehman
L. Po
44
1
0
21 Apr 2024
Track Role Prediction of Single-Instrumental Sequences
Changheon Han
Suhyun Lee
Minsam Ko
27
0
0
20 Apr 2024
uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Afrina Tabassum
Dung N. Tran
Trung D. Q. Dang
Ismini Lourentzou
K. Koishida
50
0
0
14 Mar 2024
Self-supervised Complex Network for Machine Sound Anomaly Detection
Miseul Kim
M. Ho
Hong-Goo Kang
22
8
0
21 Dec 2023
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Yunfei Chu
Jin Xu
Xiaohuan Zhou
Qian Yang
Shiliang Zhang
Zhijie Yan
Chang Zhou
Jingren Zhou
AuLLM
42
274
0
14 Nov 2023
InstrumentGen: Generating Sample-Based Musical Instruments From Text
S. Nercessian
Johannes Imort
29
2
0
07 Nov 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Xiaozhong Liu
78
31
0
27 Aug 2023
Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer
Honglin Mu
Wentian Xia
Wanxiang Che
22
1
0
19 Jul 2023
Pengi: An Audio Language Model for Audio Tasks
Soham Deshmukh
Benjamin Elizalde
Rita Singh
Huaming Wang
MLLM
AuLLM
39
159
0
19 May 2023
Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation
Yulin Pan
Xiangteng He
Biao Gong
Yuxin Peng
Yiliang Lv
SSL
24
0
0
15 Mar 2023
Improving Self-Supervised Learning for Audio Representations by Feature Diversity and Decorrelation
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
SSL
42
3
0
07 Mar 2023
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms
Ankit Parag Shah
Shuyi Chen
Kejun Zhou
Yue Chen
Bhiksha Raj
18
1
0
07 Mar 2023
Amortised Invariance Learning for Contrastive Self-Supervision
Ruchika Chavhan
Henry Gouk
Jan Stuehmer
Calum Heggan
Mehrdad Yaghoobi
Timothy M. Hospedales
SSL
40
11
0
24 Feb 2023
jazznet: A Dataset of Fundamental Piano Patterns for Music Audio Machine Learning Research
Tosiron Adegbija
SLR
21
6
0
17 Feb 2023
An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification
Zhi-Wei Zhong
M. Hirano
Kazuki Shimada
Kazuya Tateishi
Shusuke Takahashi
Yuki Mitsufuji
20
12
0
16 Feb 2023
Multi-Source Contrastive Learning from Musical Audio
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
27
6
0
14 Feb 2023
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemtslaw Spurek
Jacek Tabor
Tomasz Trzciñski
24
11
0
09 Feb 2023
Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis
Cyrus Vahidi
Han Han
Changhong Wang
Mathieu Lagrange
Gyorgy Fazekas
Vincent Lostanlen
16
8
0
24 Jan 2023
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots
Peixin Chang
Shuijing Liu
Tianchen Ji
Neeloy Chakraborty
Kaiwen Hong
Katherine Driggs-Campbell
51
3
0
23 Jan 2023
Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning
Huimin Wu
Chenyang Lei
Xiao Sun
Pengju Wang
Qifeng Chen
Kwang-Ting Cheng
Stephen Lin
Zhirong Wu
MQ
38
5
0
19 Dec 2022
Audio Latent Space Cartography
Nicolas Jonason
Bob L. T. Sturm
DiffM
23
0
0
05 Dec 2022
TimbreCLIP: Connecting Timbre to Text and Images
Nicolas Jonason
Bob L. T. Sturm
CLIP
33
4
0
21 Nov 2022
A Review of Intelligent Music Generation Systems
Lei Wang
Ziyi Zhao
Han Liu
Junwei Pang
Yi-qiang Qin
Qidi Wu
MGen
21
31
0
16 Nov 2022
Show Me the Instruments: Musical Instrument Retrieval from Mixture Audio
Kyungsuk Kim
Minju Park
Ha-na Joung
Yunkee Chae
Yeongbeom Hong
Seonghyeon Go
Kyogu Lee
11
6
0
15 Nov 2022
GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant Instance Conditioning
Gaku Narita
Junichi Shimizu
Taketo Akama
GAN
26
11
0
10 Nov 2022
HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
Filip Szatkowski
Karol J. Piczak
Przemysław Spurek
Jacek Tabor
Tomasz Trzciñski
23
12
0
03 Nov 2022
Synthesizer Preset Interpolation using Transformer Auto-Encoders
G. Vaillant
Thierry Dutoit
19
3
0
27 Oct 2022
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE
Yueh-Kao Wu
Ching-Yu Chiu
Yi-Hsuan Yang
ViT
21
14
0
12 Oct 2022
Learning Temporal Resolution in Spectrogram for Audio Classification
Haohe Liu
Xubo Liu
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
34
7
0
04 Oct 2022
An empirical study of weakly supervised audio tagging embeddings for general audio representations
Heinrich Dinkel
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
43
1
0
30 Sep 2022
The Efficacy of Self-Supervised Speech Models for Audio Representations
Tung-Yu Wu
Chen An Li
Tzu-Han Lin
Tsung-Yuan Hsu
Hung-yi Lee
32
5
0
26 Sep 2022
An Initial study on Birdsong Re-synthesis Using Neural Vocoders
Rhythm Bhatia
Tomi Kinnunen
23
1
0
21 Sep 2022
Mel Spectrogram Inversion with Stable Pitch
Bruno Di Giorgi
M. Levy
Richard Sharp
22
6
0
26 Aug 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
21
49
0
11 Jun 2022
Co-creation and ownership for AI radio
Skylar Gordon
Robert Mahari
Manaswi Mishra
Ziv Epstein
24
4
0
01 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
350
0
21 May 2022
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit
Hui Zhang
Tian Yuan
Junkun Chen
Xintong Li
Renjie Zheng
...
Zeyu Chen
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
Liang Huang
AuLLM
31
24
0
20 May 2022
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
27
5
0
17 May 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
32
65
0
26 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
SSL
36
53
0
15 Apr 2022
DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning
Sreyan Ghosh
Ashish Seth
and Deepak Mittal
Maneesh Singh
S. Umesh
SSL
27
6
0
25 Mar 2022
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet
Björn W. Schuller
Ali Akman
Yi-Fen Chang
H. Coppock
Alexander Gebhard
Alexander Kathan
Esther Rituerto-González
Andreas Triantafyllopoulos
Florian B. Pokorny
38
1
0
10 Mar 2022
1
2
3
Next