Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.00130
Cited By
Evaluating generative audio systems and their metrics
31 August 2022
Ashvala Vinay
Alexander Lerch
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating generative audio systems and their metrics"
15 / 15 papers shown
Title
Aligning Text-to-Music Evaluation with Human Preferences
Yichen Huang
Zachary Novack
Koichi Saito
Jiatong Shi
Shinji Watanabe
Yuki Mitsufuji
John Thickstun
Chris Donahue
EGVM
70
1
0
20 Mar 2025
The Effect of Perceptual Metrics on Music Representation Learning for Genre Classification
Tashi Namgyal
Alexander Hepburn
Raúl Santos-Rodríguez
Valero Laparra
Jesús Malo
32
0
0
25 Sep 2024
Prevailing Research Areas for Music AI in the Era of Foundation Models
Megan Wei
M. Modrzejewski
Aswin Sivaraman
Dorien Herremans
MedIm
43
1
0
14 Sep 2024
Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder
Leonardo Berti
DRL
40
0
0
12 Aug 2024
PAGURI: a user experience study of creative interaction with text-to-music models
Francesca Ronchini
Luca Comanducci
Gabriele Perego
Fabio Antonacci
35
3
0
05 Jul 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
42
0
0
27 Jun 2024
Data is Overrated: Perceptual Metrics Can Lead Learning in the Absence of Training Data
Tashi Namgyal
Alexander Hepburn
Raúl Santos-Rodríguez
Valero Laparra
Jesús Malo
33
1
0
06 Dec 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
27
21
0
29 Aug 2023
Siamese SIREN: Audio Compression with Implicit Neural Representations
Luca A. Lanzendörfer
Roger Wattenhofer
32
9
0
22 Jun 2023
Multi-modal Latent Diffusion
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
DiffM
24
13
0
07 Jun 2023
What You Hear Is What You See: Audio Quality Metrics From Image Quality Metrics
Tashi Namgyal
Alexander Hepburn
Raúl Santos-Rodríguez
Valero Laparra
Jesús Malo
27
1
0
19 May 2023
Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture
Hauret Julien
Joubaud Thomas
V. Zimpfer
Bavu Éric
21
6
0
17 Mar 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
30
37
0
04 Feb 2023
Neural Waveshaping Synthesis
B. Hayes
C. Saitis
Gyorgy Fazekas
36
28
0
11 Jul 2021
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
96
373
0
14 Jan 2020
1