ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.05109
  4. Cited By
CDPAM: Contrastive learning for perceptual audio similarity

CDPAM: Contrastive learning for perceptual audio similarity

9 February 2021
Pranay Manocha
Zeyu Jin
Richard Y. Zhang
Adam Finkelstein
ArXivPDFHTML

Papers citing "CDPAM: Contrastive learning for perceptual audio similarity"

18 / 18 papers shown
Title
SCOREQ: Speech Quality Assessment with Contrastive Regression
SCOREQ: Speech Quality Assessment with Contrastive Regression
Alessandro Ragano
Jan Skoglund
Andrew Hines
40
6
0
09 Oct 2024
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Sreyan Ghosh
Sonal Kumar
Zhifeng Kong
Rafael Valle
Bryan Catanzaro
Dinesh Manocha
DiffM
49
2
0
02 Oct 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
40
1
0
09 Jun 2024
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech
  Enhancement and Non-matching Reference Audio Quality Assessment
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment
Alessandro Ragano
Jan Skoglund
Andrew Hines
25
9
0
28 Sep 2023
Siamese SIREN: Audio Compression with Implicit Neural Representations
Siamese SIREN: Audio Compression with Implicit Neural Representations
Luca A. Lanzendörfer
Roger Wattenhofer
32
9
0
22 Jun 2023
RealImpact: A Dataset of Impact Sound Fields for Real Objects
RealImpact: A Dataset of Impact Sound Fields for Real Objects
Samuel Clarke
Ruohan Gao
Mason Wang
M. Rau
Julia Xu
Jui-Hsien Wang
Doug L. James
Jiajun Wu
40
9
0
16 Jun 2023
Hypernetworks build Implicit Neural Representations of Sounds
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemtslaw Spurek
Jacek Tabor
Tomasz Trzciñski
24
11
0
09 Feb 2023
HyperSound: Generating Implicit Neural Representations of Audio Signals
  with Hypernetworks
HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
Filip Szatkowski
Karol J. Piczak
Przemysław Spurek
Jacek Tabor
Tomasz Trzciñski
23
12
0
03 Nov 2022
ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive
  Learning
ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive Learning
Nicholas Meegan
Hansi Liu
Bryan Bo Cao
Abrar Alali
Kristin J. Dana
Marco Gruteser
Shubham Jain
A. Ashok
6
1
0
11 Oct 2022
Predicting pairwise preferences between TTS audio stimuli using parallel
  ratings data and anti-symmetric twin neural networks
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Cassia Valentini-Botinhao
M. Ribeiro
O. Watts
Korin Richmond
G. Henter
11
1
0
22 Sep 2022
Equivariant Self-Supervision for Musical Tempo Estimation
Equivariant Self-Supervision for Musical Tempo Estimation
Elio Quinton
32
9
0
03 Sep 2022
Audio Similarity is Unreliable as a Proxy for Audio Quality
Audio Similarity is Unreliable as a Proxy for Audio Quality
Pranay Manocha
Zeyu Jin
Adam Finkelstein
30
8
0
27 Jun 2022
Self-supervised Context-aware Style Representation for Expressive Speech
  Synthesis
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Yihan Wu
Xi Wang
S. Zhang
Lei He
Ruihua Song
J. Nie
42
15
0
25 Jun 2022
Speech Quality Assessment through MOS using Non-Matching References
Speech Quality Assessment through MOS using Non-Matching References
Pranay Manocha
Anurag Kumar
63
25
0
24 Jun 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Takaaki Saeki
Detai Xin
Wataru Nakata
Tomoki Koriyama
Shinnosuke Takamichi
Hiroshi Saruwatari
39
177
0
05 Apr 2022
Towards Lightweight Controllable Audio Synthesis with Conditional
  Implicit Neural Representations
Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations
Jan Zuiderveld
Marco Federici
Erik J. Bekkers
AI4CE
29
6
0
14 Nov 2021
NORESQA: A Framework for Speech Quality Assessment using Non-Matching
  References
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References
Pranay Manocha
Buye Xu
Anurag Kumar
35
44
0
16 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for
  Spoken Question Answering
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
SSL
27
62
0
08 Sep 2021
1