CREPE: A Convolutional Representation for Pitch Estimation

17 February 2018
Jong Wook Kim
Justin Salamon
P. Li
J. P. Bello

Papers citing "CREPE: A Convolutional Representation for Pitch Estimation"

50 / 153 papers shown
Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN
Yicheng Gu
Chaoren Wang
Zhizheng Wu
Lauri Juvela
12
0
0
21 May 2025
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
Zhiping Qiu
Yitong Jin
Yijiao Wang
Yi Shi
Changbo Wang
Chao Tan
Xiaobing Li
Feng Yu
Tao Yu
Qionghai Dai
34
0
0
07 May 2025
Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks
Xufang Zhao
Omer Tsimhoni
23
0
0
08 Apr 2025
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Hyeongju Kim
Jinhyeok Yang
Yechan Yu
Seunghun Ji
Jacob Morton
Frederik Bous
Joon Byun
Juheon Lee
51
0
0
29 Mar 2025
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
J. Abeßer
Shri Kiran Srinivasan
Meinard Müller
54
0
0
24 Mar 2025
Designing Neural Synthesizers for Low-Latency Interaction
Franco Caspe
Jordie Shier
Mark Sandler
C. Saitis
Andrew McPherson
240
0
0
14 Mar 2025
ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation
Zixuan Wang
Chi-Keung Tang
Yu-Wing Tai
DiffM
VGen
63
0
0
10 Mar 2025
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Gaël Richard
Xavier Alameda-Pineda
58
1
0
10 Jan 2025
A System for Melodic Harmonization using Schoenberg Regions, Giant Steps, and Church Modes
Frederick Fernandes
28
0
0
05 Jan 2025
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
Andrew Zisserman
47
0
0
18 Nov 2024
The Concatenator: A Bayesian Approach To Real Time Concatenative Musaicing
Christopher Tralie
Ben Cantil
33
0
0
07 Nov 2024
Automatic Estimation of Singing Voice Musical Dynamics
Jyoti Narang
Nazif Can Tamer
Viviana De La Vega
Xavier Serra
26
0
0
27 Oct 2024
Sound Check: Auditing Audio Datasets
William Agnew
Julia Barnett
Annie Chu
Rachel Hong
Michael Feffer
Robin Netzorg
Harry H. Jiang
Ezra Awumey
Sauvik Das
46
1
0
17 Oct 2024
Towards Computational Analysis of Pansori Singing
Sangheon Park
Danbinaerin Han
Dasaem Jeong
23
0
0
16 Oct 2024
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based on Source-filter Model
Jianwei Cui
Yu Gu
Chao Weng
Jie Zhang
Liping Chen
Lirong Dai
70
4
0
16 Oct 2024
Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Lucas Ueda
Leonardo B. de M. M. Marques
Flávio O. Simões
Mário Uliani Neto
Fernando Runstein
Bianca Dal Bó
Paula D. P. Costa
33
0
0
25 Sep 2024
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music
N. Shikarpur
Krishna Maneesha Dendukuri
Yusong Wu
Antoine Caillon
Cheng-Zhi Anna Huang
20
1
0
22 Aug 2024
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation
Shihao Chen
Yu Gu
Jianwei Cui
Jie Zhang
Rilin Chen
Lirong Dai
45
2
0
22 Aug 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
42
9
0
21 Aug 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
Yin-Jyun Luo
K. Cheuk
Woosung Choi
Toshimitsu Uesaka
Keisuke Toyama
...
Chieh-Hsin Lai
Yuhta Takida
Wei-Hsiang Liao
Simon Dixon
Yuki Mitsufuji
CoGe
49
2
0
20 Aug 2024
MaskAnyone Toolkit: Offering Strategies for Minimizing Privacy Risks and Maximizing Utility in Audio-Visual Data Archiving
B. Owoyele
Martin Schilling
Rohan Sawahn
Niklas Kaemer
Pavel Zherebenkov
Bhuvanesh Verma
Wim Pouw
Gerard de Melo
34
0
0
06 Aug 2024
Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
J. Lee
Jaehyun Park
Min Jun Choi
Kyogu Lee
42
2
0
07 Jul 2024
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
Sungkyun Chang
Emmanouil Benetos
Holger Kirchhoff
Simon Dixon
42
3
0
05 Jul 2024
Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data
Hitoshi Suda
Aya Watanabe
Shinnosuke Takamichi
36
0
0
05 Jul 2024
Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
Fatemeh Jamshidi
Gary Pike
Amit Das
Richard Chapman
45
4
0
20 Jun 2024
Articulatory Encodec: Coding Speech through Vocal Tract Kinematics
Cheol Jun Cho
Peter Wu
Tejas S. Prabhune
Dhruv Agarwal
Gopala K. Anumanchipalli
36
3
0
18 Jun 2024
TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information
Yiwen Wang
Xihong Wu
51
2
0
13 Jun 2024
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance
Shihao Chen
Yu Gu
Jie Zhang
Na Li
Rilin Chen
Liping Chen
Lirong Dai
DiffM
48
6
0
08 Jun 2024
STraDa: A Singer Traits Dataset
Yuexuan Kong
V. Tran
Romain Hennequin
18
2
0
06 Jun 2024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder
Yicheng Gu
Xueyao Zhang
Liumeng Xue
Haizhou Li
Zhizheng Wu
30
3
0
26 Apr 2024
ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time Series Forecasting
Hengyu Ye
Jiadong Chen
Shijin Gong
Fuxin Jiang
Tieying Zhang
Jianjun Chen
Xiaofeng Gao
AI4TS
37
3
0
08 Apr 2024
Toward Fully Self-Supervised Multi-Pitch Estimation
Frank Cwitkowitz
Zhiyao Duan
32
4
0
23 Feb 2024
Cacophony: An Improved Contrastive Audio-Text Model
Ge Zhu
Jordan Darefsky
Zhiyao Duan
AuLLM
46
11
0
10 Feb 2024
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching
Noy Uzrad
Oren Barkan
Almog Elharar
Shlomi Shvartzman
Moshe Laufer
Lior Wolf
Noam Koenigstein
32
4
0
23 Jan 2024
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation
Haojie Wei
Xueke Cao
Wenbo Xu
Tangpeng Dan
Yueguo Chen
VLM
27
2
0
08 Jan 2024
Leveraging Laryngograph Data for Robust Voicing Detection in Speech
Yixuan Zhang
Heming Wang
DeLiang Wang
32
0
0
05 Dec 2023
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-By-Humming Task
Amantur Amatov
Dmitry Lamanov
Maksim Titov
Ivan Vovk
Ilya Makarov
Mikhail Kudinov
33
0
0
02 Dec 2023
String Sound Synthesizer on GPU-accelerated Finite Difference Scheme
J. Lee
Min Jun Choi
Kyogu Lee
23
2
0
30 Nov 2023
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion
A. R. Bargum
Stefania Serafin
Cumhur Erkut
26
3
0
14 Nov 2023
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
Pierre-Amaury Grumiaux
Mathieu Lagrange
16
2
0
13 Nov 2023
A cry for help: Early detection of brain injury in newborns
Charles C. Onu
Samantha Latremouille
Arsenii Gorin
Junhao Wang
Innocent Udeogu
...
O. Kehinde
Muhammad A. Salisu
Datonye Briggs
Yoshua Bengio
Doina Precup
59
2
0
12 Oct 2023
F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study
Irán R. Román
Daniel Faronbi
Isabelle Burger-Weiser
Leila Adu-Gilmore
14
2
0
02 Oct 2023
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Krishna Subramani
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
29
3
0
25 Sep 2023
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)
Junyu Chen
Susmitha Vekkot
Pancham Shukla
33
6
0
15 Sep 2023
DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing
Yunyi Liu
Craig Jin
David Gunawan
19
2
0
14 Sep 2023
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Nicolas Jonason
Xin Eric Wang
Erica Cooper
Lauri Juvela
Bob L. T. Sturm
Junichi Yamagishi
36
1
0
14 Sep 2023
EnCodecMAE: Leveraging neural codecs for universal audio representation learning
L. Pepino
Pablo Riera
Luciana Ferrer
38
4
0
14 Sep 2023
PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective
Alain Riou
Stefan Lattner
Gaëtan Hadjeres
Geoffroy Peeters
37
13
0
05 Sep 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew McPherson
C. Saitis
29
21
0
29 Aug 2023
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion
Jordan J. Bird
Ahmad Lotfi
13
17
0
24 Aug 2023