2107.11506
Cited By
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds
24 July 2021
Xuan Shi
Erica Cooper
Junichi Yamagishi
Papers citing
"Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds"
31 / 31 papers shown
Title
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
122
90
0
12 Jul 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Erica Cooper
Xin Wang
Junichi Yamagishi
68
6
0
25 Apr 2021
On the Reproducibility of Neural Network Predictions
Srinadh Bhojanapalli
Kimberly Wilber
Andreas Veit
A. S. Rawat
Seungyeon Kim
A. Menon
Sanjiv Kumar
110
35
0
05 Feb 2021
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
120
148
0
21 Jan 2021
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
172
381
0
14 Jan 2020
On Model Stability as a Function of Random Seed
Pranava Madhyastha
Dhruv Batra
86
63
0
23 Sep 2019
Probing the Information Encoded in X-vectors
Desh Raj
David Snyder
Daniel Povey
Sanjeev Khudanpur
92
87
0
13 Sep 2019
Data Augmentation for Instrument Classification Robust to Audio Effects
António Ramires
Xavier Serra
29
8
0
19 Jul 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Xu Xiang
Shuai Wang
Houjun Huang
Y. Qian
Kai Yu
DRL
52
144
0
18 Jun 2019
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
77
118
0
27 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
61
142
0
17 Apr 2019
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
101
392
0
23 Feb 2019
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
Qiyang Li
Cem Anil
Xuchan Bao
Sageev Oore
Roger C. Grosse
76
98
0
22 Nov 2018
PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network
Bryan Wang
Yi-Hsuan Yang
59
38
0
11 Nov 2018
Neural Music Synthesis for Flexible Timbre Control
Jong Wook Kim
Rachel M. Bittner
Aparna Kumar
J. P. Bello
57
39
0
01 Nov 2018
Music Transformer
Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam M. Shazeer
Ian Simon
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Monica Dinculescu
Douglas Eck
205
486
0
12 Sep 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
173
716
0
29 Jul 2018
Conditioning Deep Generative Raw Audio Models for Structured Automatic Music
Rachel Manzelli
Vijay Thakkar
Ali Siahkamari
Brian Kulis
MGen
47
45
0
26 Jun 2018
Frame-level Instrument Recognition by Timbre and Pitch
Yun-Ning Hung
Yi-Hsuan Yang
55
35
0
25 Jun 2018
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
356
2,285
0
14 Jun 2018
A Universal Music Translation Network
Noam Mor
Lior Wolf
Adam Polyak
Yaniv Taigman
73
110
0
21 May 2018
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System
Weicheng Cai
Jinkun Chen
Ming Li
60
332
0
14 Apr 2018
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification
Weicheng Cai
Zexin Cai
Xiangjinzi Zhang
Xiaoqi Wang
Ming Li
36
76
0
02 Apr 2018
Attentive Statistics Pooling for Deep Speaker Embedding
K. Okabe
Takafumi Koshinaka
Koichi Shinoda
98
530
0
29 Mar 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
79
2,701
0
16 Dec 2017
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
427
26,539
0
05 Sep 2017
Deep Speaker: an End-to-End Neural Speaker Embedding System
Chao Li
Xiaokong Ma
B. Jiang
Xiangang Li
Xuewei Zhang
Xiao-Chang Liu
Ying Cao
Ajay Kannan
Zhenyao Zhu
53
493
0
05 May 2017
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Jesse Engel
Cinjon Resnick
Adam Roberts
Sander Dieleman
Douglas Eck
Karen Simonyan
Mohammad Norouzi
120
629
0
05 Apr 2017
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,405
0
12 Sep 2016
Deep convolutional neural networks for predominant instrument recognition in polyphonic music
Yoonchang Han
Jae‐Hun Kim
Kyogu Lee
54
205
0
31 May 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015