v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Semi-Recurrent CNN-based VAE-GAN for Sequential Data Generation Mohammad Akbari Jie Liang GAN 75 20 0 01 Jun 2018
Backpropagation for Implicit Spectral Densities Aditya A. Ramesh Yann LeCun 55 10 0 01 Jun 2018
Inverting Supervised Representations with Autoregressive Neural Density Models C. Nash Nate Kushman Christopher K. I. Williams DRL 64 25 0 01 Jun 2018
Mining gold from implicit models to improve likelihood-free inference Johann Brehmer Gilles Louppe J. Pavez Kyle Cranmer AI4CE TPM 188 181 0 30 May 2018
Theory and Experiments on Vector Quantized Autoencoders Aurko Roy Ashish Vaswani Arvind Neelakantan Niki Parmar 91 88 0 28 May 2018
Lipschitz regularity of deep neural networks: analysis and efficient estimation Kevin Scaman Aladin Virmaux 158 533 0 28 May 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis L. Wyse 38 9 0 28 May 2018
Stable Recurrent Models John Miller Moritz Hardt 83 119 0 25 May 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach Noé Tits Kevin El Haddad Thierry Dutoit 65 28 0 23 May 2018
CNN+CNN: Convolutional Decoders for Image Captioning Qingzhong Wang Antoni B. Chan VLM 73 86 0 23 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics P. Esling Axel Chemla-Romeu-Santos Adrien Bitton 52 32 0 22 May 2018
Meta-learning with differentiable closed-form solvers Luca Bertinetto João F. Henriques Philip Torr Andrea Vedaldi ODL 123 932 0 21 May 2018
A Universal Music Translation Network Noam Mor Lior Wolf Adam Polyak Yaniv Taigman 89 110 0 21 May 2018
An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark S. Becker Ronny Hug Wolfgang Hubner Michael Arens 108 71 0 19 May 2018
The global optimum of shallow neural network is attained by ridgelet transform Sho Sonoda Isao Ishikawa Masahiro Ikeda Kei Hagihara Y. Sawano Takuo Matsubara Noboru Murata 35 1 0 19 May 2018
Number Sequence Prediction Problems for Evaluating Computational Powers of Neural Networks Hyoungwook Nam Segwang Kim Kyomin Jung AIMat 66 15 0 19 May 2018
Sequential Neural Likelihood: Fast Likelihood-free Inference with Autoregressive Flows George Papamakarios D. Sterratt Iain Murray BDL 552 370 0 18 May 2018
QuaterNet: A Quaternion-based Recurrent Model for Human Motion Dario Pavllo David Grangier Michael Auli 3DH 76 263 0 16 May 2018
Towards a universal neural network encoder for time series Joan Serrà Santiago Pascual Alexandros Karatzoglou AI4TS 84 123 0 10 May 2018
Intracranial Error Detection via Deep Learning M. Völker Jiří Hammer R. Schirrmeister Joos Behncke L. Fiederer A. Schulze-Bonhage Petr Marusič Wolfram Burgard T. Ball 65 10 0 04 May 2018
Randomly weighted CNNs for (music) audio classification Jordi Pons Xavier Serra 79 86 0 01 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder Yi-Chiao Wu Kazuhiro Kobayashi Tomoki Hayashi Patrick Lumban Tobing Tomoki Toda 65 25 0 30 Apr 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition Albert Haque Corinna Fukushima 21 0 0 30 Apr 2018
Deep Speech Denoising with Vector Space Projections Jeff Hetherly Paul Gamble M. Barrios Cory Stephenson Karl S. Ni 32 0 0 27 Apr 2018
Detection of Glottal Closure Instants from Raw Speech using Convolutional Neural Networks Mohit Goyal Varun Srivastava P. PrathoshA. 40 2 0 26 Apr 2018
JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics Anders Andreassen Ilya Feige Christopher Frye M. Schwartz MU 100 137 0 25 Apr 2018
Speaker-independent raw waveform model for glottal excitation Lauri Juvela Vassilis Tsiaras Bajibabu Bollepalli Manu Airaksinen Junichi Yamagishi P. Alku 54 39 0 25 Apr 2018
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment Tomi Kinnunen Jaime Lorenzo-Trueba Junichi Yamagishi Tomoki Toda Daisuke Saito F. Villavicencio Zhenhua Ling 51 28 0 23 Apr 2018
Deep Layered Learning in MIR Anders Elowsson 46 4 0 18 Apr 2018
The unreasonable effectiveness of the forget gate J. Westhuizen Joan Lasenby 77 89 0 13 Apr 2018
Blood Vessel Geometry Synthesis using Generative Adversarial Networks J. Wolterink T. Leiner Ivana Isgum GAN MedIm 43 25 0 12 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods Jaime Lorenzo-Trueba Junichi Yamagishi Tomoki Toda Daisuke Saito F. Villavicencio Tomi Kinnunen Zhenhua Ling 69 321 0 12 Apr 2018
Understanding disentangling in $β$ -VAE Christopher P. Burgess I. Higgins Arka Pal Loic Matthey Nicholas Watters Guillaume Desjardins Alexander Lerchner CoGe DRL 73 832 0 10 Apr 2018
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis Xin Wang Jaime Lorenzo-Trueba Shinji Takaki Lauri Juvela Junichi Yamagishi 70 67 0 07 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder K. Akuzawa Yusuke Iwasawa Y. Matsuo 87 139 0 06 Apr 2018
Structured Disentangled Representations Babak Esmaeili Hao Wu Sarthak Jain Alican Bozkurt N. Siddharth Brooks Paige Dana H. Brooks Jennifer Dy Jan-Willem van de Meent OOD CML BDL DRL 88 169 0 06 Apr 2018
Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset Xinpeng Chen Jingyuan Chen Lin Ma Jian Yao Wen Liu Jiebo Luo Tong Zhang AI4TS VGen 37 20 0 04 Apr 2018
Music Genre Classification using Machine Learning Techniques Hareesh Bahuleyan VLM 34 104 0 03 Apr 2018
Neural Autoregressive Flows Chin-Wei Huang David M. Krueger Alexandre Lacoste Aaron Courville DRL AI4CE 161 447 0 03 Apr 2018
Conditional End-to-End Audio Transforms Albert Haque Michelle Guo Prateek Verma 114 41 0 30 Mar 2018
Parallel Grid Pooling for Data Augmentation Akito Takeki Daiki Ikami Go Irie Kiyoharu Aizawa 56 7 0 30 Mar 2018
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data Tifani Warnita Nakamasa Inoue Koichi Shinoda 36 40 0 30 Mar 2018
Machine Speech Chain with One-shot Speaker Adaptation Andros Tjandra S. Sakti Satoshi Nakamura 71 56 0 28 Mar 2018
World Models David R Ha Jürgen Schmidhuber SyDa 202 1,102 0 27 Mar 2018
Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra Toru Nakashika Shinji Takaki Junichi Yamagishi 13 1 0 27 Mar 2018
MOrdReD: Memory-based Ordinal Regression Deep Neural Networks for Time Series Forecasting Bernardo Pérez Orozco G. Abbati Stephen J. Roberts OOD AI4TS 38 14 0 26 Mar 2018
HAMLET: Interpretable Human And Machine co-LEarning Technique Olivier Deiss Siddharth Biswal Jing Jin Haoqi Sun M. P. M. Brandon Westover Jimeng Sun 60 11 0 26 Mar 2018
Calibrated Prediction Intervals for Neural Network Regressors Gil Keren N. Cummins Björn Schuller UQCV 87 31 0 26 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Yuxuan Wang Daisy Stanton Yu Zhang RJ Skerry-Ryan Eric Battenberg Joel Shor Y. Xiao Fei Ren Ye Jia Rif A. Saurous 68 827 0 23 Mar 2018
Generalization Challenges for Neural Architectures in Audio Source Separation Shariq Mobin Brian Cheung Bruno A. Olshausen DRL 46 2 0 23 Mar 2018