WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Dilated Convolution with Dilated GRU for Music Source Separation Jen-Yu Liu Yi-Hsuan Yang 72 41 0 04 Jun 2019
MelNet: A Generative Model for Audio in the Frequency Domain Sean Vasquez M. Lewis DiffM 85 132 0 04 Jun 2019
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion Joan Serrà Santiago Pascual Carlos Segura CVBM 76 85 0 03 Jun 2019
Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN David Álvarez Santiago Pascual Antonio Bonafonte 73 12 0 03 Jun 2019
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS Mutian He Yan Deng Lei He 97 81 0 03 Jun 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2 Ali Razavi Aaron van den Oord Oriol Vinyals DRL BDL 209 1,832 0 02 Jun 2019
Graph WaveNet for Deep Spatial-Temporal Graph Modeling Zonghan Wu Shirui Pan Guodong Long Jing Jiang Chengqi Zhang GNN AI4TS 108 2,208 0 31 May 2019
Self-Referencing Embedded Strings (SELFIES): A 100% robust molecular string representation Mario Krenn Florian Hase AkshatKumar Nigam Pascal Friederich Alán Aspuru-Guzik 116 71 0 31 May 2019
Learning Sparse Networks Using Targeted Dropout Aidan Gomez Ivan Zhang Siddhartha Rao Kamalakara Divyam Madaan Kevin Swersky Y. Gal Geoffrey E. Hinton 112 98 0 31 May 2019
Unsupervised Model Selection for Variational Disentangled Representation Learning Sunny Duan Loic Matthey Andre Saraiva Nicholas Watters Christopher P. Burgess Alexander Lerchner I. Higgins OOD DRL 96 80 0 29 May 2019
Global Guarantees for Blind Demodulation with Generative Priors Paul Hand Babhru Joshi 129 33 0 29 May 2019
Rethinking Full Connectivity in Recurrent Neural Networks Matthijs Van Keirsbilck A. Keller Xiaodong Yang LRM 41 14 0 29 May 2019
Complex-valued neural networks for machine learning on non-stationary physical data Jesper Sören Dramsch M. Lüthje Anders Christensen 80 36 0 29 May 2019
SignalTrain: Profiling Audio Compressors with Deep Neural Networks Scott H. Hawley Benjamin Colburn S. I. Mimilakis 42 12 0 28 May 2019
Learning distant cause and effect using only local and immediate credit assignment D. Rawlinson Abdelrahman Ahmed Gideon Kowadlo 45 3 0 28 May 2019
Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations Niccolò Dalmasso Ann B. Lee Rafael Izbicki T. Pospisil Ilmun Kim Chieh-An Lin 78 8 0 27 May 2019
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019 Andros Tjandra Berrak Sisman Mingyang Zhang S. Sakti Haizhou Li Satoshi Nakamura 85 72 0 27 May 2019
Learning by stochastic serializations Pablo Strasser S. Armand Stéphane Marchand-Maillet Alexandros Kalousis 31 0 0 27 May 2019
ViterbiNet: A Deep Learning Based Viterbi Algorithm for Symbol Detection Nir Shlezinger Nariman Farsad Yonina C. Eldar Andrea J. Goldsmith 91 145 0 26 May 2019
Learning to Reason in Large Theories without Imitation Kshitij Bansal Christian Szegedy M. Rabe Sarah M. Loos Viktor Toman NAI LRM 107 42 0 25 May 2019
Fast computation of loudness using a deep neural network Josef Schlittenlacher Richard Turner B. Moore 14 2 0 24 May 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching P. Micaelli Amos Storkey 82 230 0 23 May 2019
Quantifying Long Range Dependence in Language and User Behavior to improve RNNs Francois Belletti Minmin Chen Ed H. Chi AI4TS 45 23 0 23 May 2019
Compression with Flows via Local Bits-Back Coding Jonathan Ho Evan Lohn Pieter Abbeel 103 54 0 21 May 2019
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems Ohsung Kwon Eunwoo Song Jae-Min Kim Hong-Goo Kang 54 4 0 21 May 2019
Non-Autoregressive Neural Text-to-Speech Kainan Peng Ming-Yu Liu Z. Song Kexin Zhao 101 40 0 21 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network V. Wan Chun-an Chan Tom Kenter Jakub Vít R. Clark 71 75 0 17 May 2019
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables F. Kingma Pieter Abbeel Jonathan Ho 106 98 0 16 May 2019
MoGlow: Probabilistic and controllable motion synthesis using normalising flows G. Henter Simon Alexanderson Jonas Beskow 94 98 0 16 May 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss Kaizhi Qian Yang Zhang Shiyu Chang Xuesong Yang M. Hasegawa-Johnson 170 471 0 14 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition Yi Ren Xu Tan Tao Qin Sheng Zhao Zhou Zhao Tie-Yan Liu 95 102 0 13 May 2019
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis Jan Skoglund J. Valin 88 38 0 12 May 2019
Machine learning in acoustics: theory and applications Michael J. Bianco Peter Gerstoft James Traer Emma Ozanich M. Roch Sharon Gannot Charles-Alban Deledalle AI4CE 89 391 0 11 May 2019
FastDraw: Addressing the Long Tail of Lane Detection by Adapting a Sequential Prediction Network Jonah Philion 137 159 0 10 May 2019
Deep Unsupervised Cardinality Estimation Zongheng Yang Eric Liang Amog Kamsetty Chenggang Wu Yan Duan Peter Chen Pieter Abbeel J. M. Hellerstein S. Krishnan Ion Stoica 96 208 0 10 May 2019
Semi-supervised and Population Based Training for Voice Commands Recognition Oguz H. Elibol Gokce Keskin Anil Thomas 26 2 0 10 May 2019
AI in the media and creative industries Giuseppe Amato Malte Behrmann Frédéric Bimbot Baptiste Caramiaux Fabrizio Falchi ... Andrew Perkis R. Redondo Enrico Turrin T. Viéville Emmanuel Vincent 66 43 0 10 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems Paarth Neekhara Shehzeen Samarah Hussain Prakhar Pandey Shlomo Dubnov Julian McAuley F. Koushanfar AAML 82 118 0 09 May 2019
Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting Rajat Sen Hsiang-Fu Yu Inderjit Dhillon AI4TS 140 366 0 09 May 2019
Generative Model with Dynamic Linear Flow Huadong Liao Jiawei He Kun-xian Shu DRL 49 5 0 08 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles Daniel Cudeiro Timo Bolkart Cassidy Laidlaw Anurag Ranjan Michael J. Black CVBM 3DH 108 344 0 08 May 2019
Generalized Dilation Neural Networks Gavneet Singh Chadha Jan Niclas Reimann Andreas Schwung MLT AI4TS MedIm 21 0 0 08 May 2019
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting in NAS Yangzhou Jiang Cong Zhao Zeyang Dou Lei Pang 59 5 0 07 May 2019
Multivariate Time Series Classification using Dilated Convolutional Neural Network Omolbanin Yazdanbakhsh S. Dick 57 33 0 05 May 2019
Temporal Graph Convolutional Networks for Automatic Seizure Detection Ian Covert B. Krishnan I. Najm Jiening Zhan Matthew Shore J. Hixson M. Po 60 71 0 03 May 2019
High quality, lightweight and adaptable TTS using LPCNet Zvi Kons Slava Shechtman A. Sorin Carmel Rabinovitz R. Hoory 69 54 0 02 May 2019
Deep Learning for Audio Signal Processing Hendrik Purwins Yue Liu Tuomas Virtanen Jan Schlüter Shuo-yiin Chang Tara N. Sainath VLM 119 598 0 30 Apr 2019
PYRO-NN: Python Reconstruction Operators in Neural Networks Christopher Syben Markus Michen Bernhard Stimpel Stephan Seitz Stefan B. Ploner Andreas Maier AI4CE 51 62 0 30 Apr 2019
Curriculum Learning in Deep Neural Networks for Financial Forecasting Allison Koenecke Amita Gajewar AI4TS 37 16 0 29 Apr 2019
Neural source-filter waveform models for statistical parametric speech synthesis Xin Wang Shinji Takaki Junichi Yamagishi 97 118 0 27 Apr 2019