v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet J. Valin Jan Skoglund 62 79 0 28 Mar 2019
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages Kyubyong Park Thomas Mulc 83 101 0 27 Mar 2019
WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN Pritish Chandna Merlijn Blaauw J. Bonada E. Gómez 114 62 0 26 Mar 2019
General Probabilistic Surface Optimization and Log Density Estimation Dmitry Kopitkov Vadim Indelman 87 1 0 25 Mar 2019
Bandwidth Extension on Raw Audio via Generative Adversarial Networks S. Kim V. Sathe GAN 58 26 0 21 Mar 2019
Smart Edition of MIDI Files Pierre Roy F. Pachet 28 1 0 20 Mar 2019
Neural Networks for Lorenz Map Prediction: A Trip Through Time Denisa Roberts AI4TS AI4CE 21 1 0 18 Mar 2019
A Vocoder Based Method For Singing Voice Extraction Pritish Chandna Merlijn Blaauw J. Bonada E. Gómez 43 9 0 18 Mar 2019
Bilinear Representation for Language-based Image Editing Using Conditional Generative Adversarial Networks Xiaofeng Mao YueFeng Chen Yuhong Li T. Xiong Yuan He Hui Xue GAN 81 21 0 18 Mar 2019
Counterpoint by Convolution Cheng-Zhi Anna Huang Tim Cooijmans Adam Roberts Aaron Courville Douglas Eck BDL 81 151 0 18 Mar 2019
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis Bajibabu Bollepalli Lauri Juvela P. Alku 63 46 0 14 Mar 2019
Voice command generation using Progressive Wavegans Thomas Wiest N. Cummins Alice Baird Simone Hantke J. Dineley Björn Schuller GAN 35 1 0 13 Mar 2019
Deep Text-to-Speech System with Seq2Seq Model Gary Wang AI4TS 41 9 0 11 Mar 2019
Scaling up deep neural networks: a capacity allocation perspective Jonathan Donier 46 0 0 11 Mar 2019
Accelerating Minibatch Stochastic Gradient Descent using Typicality Sampling Xinyu Peng Li Li Feiyue Wang BDL 140 59 0 11 Mar 2019
Singing voice conversion with non-parallel data Xin Chen Wei Chu Jinxi Guo N. Xu 39 28 0 11 Mar 2019
A Deep Generative Model of Speech Complex Spectrograms Aditya Arie Nugraha Kouhei Sekiguchi Kazuyoshi Yoshii 47 19 0 08 Mar 2019
A Character-Level Approach to the Text Normalization Problem Based on a New Causal Encoder Adrián Javaloy Bornás G. García-Mateos CML 11 3 0 06 Mar 2019
Autoregressive Convolutional Recurrent Neural Network for Univariate and Multivariate Time Series Prediction Matteo Maggiolo Gerasimos Spanakis AI4TS BDL 62 9 0 06 Mar 2019
High-Fidelity Image Generation With Fewer Labels Mario Lucic Michael Tschannen Marvin Ritter Xiaohua Zhai Olivier Bachem Sylvain Gelly GAN OOD 130 159 0 06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation Yazan Abu Farha Juergen Gall 90 671 0 05 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation Manoj Kumar Mohammad Babaeizadeh D. Erhan Chelsea Finn Sergey Levine Laurent Dinh Durk Kingma VGen 98 132 0 04 Mar 2019
Analysing Deep Learning-Spectral Envelope Prediction Methods for Singing Synthesis F. Bous A. Röbel 34 3 0 04 Mar 2019
Equilibrated Recurrent Neural Network: Neuronal Time-Delayed Self-Feedback Improves Accuracy and Stability Ziming Zhang Anil Kag Alan Sullivan Venkatesh Saligrama 47 5 0 02 Mar 2019
Fine-Grained Semantic Segmentation of Motion Capture Data using Dilated Temporal Fully-Convolutional Networks N. Cheema S. Hosseini J. Sprenger E. Herrmann H. Du K. Fischer P. Slusallek 18 3 0 02 Mar 2019
1D Convolutional Neural Network Models for Sleep Arousal Detection M. Zabihi Ali Bahrami Rad S. Kiranyaz Simo Särkkä Moncef Gabbouj 47 14 0 01 Mar 2019
A Unified Neural Architecture for Instrumental Audio Tasks Steven Spratley Daniel Beck Trevor Cohn 56 5 0 01 Mar 2019
Assume, Augment and Learn: Unsupervised Few-Shot Meta-Learning via Random Labels and Data Augmentation Antreas Antoniou Amos Storkey SSL 113 75 0 26 Feb 2019
The State of Sparsity in Deep Neural Networks Trevor Gale Erich Elsen Sara Hooker 193 765 0 25 Feb 2019
Wasserstein-Wasserstein Auto-Encoders Shunkang Zhang Yuan Gao Yuling Jiao Jin Liu Yang Wang Can Yang DRL DiffM 29 13 0 25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification Youwei Song Jiahai Wang Tao Jiang Zhiyue Liu Yanghui Rao 72 278 0 25 Feb 2019
GANSynth: Adversarial Neural Audio Synthesis Jesse Engel Kumar Krishna Agrawal Shuo Chen Ishaan Gulrajani Chris Donahue Adam Roberts 111 393 0 23 Feb 2019
Towards Neural Mixture Recommender for Long Range Dependent User Sequences Jiaxi Tang Francois Belletti Sagar Jain Minmin Chen Alex Beutel Can Xu Ed H. Chi 64 92 0 22 Feb 2019
Capacity allocation through neural network layers Jonathan Donier 48 3 0 22 Feb 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering Ramakrishna Vedantam Karan Desai Stefan Lee Marcus Rohrbach Dhruv Batra Devi Parikh NAI BDL 97 87 0 21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences Albert Haque Michelle Guo Prateek Verma Li Fei-Fei 80 51 0 20 Feb 2019
Data Efficient Voice Cloning for Neural Singing Synthesis Merlijn Blaauw J. Bonada R. Daido 137 33 0 19 Feb 2019
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Hafiz Malik 53 26 0 18 Feb 2019
STCN: Stochastic Temporal Convolutional Networks Emre Aksan Otmar Hilliges BDL 59 62 0 18 Feb 2019
Learning to Adaptively Scale Recurrent Neural Networks Hao Hu Liqiang Wang Guo-Jun Qi AI4CE 42 10 0 15 Feb 2019
Fully Convolutional Networks for Text Classification Jacob Anderson 16 4 0 14 Feb 2019
Toward Ergonomic Risk Prediction via Segmentation of Indoor Object Manipulation Actions Using Spatiotemporal Convolutional Networks Behnoosh Parsa Ekta U. Samani Rose Hendrix Cameron Devine Shashi M. Singh Santosh Devasia A. Banerjee 51 24 0 14 Feb 2019
Recurrent Neural Networks with Stochastic Layers for Acoustic Novelty Detection Duong Nguyen O. Kirsebom F. Frazão Ronan Fablet Stan Matwin 45 5 0 13 Feb 2019
Capacity allocation analysis of neural networks: A tool for principled architecture design Jonathan Donier 49 4 0 12 Feb 2019
Unpriortized Autoencoder For Image Generation Jaeyoung Yoo Hojun Lee Nojun Kwak SyDa SSL GAN DRL 34 2 0 12 Feb 2019
Towards a Robust Deep Neural Network in Texts: A Survey Wenqi Wang Benxiao Tang Run Wang Lina Wang Aoshuang Ye AAML 99 39 0 12 Feb 2019
MaCow: Masked Convolutional Generative Flow Xuezhe Ma Xiang Kong Shanghang Zhang Eduard H. Hovy DRL 74 66 0 12 Feb 2019
Adversarial Generation of Time-Frequency Features with application in audio synthesis Andrés Marafioti Nicki Holighaus Nathanael Perraudin P. Majdak 74 68 0 11 Feb 2019
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data Xiaohai Tian Chng Eng Siong Haizhou Li 36 7 0 11 Feb 2019
Data-Driven Vehicle Trajectory Forecasting Shayan Jawed Eya Boumaiza Josif Grabocka Lars Schmidt-Thieme 55 5 0 09 Feb 2019