WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling Jacob Menick Nal Kalchbrenner 106 151 0 04 Dec 2018
Timeception for Complex Action Recognition Noureldien Hussein E. Gavves A. Smeulders 147 215 0 04 Dec 2018
Pedestrian Detection with Autoregressive Network Phases Garrick Brazil Xiaoming Liu 88 72 0 02 Dec 2018
Cross-Modulation Networks for Few-Shot Learning Hugo Prol Vincent Dumoulin Luis Herranz 71 15 0 01 Dec 2018
Effects of Loss Functions And Target Representations on Adversarial Robustness Sean Saito S. Roy AAML 72 7 0 01 Dec 2018
SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation Md Shamim Hussain M. A. Haque 44 48 0 01 Dec 2018
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis Min-Jae Hwang Frank Soong Fenglong Xie Xi Wang Hyeonjoo Kang Hong-Goo Kang 53 21 0 29 Nov 2018
3D human pose estimation in video with temporal convolutions and semi-supervised training Dario Pavllo Christoph Feichtenhofer David Grangier Michael Auli 3DH 81 1,015 0 28 Nov 2018
Play as You Like: Timbre-enhanced Multi-modal Music Style Transfer Chien-Yu Lu Min-Xin Xue Chia-Che Chang Che-Rung Lee Li Su 89 34 0 28 Nov 2018
UFANS: U-shaped Fully-Parallel Acoustic Neural Structure For Statistical Parametric Speech Synthesis With 20X Faster Dabiao Ma Zhiba Su Yuhao Lu Wenxuan Wang Zhen Li 34 3 0 28 Nov 2018
Improved Speech Enhancement with the Wave-U-Net Can Eren Sezener Tillman Weyde 65 165 0 27 Nov 2018
Class-Distinct and Class-Mutual Image Generation with GANs Takuhiro Kaneko Yoshitaka Ushiku Tatsuya Harada 100 9 0 27 Nov 2018
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion Wen-Chin Huang Yi-Chiao Wu Hsin-Te Hwang Patrick Lumban Tobing Tomoki Hayashi Kazuhiro Kobayashi Tomoki Toda Yu Tsao H. Wang 63 20 0 27 Nov 2018
Planning in Dynamic Environments with Conditional Autoregressive Models Johanna Hansen Kyle Kastner Aaron Courville Gregory Dudek 53 1 0 25 Nov 2018
An overview of deep learning in medical imaging focusing on MRI A. Lundervold A. Lundervold OOD 112 1,654 0 25 Nov 2018
Interpretable Convolutional Filters with SincNet Mirco Ravanelli Yoshua Bengio 93 107 0 23 Nov 2018
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer Sicong Huang Qiyang Li Cem Anil Xuchan Bao Sageev Oore Roger C. Grosse 92 98 0 22 Nov 2018
Sequential Neural Methods for Likelihood-free Inference Conor Durkan George Papamakarios Iain Murray BDL 187 25 0 21 Nov 2018
Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions Albert Haque Michelle Guo Adam S. Miner Li Fei-Fei 46 112 0 21 Nov 2018
The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation Kai Chen Weilin Zhang Shlomo Dubnov Gus Xia Wei Li MGen 43 5 0 20 Nov 2018
Black-Box Autoregressive Density Estimation for State-Space Models Tom Ryder Andrew Golightly A. Mcgough D. Prangle BDL 46 6 0 20 Nov 2018
Multi-scale aggregation of phase information for reducing computational cost of CNN based DOA estimation Soumitro Chakrabarty Emanuel Habets 45 6 0 20 Nov 2018
Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision Jing-Xuan Zhang Zhenhua Ling Yuan Jiang Li-Juan Liu Chen Liang Lirong Dai 80 30 0 20 Nov 2018
Learning Robust Heterogeneous Signal Features from Parallel Neural Network for Audio Sentiment Analysis Feiyang Chen Ziqian Luo 59 19 0 20 Nov 2018
Coupled Recurrent Models for Polyphonic Music Composition John Thickstun Zaïd Harchaoui Dean Phillips Foster Sham Kakade 42 12 0 20 Nov 2018
Efficient keyword spotting using dilated convolutions and gating A. Coucke M. Chlieh Thibault Gisselbrecht David Leroy Mathieu Poumeyrol Thibaut Lavril 101 100 0 19 Nov 2018
Harmonic Recomposition using Conditional Autoregressive Modeling Kyle Kastner Rithesh Kumar Tim Cooijmans Aaron Courville 52 0 0 18 Nov 2018
Representation Mixing for TTS Synthesis Kyle Kastner J. F. Santos Yoshua Bengio Aaron Courville 55 43 0 17 Nov 2018
High Quality Prediction of Protein Q8 Secondary Structure by Diverse Neural Network Architectures Iddo Drori Isht Dwivedi Pranav Shrestha Jeffrey Wan Yueqi Wang ... Kaveri A. Thakoor Chinmay Joshi Sonam Goenka C. Keasar I. Pe’er 74 27 0 17 Nov 2018
Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles Zack Zukowski CJ Carr 50 18 0 16 Nov 2018
Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands CJ Carr Zack Zukowski MGen 35 20 0 16 Nov 2018
Learning to Predict the Cosmological Structure Formation Siyu He Yin Li Yu Feng S. Ho Siamak Ravanbakhsh Wei Chen Barnabás Póczós 92 172 0 15 Nov 2018
Effect of data reduction on sequence-to-sequence neural TTS Javier Latorre Jakub Lachowicz Jaime Lorenzo-Trueba Thomas Merritt Thomas Drugman S. Ronanki Klimkov Viacheslav 92 59 0 15 Nov 2018
Comprehensive evaluation of statistical speech waveform synthesis Thomas Merritt Bartosz Putrycz Adam Nadolski Tianjun Ye Daniel Korzekwa ... Alexis Moinet A. Breen Rafal Kuklinski N. Strom Roberto Barra-Chicote 51 18 0 15 Nov 2018
Towards achieving robust universal neural vocoding Jaime Lorenzo-Trueba Thomas Drugman Javier Latorre Thomas Merritt Bartosz Putrycz Roberto Barra-Chicote Alexis Moinet Vatsal Aggarwal DRL 139 19 0 15 Nov 2018
Melodic Phrase Segmentation By Deep Neural Networks Y. Guan Jinyu Zhao Yiqin Qiu Zheng Zhang Gus Xia 37 11 0 14 Nov 2018
Neural Wavetable: a playable wavetable synthesizer using neural networks Lamtharn Hantrakul Li-Chia Yang 41 3 0 13 Nov 2018
Hallucinating Point Cloud into 3D Sculptural Object Chun-Liang Li Eunsu Kang Songwei Ge Lingyao Zhang Austin Dill Manzil Zaheer Barnabás Póczós 3DPC 52 2 0 13 Nov 2018
Agent Embeddings: A Latent Representation for Pole-Balancing Networks Oscar Chang Robert Kwiatkowski Siyuan Chen Hod Lipson 147 6 0 12 Nov 2018
PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network Bryan Wang Yi-Hsuan Yang 71 38 0 11 Nov 2018
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems Eunwoo Song Kyungguen Byun Hong-Goo Kang 75 29 0 09 Nov 2018
Mode matching in GANs through latent space learning and inversion Chao Weng AP Prathosh Dong Yu Varun Srivastava S. Chaudhury GAN 62 2 0 08 Nov 2018
Speaker-adaptive neural vocoders for parametric speech synthesis systems Eunwoo Song Xiang Yu Erik Cambria Jagath Rajapakse 49 3 0 08 Nov 2018
Learning Disentangled Representations for Timber and Pitch in Music Audio Yun-Ning Hung Yian Chen Yi-Hsuan Yang 85 16 0 08 Nov 2018
Blockwise Parallel Decoding for Deep Autoregressive Models Mitchell Stern Noam M. Shazeer Ashley J. Llorens 86 238 0 07 Nov 2018
High-quality speech coding with SampleRNN Adam Conkey Per Hedelin Cong Zhou Tucker Hermans Lars Villemoes 71 59 0 07 Nov 2018
Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach Ran Wang Yao Wang A. Flinker 29 7 0 06 Nov 2018
FloWaveNet : A Generative Flow for Raw Audio Sungwon Kim Sang-gil Lee Jongyoon Song Jaehyeon Kim Sungroh Yoon 118 169 0 06 Nov 2018
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion Hirokazu Kameoka Kou Tanaka Damian Kwaśny Takuhiro Kaneko Nobukatsu Hojo 92 64 0 05 Nov 2018
Nonparallel Emotional Speech Conversion Jian Gao Deep Chakraborty H. Tembine Olaitan Olaleye 87 69 0 03 Nov 2018