v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Autoencoding sensory substitution Viktor Tóth L. Parkkonen 35 6 0 14 Jul 2019
Generative Modeling by Estimating Gradients of the Data Distribution Yang Song Stefano Ermon SyDa DiffM 273 3,972 0 12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer Z. Wang Yao Ma Zitao Liu Jiliang Tang ViT 82 106 0 12 Jul 2019
On the Evaluation of Conditional GANs Terrance Devries Adriana Romero Luis Villaseñor-Pineda Graham W. Taylor M. Drozdzal EGVM 87 43 0 11 Jul 2019
Multi-Speaker End-to-End Speech Synthesis Jihyun Park Kexin Zhao Kainan Peng Ming-Yu Liu SyDa 74 19 0 09 Jul 2019
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning Yu Zhang Ron J. Weiss Heiga Zen Yonghui Wu Zhiwen Chen RJ Skerry-Ryan Ye Jia Andrew Rosenberg Bhuvana Ramabhadran 76 189 0 09 Jul 2019
M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention Shuang Ma Daniel J. McDuff Yale Song 44 4 0 09 Jul 2019
Towards Debugging Deep Neural Networks by Generating Speech Utterances Bilal Soomro Anssi Kanervisto Trung Ngo Trong Ville Hautamaki 21 0 0 06 Jul 2019
Speech bandwidth extension with WaveNet Archit Gupta Brendan Shillingford Yannis Assael Thomas C. Walters 60 29 0 05 Jul 2019
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach Noé Tits 40 10 0 05 Jul 2019
Neural Drum Machine : An Interactive System for Real-time Synthesis of Drum Sounds Cyran Aouameur P. Esling Gaëtan Hadjeres 42 22 0 04 Jul 2019
The Indirect Convolution Algorithm Marat Dukhan 68 42 0 03 Jul 2019
Multitasking with Alexa Multitasking with Alexa: How Using Intelligent Personal Assistants Impacts Language-based Primary Task Performance Justin Edwards H. Liu Tianyu Zhou Sandy J. J. Gould L. Clark Philip R. Doyle Benjamin R. Cowan 37 23 0 03 Jul 2019
Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances Jie Jiang Qiuqiang Kong Mark D. Plumbley Nigel Gilbert 61 59 0 03 Jul 2019
Generative Models for Automatic Chemical Design Daniel Schwalbe-Koda Rafael Gómez-Bombarelli MedIm AI4CE 87 81 0 02 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling Kshiteej S. Mahajan Arjun Balasubramanian Arjun Singhvi Shivaram Venkataraman Aditya Akella Amar Phanishayee Shuchi Chawla 80 185 0 02 Jul 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations Gabriel Meseguer-Brocal Geoffroy Peeters 84 61 0 02 Jul 2019
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks Jibin Wu Yansong Chua Malu Zhang Guoqi Li Haizhou Li Kay Chen Tan 78 14 0 02 Jul 2019
Adaptive Music Composition for Games P. Hutchings Jon McCormack 65 29 0 02 Jul 2019
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation Yi-Chiao Wu Tomoki Hayashi Patrick Lumban Tobing Kazuhiro Kobayashi Tomoki Toda 63 16 0 01 Jul 2019
Analysis by Adversarial Synthesis -- A Novel Approach for Speech Vocoding Ahmed Mustafa A. Biswas Christian Bergler Julia Schottenhamml Andreas Maier GAN 50 4 0 01 Jul 2019
Deep Residual Neural Networks for Audio Spoofing Detection M. Alzantot Ziqi Wang Mani B. Srivastava 77 169 0 30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting Shiyang Li Xiaoyong Jin Yao Xuan Xiyou Zhou Wenhu Chen Yu Wang Xifeng Yan AI4TS 204 1,451 0 29 Jun 2019
Curriculum Learning for Deep Generative Models with Clustering Deli Zhao Jiapeng Zhu Zhenfang Guo Bo Zhang GNN 85 2 0 27 Jun 2019
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis Lenar Gabdrakhmanov Rustem Garaev E. Razinkov 52 10 0 26 Jun 2019
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training Peng Wu Zhenhua Ling Li-Juan Liu Yuan Jiang Hong-Chuan Wu Lirong Dai 98 72 0 26 Jun 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations Jing-Xuan Zhang Zhenhua Ling Lirong Dai 100 99 0 25 Jun 2019
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech Shreyas Seshadri Okko Räsänen 23 10 0 24 Jun 2019
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis Yang Ai Zhenhua Ling 123 29 0 23 Jun 2019
Universal Approximation of Input-Output Maps by Temporal Convolutional Nets Joshua Hanson Maxim Raginsky AI4TS 62 6 0 21 Jun 2019
Black-Box Inference for Non-Linear Latent Force Models W. Ward Tom Ryder D. Prangle Mauricio A. Alvarez DRL 80 14 0 21 Jun 2019
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling Yuanhao Yi Yang Ai Zhenhua Ling Lirong Dai 56 33 0 21 Jun 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders Yin-Jyun Luo Kat R. Agres Dorien Herremans 103 46 0 19 Jun 2019
Disentangled Inference for GANs with Latently Invertible Autoencoder Jiapeng Zhu Deli Zhao Bo Zhang Bolei Zhou GAN DRL 109 35 0 19 Jun 2019
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation Hieu-Thi Luong Junichi Yamagishi 74 10 0 18 Jun 2019
Pose Guided Fashion Image Synthesis Using Deep Generative Model Wei Sun Jawadul H. Bappy Shanglin Yang Yi Tian Xu Tianfu Wu Hui Zhou 56 12 0 17 Jun 2019
Learning Execution through Neural Code Fusion Zhan Shi Kevin Swersky Daniel Tarlow Parthasarathy Ranganathan Milad Hashemi GNN 116 29 0 17 Jun 2019
ASAC: Active Sensing using Actor-Critic models Jinsung Yoon James Jordon M. Schaar CML 59 16 0 16 Jun 2019
Parametric Resynthesis with neural vocoders Soumi Maiti Michael I. Mandel 68 19 0 16 Jun 2019
Stand-Alone Self-Attention in Vision Models Prajit Ramachandran Niki Parmar Ashish Vaswani Irwan Bello Anselm Levskaya Jonathon Shlens VLM SLR ViT 173 1,217 0 13 Jun 2019
GluonTS: Probabilistic Time Series Models in Python A. Alexandrov Konstantinos Benidis Michael Bohlke-Schneider Valentin Flunkert Jan Gasthaus ... David Salinas J. Schulz Lorenzo Stella Ali Caner Türkmen Bernie Wang BDL AI4TS 75 115 0 12 Jun 2019
Toward Interpretable Music Tagging with Self-Attention Minz Won Sanghyuk Chun Xavier Serra ViT 74 82 0 12 Jun 2019
Probabilistic Forecasting with Temporal Convolutional Neural Network Yitian Chen Yanfei Kang Yixiong Chen Zizhuo Wang BDL AI4TS 119 331 0 11 Jun 2019
Parallel Scheduled Sampling Daniel Duckworth Arvind Neelakantan Ben Goodrich Lukasz Kaiser Samy Bengio 77 23 0 11 Jun 2019
Neural Spline Flows Conor Durkan Artur Bekasov Iain Murray George Papamakarios DRL 236 778 0 10 Jun 2019
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis Eric Battenberg Soroosh Mariooryad Daisy Stanton RJ Skerry-Ryan Matt Shannon David Kao Tom Bagby BDL 107 45 0 08 Jun 2019
TransNet: A deep network for fast detection of common shot transitions Tomás Soucek Jaroslav Moravec Jakub Lokoč 42 31 0 08 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences William Chan Nikita Kitaev Kelvin Guu Mitchell Stern Jakob Uszkoreit VLM 96 65 0 04 Jun 2019
Effective LHC measurements with matrix elements and machine learning Johann Brehmer Kyle Cranmer Irina Espejo F. Kling Gilles Louppe J. Pavez 81 14 0 04 Jun 2019
Text-based Editing of Talking-head Video Ohad Fried A. Tewari Michael Zollhöfer Adam Finkelstein Eli Shechtman Dan B. Goldman Kyle Genova Zeyu Jin Christian Theobalt Maneesh Agrawala VGen 110 262 0 04 Jun 2019