WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Neural Percussive Synthesis Parameterised by High-Level Timbral Features António Ramires Pritish Chandna Xavier Favory Emilia Gómez Xavier Serra 81 23 0 25 Nov 2019
Natural Image Manipulation for Autoregressive Models Using Fisher Scores Wilson Yan Jonathan Ho Pieter Abbeel 28 0 0 25 Nov 2019
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations Taihong Xiao Yi-Hsuan Tsai Kihyuk Sohn Manmohan Chandraker Ming-Hsuan Yang 76 75 0 22 Nov 2019
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks Yong Wang Longyue Wang Shuming Shi Victor O.K. Li Zhaopeng Tu 62 25 0 22 Nov 2019
Adversarial Robustness of Flow-Based Generative Models Phillip E. Pope Yogesh Balaji Soheil Feizi AAML 48 20 0 20 Nov 2019
Deep-Learning Estimation of Band Gap with the Reading-Periodic-Table Method and Periodic Convolution Layer Tomohiko Konno 71 1 0 16 Nov 2019
Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning K. Yuksel Jann Goschenhofer H. V. Varma U. Fietzek Franz MJ Pfister OOD 30 0 0 15 Nov 2019
Deep Long Audio Inpainting Ya-Liang Chang Kuan-Ying Lee Po-Yu Wu Hung-yi Lee Winston H. Hsu 68 33 0 15 Nov 2019
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling Ruizhe Zhao Brian K. Vogel Tanvir Ahmed Wayne Luk 61 37 0 14 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling Jack W. Rae Anna Potapenko Siddhant M. Jayakumar Timothy Lillicrap RALM VLM KELM 105 656 0 13 Nov 2019
Rate-Regularization and Generalization in VAEs Alican Bozkurt Babak Esmaeili Jean-Baptiste Tristan Dana H. Brooks Jennifer G. Dy Jan-Willem van de Meent DRL 92 8 0 11 Nov 2019
GMAN: A Graph Multi-Attention Network for Traffic Prediction Chuanpan Zheng Xiaoliang Fan Cheng-Yu Wang Jianzhong Qi AI4TS AI4CE 154 1,406 0 11 Nov 2019
Generative Autoregressive Networks for 3D Dancing Move Synthesis from Music Hyemin Ahn Jaehun Kim Kihyun Kim Songhwai Oh GAN 81 44 0 11 Nov 2019
Feedback Recurrent AutoEncoder Yang Yang Guillaume Sautière J. Jon Ryu Taco S. Cohen 106 21 0 11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications Chao Zhang Zichao Yang Xiaodong He Li Deng HAI AI4TS 122 338 0 10 Nov 2019
Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model Seyyed Saeed Sarfjoo Xin Wang G. Henter Jaime Lorenzo-Trueba Shinji Takaki Junichi Yamagishi 45 8 0 10 Nov 2019
Characterizing dynamically varying acoustic scenes from egocentric audio recordings in workplace setting Arindam Jati Amrutha Nadarajan Karel Mundnich Shrikanth Narayanan 34 2 0 10 Nov 2019
XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification E. Rahimian Soheil Zabihi S. F. Atashzar A. Asif Arash Mohammadi 71 45 0 09 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers Jean-Baptiste Cordonnier Andreas Loukas Martin Jaggi 179 535 0 08 Nov 2019
Teacher-Student Training for Robust Tacotron-based TTS Rui Liu Berrak Sisman Jingdong Li F. Bao Guanglai Gao Haizhou Li 109 38 0 07 Nov 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework Mingbo Ma Baigong Zheng Kaibo Liu Renjie Zheng Hairong Liu Kainan Peng Kenneth Church Liang Huang 66 31 0 07 Nov 2019
Deep Hedging: Learning to Simulate Equity Option Markets Magnus Wiese Lianjun Bai Ben Wood Hans Buehler GAN 90 69 0 05 Nov 2019
Emotional speech synthesis with rich and granularized control Seyun Um Sangshin Oh Kyungguen Byun Inseon Jang C. Ahn Hong-Goo Kang 85 90 0 05 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Xin Wang Junichi Yamagishi Massimiliano Todisco Héctor Delgado A. Nautsch ... J. Bonastre Avashna Govender S. Ronanki Jing-Xuan Zhang Zhenhua Ling 99 12 0 05 Nov 2019
The frontier of simulation-based inference Kyle Cranmer Johann Brehmer Gilles Louppe AI4CE 277 859 0 04 Nov 2019
Deep-Gap: A deep learning framework for forecasting crowdsourcing supply-demand gap based on imaging time series and residual learning Ahmed Ben Said A. Erradi AI4TS 40 7 0 02 Nov 2019
Deep convolutional neural networks for multi-scale time-series classification and application to disruption prediction in fusion devices R. Churchill the DIII-D team AI4CE 36 10 0 31 Oct 2019
Neural Density Estimation and Likelihood-free Inference George Papamakarios BDL DRL 100 47 0 29 Oct 2019
Disentangling Timbre and Singing Style with Multi-singer Singing Synthesis System Juheon Lee Hyeong-Seok Choi Junghyun Koo Kyogu Lee 45 18 0 29 Oct 2019
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis Mingrui Yuan Z. Duan 23 1 0 29 Oct 2019
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Yusuke Yasuda Xin Wang Junichi Yamagishi 28 2 0 28 Oct 2019
Transferring neural speech waveform synthesizers to musical instrument sounds generation Yi Zhao Xin Wang Lauri Juvela Junichi Yamagishi 92 17 0 27 Oct 2019
Implicit Posterior Variational Inference for Deep Gaussian Processes Haibin Yu Yizhou Chen Zhongxiang Dai K. H. Low Patrick Jaillet 88 43 0 26 Oct 2019
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency M. Whitehill Shuang Ma Daniel J. McDuff Yale Song 111 35 0 25 Oct 2019
Study of Deep Generative Models for Inorganic Chemical Compositions Yoshihide Sawada Koji Morikawa Mikiya Fujii GAN 69 13 0 25 Oct 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram Ryuichi Yamamoto Eunwoo Song Jae-Min Kim 195 821 0 25 Oct 2019
Hierarchical Representation Learning in Graph Neural Networks with Node Decimation Pooling F. Bianchi Daniele Grattarola L. Livi Cesare Alippi 199 49 0 24 Oct 2019
Towards Fine-Grained Prosody Control for Voice Conversion Zheng Lian Zhengqi Wen 70 19 0 24 Oct 2019
Vision-Infused Deep Audio Inpainting Hang Zhou Ziwei Liu Lingfeng Guo Ping Luo Dahua Lin 142 88 0 24 Oct 2019
Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks Kazuhiro Nakamura Shinji Takaki Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku K. Tokuda 84 19 0 24 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit Tomoki Hayashi Ryuichi Yamamoto Katsuki Inoue Takenori Yoshimura Shinji Watanabe Tomoki Toda K. Takeda Yu Zhang Xu Tan VLM 93 205 0 24 Oct 2019
Expression Analysis Based on Face Regions in Read-world Conditions Zheng Lian Ya Li J. Tao Jian Huang Mingyue Niu CVBM 47 58 0 23 Oct 2019
Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales Sanjay Thakur H. V. Hoof Gunshi Gupta David Meger BDL 42 2 0 23 Oct 2019
Detecting Out-of-Distribution Inputs in Deep Neural Networks Using an Early-Layer Output Vahdat Abdelzad Krzysztof Czarnecki Rick Salay Taylor Denouden Sachin Vernekar Buu Phan OODD 64 47 0 23 Oct 2019
Complex Transformer: A Framework for Modeling Complex-Valued Sequence Muqiao Yang Martin Q. Ma Dongyu Li Yao-Hung Hubert Tsai Ruslan Salakhutdinov ViT 53 38 0 22 Oct 2019
GANspection Hammad A. Ayyubi GAN 18 0 0 21 Oct 2019
You May Not Need Order in Time Series Forecasting Yunkai Zhang Qiao Jiang Shurui Li Xiaoyong Jin Xueying Ma Xifeng Yan AI4TS 28 3 0 21 Oct 2019
XL-Editor: Post-editing Sentences with XLNet Yong-Siang Shih Wei-Cheng Chang Yiming Yang KELM 79 11 0 19 Oct 2019
Label-efficient audio classification through multitask learning and self-supervision Tyler Lee Ting Gong Suchismita Padhy Andrew Rouditchenko A. Ndirango SSL VLM 62 7 0 19 Oct 2019
Decoupling feature propagation from the design of graph auto-encoders P. Scherer Helena Andrés-Terré Pietro Lio M. Jamnik BDL 16 1 0 18 Oct 2019