v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Inference-optimized AI and high performance computing for gravitational wave detection at scale Pranshu Chaturvedi Asad Khan Minyang Tian Eliu A. Huerta Huihuo Zheng 72 28 0 26 Jan 2022
Invertible Voice Conversion Zexin Cai Ming Li BDL 71 1 0 26 Jan 2022
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Artem Gorodetskii Ivan Ozhiganov 117 2 0 25 Jan 2022
Improving Adversarial Waveform Generation based Singing Voice Conversion with Harmonic Signals Haohan Guo Zhiping Zhou Fanbo Meng Kai-Chun Liu 100 16 0 25 Jan 2022
Text and Code Embeddings by Contrastive Pre-Training Arvind Neelakantan Tao Xu Raul Puri Alec Radford Jesse Michael Han ... Tabarak Khan Toki Sherbakov Joanne Jang Peter Welinder Lilian Weng SSL AI4TS 401 446 0 24 Jan 2022
Fast Transient Stability Prediction Using Grid-informed Temporal and Topological Embedding Deep Neural Network Peiyuan Sun L. Huo Siyuan Liang Xin Chen 64 7 0 23 Jan 2022
HiSTGNN: Hierarchical Spatio-temporal Graph Neural Networks for Weather Forecasting Minbo Ma Peng Xie Fei Teng Tian-Jie Li Bin Wang Shenggong Ji Junbo Zhang AI4TS 55 9 0 22 Jan 2022
Online POI Recommendation: Learning Dynamic Geo-Human Interactions in Streams Dongjie Wang Kunpeng Liu Hui Xiong Yanjie Fu 160 8 0 19 Jan 2022
MHTTS: Fast multi-head text-to-speech for spontaneous speech with imperfect transcription Dabiao Ma Yitong Zhang Meng Li Feng Ye 39 1 0 19 Jan 2022
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis Yu Wang Xinsheng Wang Pengcheng Zhu Jie Wu Hanzhao Li Heyang Xue Yongmao Zhang Lei Xie Mengxiao Bi 112 103 0 19 Jan 2022
Variational Autoencoder Generative Adversarial Network for Synthetic Data Generation in Smart Home Mina Razghandi Hao Zhou Melike Erol-Kantarci D. Turgut 45 33 0 19 Jan 2022
Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration Bálint Csanády András Lukács 26 0 0 18 Jan 2022
A Practical Guide to Logical Access Voice Presentation Attack Detection Xin Wang Junichi Yamagishi AAML 109 11 0 10 Jan 2022
Audio representations for deep learning in sound synthesis: A review Anastasia Natsiou Seán O'Leary AI4TS 74 18 0 07 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram Anastasia Natsiou Seán O'Leary 46 3 0 07 Jan 2022
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks Lei Cheng Ruslan Khalitov Tong Yu Zhirong Yang 73 32 0 06 Jan 2022
Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement Biometrics Dillon Lohr Oleg V. Komogortsev 77 4 0 05 Jan 2022
A Comprehensive Survey on Radio Frequency (RF) Fingerprinting: Traditional Approaches, Deep Learning, and Open Challenges Anu Jagannath Jithin Jagannath P. Kumar 66 145 0 03 Jan 2022
Evaluating Deep Music Generation Methods Using Data Augmentation Toby Godwin Georgios Rizos Alice Baird N. A. Futaisi Vincent Brisse Bjoern W. Schuller MGen 34 1 0 31 Dec 2021
InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer Chin-Tung Lin Mu Yang ViT 51 1 0 31 Dec 2021
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation Han Zhang Weichong Yin Yewei Fang Lanxin Li Boqiang Duan Zhihua Wu Yu Sun Hao Tian Hua Wu Haifeng Wang 71 59 0 31 Dec 2021
BP-Net: Cuff-less, Calibration-free, and Non-invasive Blood Pressure Estimation via a Generic Deep Convolutional Architecture Soheil Zabihi E. Rahimian Fatemeh Marefat A. Asif P. Mohseni Arash Mohammadi OOD 20 2 0 31 Dec 2021
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives Hideyuki Tachibana Mocho Go Muneyoshi Inahara Yotaro Katayama Yotaro Watanabe DiffM 78 3 0 26 Dec 2021
Latent Space Simulation for Carbon Capture Design Optimization Brian Bartoldson Rui Wang Yu-Hang Fu David Widemann Sam Nguyen J. Bao Zhijie Xu Brenda Ng 52 3 0 22 Dec 2021
TFDPM: Attack detection for cyber-physical systems with diffusion probabilistic models Tijin Yan Tong Zhou Yufeng Zhan Yuanqing Xia DiffM 59 8 0 20 Dec 2021
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus Rongjie Huang Feiyang Chen Yi Ren Jinglin Liu Chenye Cui Zhou Zhao 97 104 0 20 Dec 2021
Soundify: Matching Sound Effects to Video David Chuan-En Lin Anastasis Germanidis Cristobal Valenzuela Yining Shi Nikolas Martelaro 79 16 0 17 Dec 2021
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling Yusong Wu Ethan Manilow Yi Deng Rigel Swavely Kyle Kastner Tim Cooijmans Aaron Courville Cheng-Zhi Anna Huang Jesse Engel 87 45 0 17 Dec 2021
A Comparative Study of Detecting Anomalies in Time Series Data Using LSTM and TCN Models Saroj Gopali Faranak Abri Sima Siami‐Namini A. Namin AI4TS 31 13 0 17 Dec 2021
EmotionBox: a music-element-driven emotional music generation system using Recurrent Neural Network Kaitong Zheng R. Meng C. Zheng Xiaodong Li Jinqiu Sang JuanJuan Cai Jie Wang MGen 58 2 0 16 Dec 2021
Leveraging Image-based Generative Adversarial Networks for Time Series Generation Justin Hellermann Stefan Lessmann GAN AI4TS 72 4 0 15 Dec 2021
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs Zhisheng Xiao Karsten Kreis Arash Vahdat DiffM 131 562 0 15 Dec 2021
Scale-Aware Neural Architecture Search for Multivariate Time Series Forecasting Donghui Chen Ling-Hao Chen Zongjiang Shang Youdong Zhang Bo Wen Chenghu Yang AI4TS 73 7 0 14 Dec 2021
AI and extreme scale computing to learn and infer the physics of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers Asad Khan E. A. H. abd Prayush Kumar 74 5 0 13 Dec 2021
Computational bioacoustics with deep learning: a review and roadmap D. Stowell 98 259 0 13 Dec 2021
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization P. Liotet Francesco Vidaich Alberto Maria Metelli Marcello Restelli OffRL 71 8 0 13 Dec 2021
Causal Knowledge Guided Societal Event Forecasting Songgaojun Deng Huzefa Rangwala Yue Ning AI4TS 61 2 0 10 Dec 2021
Neural Multi-Quantile Forecasting for Optimal Inventory Management Federico Garza Ramírez 32 1 0 10 Dec 2021
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading Leyuan Qu C. Weber S. Wermter 79 23 0 09 Dec 2021
Forecasting Brain Activity Based on Models of Spatio-Temporal Brain Dynamics: A Comparison of Graph Neural Network Architectures S. Wein Alina Schüller A. Tomé W. Malloni M. Greenlee E. Lang AI4CE 86 15 0 08 Dec 2021
Periodic Residual Learning for Crowd Flow Forecasting Chengxin Wang Yuxuan Liang Gary S. H. Tan AI4TS 47 12 0 08 Dec 2021
Dilated convolution with learnable spacings Ismail Khalfaoui-Hassani Thomas Pellegrini T. Masquelier 125 32 0 07 Dec 2021
VocBench: A Neural Vocoder Benchmark for Speech Synthesis Ehab A. AlBadawy Andrew Gibiansky Qing He Jilong Wu Ming-Ching Chang Siwei Lyu 61 12 0 06 Dec 2021
Parameter Efficient Deep Probabilistic Forecasting O. Sprangers Sebastian Schelter Maarten de Rijke BDL AI4TS 118 24 0 06 Dec 2021
Dynamic Graph Learning-Neural Network for Multivariate Time Series Modeling Zhuoling Li Gaowei Zhang Lingyu Xu Jie Yu AI4TS 42 2 0 06 Dec 2021
ES-dRNN: A Hybrid Exponential Smoothing and Dilated Recurrent Neural Network Model for Short-Term Load Forecasting Slawek Smyl Grzegorz Dudek Paweł Pełka AI4TS 61 33 0 05 Dec 2021
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone Edresson Casanova Julian Weber C. Shulby Arnaldo Cândido Júnior Eren Golge M. Ponti 249 415 0 04 Dec 2021
My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack Matthias Gazzari Annemarie Mattmann Max Maass M. Hollick 47 5 0 04 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation Yingruo Fan Zhaojiang Lin Jun Saito Wenping Wang Taku Komura 69 22 0 04 Dec 2021
Deep Efficient Continuous Manifold Learning for Time Series Modeling Seungwoo Jeong Wonjun Ko A. Mulyadi Heung-Il Suk AI4TS 92 7 0 03 Dec 2021