v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding S. Tonekaboni Danny Eytan Anna Goldenberg CML SSL AI4TS 171 298 0 01 Jun 2021
Enhancing Trajectory Prediction using Sparse Outputs: Application to Team Sports Brandon Victor Aiden Nibali Zhen He D. Carey 39 9 0 01 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos Lukas Hedegaard Alexandros Iosifidis 3DPC 89 15 0 31 May 2021
StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts Matthew Baas Herman Kamper 53 6 0 31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection Lu Ma Xintian Wang Song Yang Y. Gong Zhongqin Wu 47 1 0 31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation Jonathan Ho Chitwan Saharia William Chan David J. Fleet Mohammad Norouzi Tim Salimans 279 1,246 0 30 May 2021
DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion Songxiang Liu Yuewen Cao Jane Polak Scowcroft Helen Meng DiffM 86 59 0 28 May 2021
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding Neil Zeghidour O. Teboul David Grangier 63 13 0 28 May 2021
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles Zhaoxuan Zhu Nicola Pivaro Shobhit Gupta Abhishek Gupta Marcello Canova OffRL 65 37 0 25 May 2021
Inclusion of Domain-Knowledge into GNNs using Mode-Directed Inverse Entailment T. Dash A. Srinivasan A. Baskar 72 13 0 22 May 2021
Spatial-temporal Conv-sequence Learning with Accident Encoding for Traffic Flow Prediction Zichuan Liu Rui Zhang Chen Wang Zhu Xiao Hongbo Jiang AI4TS 49 20 0 21 May 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters Pritish Chandna António Ramires Xavier Serra Emilia Gómez 61 4 0 21 May 2021
Temporal convolutional networks predict dynamic oxygen uptake response from wearable sensors across exercise intensities Robert Amelard E. Hedge R. Hughson 26 18 0 20 May 2021
High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling Patrick Lumban Tobing Tomoki Toda 67 8 0 20 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics V. Jayaram John Thickstun DiffM 107 25 0 17 May 2021
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection S. Afanasiev A. Smirnova D. Kotereva 64 2 0 17 May 2021
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation Shoule Wu Ziqiang Shi DiffM 157 11 0 17 May 2021
Drill the Cork of Information Bottleneck by Inputting the Most Important Data Xinyu Peng Jiawei Zhang Feiyue Wang Li Li 43 6 0 15 May 2021
Predicting speech intelligibility from EEG in a non-linear classification paradigm Bernd Accou Mohammad Jalilpour-Monesi Hugo Van hamme T. Francart 22 12 0 14 May 2021
Advances in Machine and Deep Learning for Modeling and Real-time Detection of Multi-Messenger Sources Eliu A. Huerta Zhizhen Zhao 106 21 0 13 May 2021
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech Vadim Popov Ivan Vovk Vladimir Gogoryan Tasnima Sadekova Mikhail Kudinov DiffM 119 544 0 13 May 2021
Diffusion Models Beat GANs on Image Synthesis Prafulla Dhariwal Alex Nichol 496 8,017 0 11 May 2021
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents Chaojun Xiao Xueyu Hu Zhiyuan Liu Cunchao Tu Maosong Sun AILaw ELM 104 245 0 09 May 2021
Machine Learning (ML)-Centric Resource Management in Cloud Computing: A Review and Future Directions Tahseen Khan Wenhong Tian Rajkumar Buyya 58 106 0 09 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition Liqiang He Shulin Feng Jane Polak Scowcroft Dong Yu 54 0 0 08 May 2021
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism Jinglin Liu Chengxi Li Yi Ren Feiyang Chen Zhou Zhao DiffM 198 271 0 06 May 2021
Non-Autoregressive vs Autoregressive Neural Networks for System Identification Daniel Weber C. Gühmann 58 7 0 05 May 2021
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators Qijing Huang Minwoo Kang Grace Dinh Thomas Norell Aravind Kalaiah J. Demmel J. Wawrzynek Y. Shao 72 112 0 05 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding J. Nistal Cyran Aouameur Stefan Lattner G. Richard 105 7 0 04 May 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning Shuo Wang Surya Nepal Kristen Moore M. Grobler Carsten Rudolph A. Abuadbba FedML 69 8 0 03 May 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Michael Ruogu Zhang T. Paine Ofir Nachum Cosmin Paduraru George Tucker Ziyun Wang Mohammad Norouzi OffRL 91 49 0 28 Apr 2021
Learning deep autoregressive models for hierarchical data Carl R. Andersson Niklas Wahlström Thomas B. Schon BDL 57 3 0 28 Apr 2021
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data Hadrien Pujol Éric Bavu Alexandre Garcia 90 22 0 27 Apr 2021
Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones? Franco Pellegrini Giulio Biroli 111 6 0 27 Apr 2021
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Rodrigo Mira Konstantinos Vougioukas Pingchuan Ma Stavros Petridis Björn W. Schuller Maja Pantic 121 47 0 27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes Francesc Lluís V. Chatziioannou A. Hofmann 3DPC 120 6 0 26 Apr 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis Erica Cooper Xin Wang Junichi Yamagishi 91 6 0 25 Apr 2021
Restoring degraded speech via a modified diffusion model Jianwei Zhang Suren Jayasuriya Visar Berisha DiffM 64 21 0 22 Apr 2021
Scaling of neural-network quantum states for time evolution Sheng-Hsuan Lin F. Pollmann 66 25 0 21 Apr 2021
Lossless Compression with Latent Variable Models James Townsend BDL DRL 78 6 0 21 Apr 2021
Eye Know You: Metric Learning for End-to-end Biometric Authentication Using Eye Movements from a Longitudinal Dataset Dillon Lohr Henry K. Griffith Oleg V. Komogortsev 89 33 0 21 Apr 2021
Superpixels and Graph Convolutional Neural Networks for Efficient Detection of Nutrient Deficiency Stress from Aerial Imagery Saba Dadsetan David Pichler David Wilson N. Hovakimyan Jennifer Hobbs 99 6 0 20 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers Wilson Yan Yunzhi Zhang Pieter Abbeel A. Srinivas ViT VGen 345 513 0 20 Apr 2021
Review of end-to-end speech synthesis technology based on deep learning Zhaoxi Mu Xinyu Yang Yizhuo Dong AuLLM ALM 94 25 0 20 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks Šimon Mandlík Tomás Pevný 52 5 0 19 Apr 2021
Recursive input and state estimation: A general framework for learning from time series with missing data Alberto García-Durán Robert West AI4TS 30 2 0 17 Apr 2021
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset Saida Mussakhojayeva Aigerim Janaliyeva A. Mirzakhmetov Yerbolat Khassanov H. A. Varol 61 14 0 17 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement Alexander Richard Michael Zollhoefer Yandong Wen Fernando de la Torre Yaser Sheikh CVBM 97 202 0 16 Apr 2021
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning Narendra Chaudhary Sanchit Misra Dhiraj D. Kalamkar A. Heinecke E. Georganas Barukh Ziv Menachem Adelman Bharat Kaul 61 9 0 16 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds Théis Bazin Gaëtan Hadjeres P. Esling M. Malt 61 11 0 15 Apr 2021