WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Attentive Contractive Flow with Lipschitz-constrained Self-Attention
Avideep Mukherjee
Badri N. Patro
Vinay P. Namboodiri
56
0
0
24 Sep 2021
Interpretability in Safety-Critical Financial Trading Systems
Gabriel Deza
Adelin Travers
C. Rowat
Nicolas Papernot
AAML AIFin
109
1
0
24 Sep 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Yuanxun Lu
Jinxiang Chai
Xun Cao
100
89
0
22 Sep 2021
Neural forecasting at scale
Philippe Chatigny
Shengrui Wang
Jean-Marc Patenaude
Boris N. Oreshkin
AI4TS
88
1
0
20 Sep 2021
Deep Spatio-temporal Sparse Decomposition for Trend Prediction and Anomaly Detection in Cardiac Electrical Conduction
Xinyu Zhao
Hao Yan
Zhiyong Hu
D. Du
73
12
0
20 Sep 2021
On-device neural speech synthesis
Sivanand Achanta
Albert Antony
L. Golipour
Jiangchuan Li
T. Raitio
...
Francesco Rossi
Jennifer Shi
Jaimin Upadhyay
David Winarsky
Hepeng Zhang
118
17
0
17 Sep 2021
WaveCorr: Correlation-savvy Deep Reinforcement Learning for Portfolio Management
S. Marzban
Erick Delage
Jonathan Yu-Meng Li
J. Desgagne-Bouchard
C. Dussault
114
0
0
14 Sep 2021
Machine-Learned Prediction Equilibrium for Dynamic Traffic Assignment
Lukas Graf
T. Harks
Kostas Kollias
M. Markl
39
4
0
14 Sep 2021
Predicting the outcome of team movements -- Player time series analysis using fuzzy and deep methods for representation learning
Omid Shokrollahi
Bahman Rohani
A. Nobakhti
AI4TS
27
3
0
13 Sep 2021
DynSTGAT: Dynamic Spatial-Temporal Graph Attention Network for Traffic Signal Control
Libing Wu
Min Wang
Dan Wu
Jia Wu
56
35
0
12 Sep 2021
Multilingual Audio-Visual Smartphone Dataset And Evaluation
Hareesh Mandalapu
N. AravindaReddyP
Raghavendra Ramachandra
K. S. Rao
Pabitra Mitra
S. R. Mahadeva Prasanna
Christoph Busch
60
2
0
09 Sep 2021
Signal-domain representation of symbolic music for learning embedding spaces
Mathieu Prang
P. Esling
39
4
0
08 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling
Eugene Kharitonov
Ann Lee
Adam Polyak
Yossi Adi
Jade Copet
...
Tu Nguyen
M. Rivière
Abdel-rahman Mohamed
Emmanuel Dupoux
Wei-Ning Hsu
118
122
0
07 Sep 2021
Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial Networks
Russell Sammut Bonnici
C. Saitis
Martin Benning
GAN
97
15
0
05 Sep 2021
Network Modulation Synthesis: New Algorithms for Generating Musical Audio Using Autoencoder Networks
Jeremy Hyrkas
49
1
0
04 Sep 2021
How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data
Zhiyuan Zhang
Lingjuan Lyu
Weiqiang Wang
Lichao Sun
Xu Sun
86
36
0
03 Sep 2021
A Multi-view Multi-task Learning Framework for Multi-variate Time Series Forecasting
Jinliang Deng
Xiusi Chen
Renhe Jiang
Xuan Song
Ivor W. Tsang
AI4TS
83
39
0
02 Sep 2021
Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection
Junxiao Xue
Hao Zhou
Yabo Wang
33
9
0
01 Sep 2021
Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise
Mingyu Dong
Diqun Yan
Yongkang Gong
Rangding Wang
AAML
37
2
0
31 Aug 2021
Neural HMMs are all you need (for high-quality attention-free TTS)
Shivam Mehta
Éva Székely
Jonas Beskow
G. Henter
102
18
0
30 Aug 2021
Differentiable Convolution Search for Point Cloud Processing
Xing Nie
Yongcheng Liu
Shaohong Chen
Jianlong Chang
Chunlei Huo
Gaofeng Meng
Qi Tian
Weiming Hu
Chunhong Pan
3DPC
69
6
0
29 Aug 2021
TCCT: Tightly-Coupled Convolutional Transformer on Time Series Forecasting
Li Shen
Yangzhu Wang
AI4TS
88
98
0
29 Aug 2021
Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Zane Durante
Leena Mathur
Eric Ye
Sichong Zhao
Tejas Ramdas
Khalil Iskarous
109
0
0
27 Aug 2021
Bilateral Denoising Diffusion Models
Max W. Y. Lam
Jun Wang
Rongjie Huang
Jane Polak Scowcroft
Dong Yu
DiffM
85
43
0
26 Aug 2021
Self-Attention for Audio Super-Resolution
Nathanaël Carraz Rakotonirina
SupR
71
24
0
26 Aug 2021
Integrated Speech and Gesture Synthesis
Siyang Wang
Simon Alexanderson
Joakim Gustafson
Jonas Beskow
G. Henter
Éva Székely
90
19
0
25 Aug 2021
Fighting Game Commentator with Pitch and Loudness Adjustment Utilizing Highlight Cues
Junjie H. Xu
Zhou Fang
Qihang Chen
Satoru Ohno
Pujana Paliyawan
42
4
0
18 Aug 2021
Combining speakers of multiple languages to improve quality of neural voices
Javier Latorre
Charlotte Bailleul
Tuuli H. Morrill
Alistair Conkie
Y. Stylianou
64
8
0
17 Aug 2021
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Ji-Hoon Kim
Sang-Hoon Lee
Ji-Hyun Lee
Hong G Jung
Seong-Whan Lee
175
6
0
16 Aug 2021
Weakly Supervised Temporal Anomaly Segmentation with Dynamic Time Warping
Dongha Lee
Sehun Yu
Hyunjun Ju
Hwanjo Yu
55
13
0
15 Aug 2021
Enhancing audio quality for expressive Neural Text-to-Speech
Abdelhamid Ezzerg
Adam Gabryś
Bartosz Putrycz
Daniel Korzekwa
Daniel Sáez-Trigueros
David McHardy
Kamil Pokora
Jakub Lachowicz
Jaime Lorenzo-Trueba
V. Klimkov
140
6
0
13 Aug 2021
Multimodal analysis of the predictability of hand-gesture properties
Taras Kucherenko
Rajmund Nagy
Michael Neff
Hedvig Kjellström
G. Henter
59
22
0
12 Aug 2021
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform
Youxuan Ma
Zongze Ren
Shugong Xu
88
40
0
12 Aug 2021
A Generalizable Model-and-Data Driven Approach for Open-Set RFF Authentication
Renjie Xie
Wei Xu
Yanzhi Chen
Jiabao Yu
A. Hu
Derrick Wing Kwan Ng
A. L. Swindlehurst
62
74
0
10 Aug 2021
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person
Xinsheng Wang
Qicong Xie
Jihua Zhu
Lei Xie
O. Scharenborg
120
19
0
09 Aug 2021
A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate
Ahmed Mustafa
Jan Büthe
Srikanth Korse
Kishan Gupta
Guillaume Fuchs
N. Pia
134
19
0
09 Aug 2021
Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis
Harshala Gammulle
Tharindu Fernando
Sridha Sridharan
Akila Pemasiri
Clinton Fookes
64
3
0
09 Aug 2021
An Empirical Study on End-to-End Singing Voice Synthesis with Encoder-Decoder Architectures
Dengfeng Ke
Yuxing Lu
Xudong Liu
Yanyan Xu
Jing Sun
Cheng-Hao Cai
55
0
0
06 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
100
26
0
05 Aug 2021
A FAIR and AI-ready Higgs boson decay dataset
Yifan Chen
Eliu A. Huerta
Javier Mauricio Duarte
Philip C. Harris
Daniel S. Katz
...
Raghav Kansal
Sang Eon Park
Volodymyr V. Kindratenko
Zhizhen Zhao
R. Rusack
94
27
0
04 Aug 2021
Online Training of Spiking Recurrent Neural Networks with Phase-Change Memory Synapses
Yiğit Demirağ
Charlotte Frenkel
Melika Payvand
Giacomo Indiveri
85
17
0
04 Aug 2021
A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset
Tun-Min Hung
Bo-Yu Chen
Yen-Tung Yeh
Yi-Hsuan Yang
62
12
0
03 Aug 2021
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs
J. Nistal
Stefan Lattner
G. Richard
78
9
0
03 Aug 2021
PSA-GAN: Progressive Self Attention GANs for Synthetic Time Series
Paul Jeha
Michael Bohlke-Schneider
Pedro Mercado
Shubham Kapoor
Rajbir-Singh Nirwan
Valentin Flunkert
Jan Gasthaus
Tim Januschowski
AI4TS
110
52
0
02 Aug 2021
Creation and Detection of German Voice Deepfakes
Vanessa Barnekow
Dominik Binder
Niclas Kromrey
Pascal Munaretto
A. Schaad
Felix Schmieder
23
3
0
02 Aug 2021
DeepTrack: Lightweight Deep Learning for Vehicle Path Prediction in Highways
Vinit Katariya
Mohammadreza Baharani
Nichole L. Morris
O. Shoghli
Hamed Tabkhi
HAI
50
44
0
01 Aug 2021
End to End Bangla Speech Synthesis
Prithwiraj Bhattacharjee
Rajan Saha Raju
Arif Ahmad
M. S. Rahman
41
2
0
01 Aug 2021
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
Zhaofeng Shi
73
7
0
01 Aug 2021
Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language
Huiyan Li
Haohong Lin
You Wang
Hengyang Wang
Ming Zhang
Han Gao
Qing Ai
Zhiyuan Luo
Guang Li
63
14
0
31 Jul 2021
Practical Attacks on Voice Spoofing Countermeasures
Andre Kassis
Urs Hengartner
AAML
49
15
0
30 Jul 2021