
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,039 papers shown
Retrospectives on the Embodied AI Workshop
Matt Deitke
Dhruv Batra
Yonatan Bisk
Tommaso Campari
Angel X. Chang
...
Jesse Thomason
Alexander Toshev
Joanne Truong
Luca Weihs
Jiajun Wu
LM&Ro
42
51
0
13 Oct 2022
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech
Byoung Jin Choi
Myeonghun Jeong
Minchan Kim
Sung Hwan Mun
N. Kim
DiffM
27
5
0
12 Oct 2022
Unsupervised Learning of Equivariant Structure from Sequences
Takeru Miyato
Masanori Koyama
Kenji Fukumizu
31
12
0
12 Oct 2022
Style-Guided Inference of Transformer for High-resolution Image Synthesis
Jonghwa Yim
Minjae Kim
ViT
39
0
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Matthew Baas
Herman Kamper
DiffM
40
7
0
11 Oct 2022
Training Spiking Neural Networks with Local Tandem Learning
Qu Yang
Jibin Wu
Malu Zhang
Yansong Chua
Xinchao Wang
Haizhou Li
63
37
0
10 Oct 2022
Self-explaining Hierarchical Model for Intraoperative Time Series
Dingwen Li
Bing Xue
C. King
Bradley A. Fritz
M. Avidan
Joanna Abraham
Chenyang Lu
AI4CE
21
3
0
10 Oct 2022
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization
Nathan Grinsztajn
Daniel Furelos-Blanco
Shikha Surana
Clément Bonnet
Thomas D. Barrett
54
28
0
07 Oct 2022
Real-World Robot Learning with Masked Visual Pre-training
Ilija Radosavovic
Tete Xiao
Stephen James
Pieter Abbeel
Jitendra Malik
Trevor Darrell
SSL
159
242
0
06 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
Andreas Triantafyllopoulos
Björn W. Schuller
Gökçe İymen
M. Sezgin
Xiangheng He
...
Shuo Liu
Silvan Mertes
Elisabeth André
Ruibo Fu
Jianhua Tao
25
53
0
06 Oct 2022
The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection
Daniele Mari
Federica Latora
Simone Milani
21
11
0
06 Oct 2022
PSVRF: Learning to restore Pitch-Shifted Voice without reference
Yangfu Li
Xiaodan Lin
Jiaxin Yang
21
0
0
06 Oct 2022
GT-GAN: General Purpose Time Series Synthesis with Generative Adversarial Networks
Jinsung Jeon
Jeonghak Kim
Haryong Song
Seunghyeon Cho
Noseong Park
AI4TS
85
44
0
05 Oct 2022
HYPRO: A Hybridly Normalized Probabilistic Model for Long-Horizon Prediction of Event Sequences
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
AI4TS
24
34
0
04 Oct 2022
Movement Analytics: Current Status, Application to Manufacturing, and Future Prospects from an AI Perspective
Peter Baumgartner
Daniel V. Smith
Mashud Rana
Reena Kapoor
Elena Tartaglia
A. Schutt
Ashfaqur Rahman
John Taylor
S. Dunstall
32
4
0
04 Oct 2022
Force-Aware Interface via Electromyography for Natural VR/AR Interaction
Yunxiang Zhang
Benjamin Liang
Boyuan Chen
P. Torrens
S. F. Atashzar
Dahua Lin
Qinghong Sun
OOD
31
23
0
03 Oct 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Yuma Koizumi
Kohei Yatabe
Heiga Zen
M. Bacchiani
DiffM
59
29
0
03 Oct 2022
Mastering Spatial Graph Prediction of Road Networks
Sotiris Anagnostidis
Aurelien Lucchi
Thomas Hofmann
GNN
32
1
0
03 Oct 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
27
290
0
30 Sep 2022
ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Martin H. Radfar
Rohit Barnwal
Rupak Vignesh Swaminathan
Feng-Ju Chang
Grant P. Strimel
Nathan Susanj
Athanasios Mouchtaris
42
13
0
29 Sep 2022
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling
Yusong Wu
Josh Gardner
Ethan Manilow
Ian Simon
Curtis Hawthorne
Jesse Engel
40
10
0
28 Sep 2022
TVLT: Textless Vision-Language Transformer
Zineng Tang
Jaemin Cho
Yixin Nie
Joey Tianyi Zhou
VLM
54
28
0
28 Sep 2022
DynDepNet: Learning Time-Varying Dependency Structures from fMRI Data via Dynamic Graph Structure Learning
Alexander Campbell
A. Zippo
L. Passamonti
N. Toschi
Pietro Lio
35
3
0
27 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
75
65
0
26 Sep 2022
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech
Yusuke Nakai
Yuki Saito
K. Udagawa
Hiroshi Saruwatari
AAML
30
1
0
26 Sep 2022
DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions
Fernando Moreno-Pino
S. Zohren
39
12
0
23 Sep 2022
Leveraging the Potential of Novel Data in Power Line Communication of Electricity Grids
Christoph Balada
Max Bondorf
Sheraz Ahmed
Andreas Dengel
M. Zdrallek
19
0
0
23 Sep 2022
Image Classification using Sequence of Pixels
Gajraj Kuldeep
21
0
0
23 Sep 2022
StyleTime: Style Transfer for Synthetic Time Series Generation
Yousef El-Laham
Svitlana Vyetrenko
AI4TS
31
5
0
22 Sep 2022
Poisson Flow Generative Models
Yilun Xu
Ziming Liu
M. Tegmark
Tommi Jaakkola
103
80
0
22 Sep 2022
Controllable Accented Text-to-Speech Synthesis
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
47
6
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
S. Hambardzumyan
Abhinav Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
31
17
0
22 Sep 2022
An Initial study on Birdsong Re-synthesis Using Neural Vocoders
Rhythm Bhatia
Tomi Kinnunen
23
1
0
21 Sep 2022
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Yin-Ping Cho
Yu Tsao
Hsin-Min Wang
Yi-Wen Liu
DiffM
40
9
0
21 Sep 2022
Reconstructing Robot Operations via Radio-Frequency Side-Channel
Ryan Shah
Chuadhry Mujeeb Ahmed
Shishir Nagaraja
AAML
14
1
0
21 Sep 2022
EMA-VIO: Deep Visual-Inertial Odometry with External Memory Attention
Zheming Tu
Changhao Chen
Xianfei Pan
Ruochen Liu
Jiarui Cui
Jun Mao
58
16
0
18 Sep 2022
Distribution Aware Metrics for Conditional Natural Language Generation
David M. Chan
Yiming Ni
David A. Ross
Sudheendra Vijayanarasimhan
Austin Myers
John F. Canny
53
4
0
15 Sep 2022
Detecting Synthetic Speech Manipulation in Real Audio Recordings
M. Rahman
M. Graciarena
Diego Castán
Chris Cobo-Kroenke
Mitchell McLaren
A. Lawson
AAML
35
9
0
15 Sep 2022
Open Challenges in Synthetic Speech Detection
Luca Cuccovillo
Christoforos Papastergiopoulos
Anastasios Vafeiadis
Artem Yaroshchuk
P. Aichroth
K. Votis
Dimitrios Tzovaras
48
28
0
15 Sep 2022
Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
Jingwei Zhao
Gus Xia
Ye Wang
45
18
0
15 Sep 2022
A Temporal Anomaly Detection System for Vehicles utilizing Functional Working Groups and Sensor Channels
Subash Neupane
Ivan A. Fernandez
Wilson Patterson
Sudip Mittal
Shahram Rahimi
21
5
0
14 Sep 2022
ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
Liumeng Xue
Frank Soong
Shaofei Zhang
Linfu Xie
32
23
0
14 Sep 2022
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Michael Chinen
Jan Skoglund
Chandan K. A. Reddy
Alessandro Ragano
Andrew Hines
13
9
0
14 Sep 2022
Residual Correction in Real-Time Traffic Forecasting
Daejin Kim
Young Cho
Dongmin Kim
Cheonbok Park
Jaegul Choo
21
7
0
12 Sep 2022
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion
Ruibin Yuan
Yuxuan Wu
Jacob Li
Jaxter Kim
39
5
0
09 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
73
575
0
07 Sep 2022
Read it to me: An emotionally aware Speech Narration Application
Rishibha Bansal
18
0
0
06 Sep 2022
Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation
Peining Zhang
Junliang Guo
Linli Xu
Mu You
Junming Yin
27
0
0
05 Sep 2022
HAGCN : Network Decentralization Attention Based Heterogeneity-Aware Spatiotemporal Graph Convolution Network for Traffic Signal Forecasting
Junkyu Jang
Sunghyuk Park
32
1
0
05 Sep 2022
On the Horizon: Interactive and Compositional Deepfakes
Eric Horvitz
21
27
0
05 Sep 2022