v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown

Title
Sok: Comprehensive Security Overview, Challenges, and Future Directions of Voice-Controlled Systems Haozhe Xu Cong Wu Yangyang Gu Xingcan Shang Jing Chen Kun He Ruiying Du 132 3 0 27 May 2024
CNN-based Compressor Mass Flow Estimator in Industrial Aircraft Vapor Cycle System Justin Reverdi Sixin Zhang Said Aoues Fabrice Gamboa Serge Gratton Thomas Pellegrini 72 0 0 27 May 2024
EEG-DBNet: A Dual-Branch Network for Temporal-Spectral Decoding in Motor-Imagery Brain-Computer Interfaces Xicheng Lou Xinwei Li Hongying Meng Jun Hu Meili Xu Yue Zhao Jiazhang Yang Zhangyong Li 98 2 0 25 May 2024
The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation Nick Collins MGen 45 0 0 23 May 2024
Fisher Flow Matching for Generative Modeling over Discrete Data Oscar Davis Samuel Kessler Mircea Petrache .Ismail .Ilkan Ceylan Michael M. Bronstein A. Bose 111 21 0 23 May 2024
Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers Xin Cheng Preslav Nakov Shuqi Li Di Luo Xun Wang Dongyan Zhao Rui Yan AI4TS 94 2 0 22 May 2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation Gwanghyun Kim Alonso Martinez Yu-Chuan Su Brendan Jou José Lezama ... Lijun Yu Lu Jiang A. Jansen Jacob Walker Krishna Somandepalli 77 9 0 22 May 2024
DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation Weiting Tan Jingyu Zhang Lingfeng Shen Daniel Khashabi Philipp Koehn 84 0 0 22 May 2024
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images Yiheng Xiong Angela Dai ViT 67 1 0 20 May 2024
Deep Ensemble Art Style Recognition Orfeas Menis Mastromichalakis Natasa Sofou Giorgos Stamou 3DPC 61 11 0 19 May 2024
Switched Flow Matching: Eliminating Singularities via Switching ODEs Qunxi Zhu Wei Lin 108 1 0 19 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications S. S. Sengar Affan Bin Hasan Sanjay Kumar Fiona Carroll MedIm 76 74 0 17 May 2024
FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning D. Ngo Kandaraj Piamrat Ons Aouedi Thomas Hassan Philippe Raipin-Parvédy AI4TS 52 0 0 14 May 2024
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation Jianyi Chen Wei Xue Xu Tan Zhen Ye Qi-fei Liu Yi-Ting Guo 66 2 0 13 May 2024
Beyond traditional Magnetic Resonance processing with Artificial Intelligence Amir Jahangiri Vladislav Orekhov 45 2 0 13 May 2024
Multi-Scale Dilated Convolution Network for Long-Term Time Series Forecasting Feifei Li Suhan Guo Feng Han Jian Zhao Shen Furao 75 3 0 09 May 2024
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio Yuankun Xie Yi Lu Ruibo Fu Zhengqi Wen Zhiyong Wang ... Xiaopeng Wang Yukun Liu Haonan Cheng Long Ye Yi Sun 98 21 0 08 May 2024
HILCodec: High Fidelity and Lightweight Neural Audio Codec S. Ahn Beom Jun Woo Mingrui Han Chanyeong Moon Nam Soo Kim 43 9 0 08 May 2024
VAEneu: A New Avenue for VAE Application on Probabilistic Forecasting Alireza Koochali Ensiye Tahaei Andreas Dengel Sheraz Ahmed AI4TS 97 1 0 07 May 2024
Detecting music deepfakes is easy but actually hard Darius Afchar Gabriel Meseguer-Brocal Romain Hennequin 101 9 0 07 May 2024
UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios R. Mahjourian Rongbing Mu Valerii Likhosherstov Paul Mougin Xiukun Huang Joao Messias Shimon Whiteson 59 8 0 06 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review Federico Nicolás Peccia Oliver Bringmann 90 0 0 06 May 2024
Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning Jiewen Deng Renhe Jiang Jiaqi Zhang Xuan Song AI4TS 115 6 0 06 May 2024
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers Yuzhe Gu Enmao Diao 102 4 0 30 Apr 2024
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition Jianzong Wang Pengcheng Li Xulong Zhang Ning Cheng Jing Xiao 63 0 0 30 Apr 2024
Evaluating the effectiveness of predicting covariates in LSTM Networks for Time Series Forecasting Gareth Davies AI4TS 125 1 0 29 Apr 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality Tiantian Feng Xuan Shi Rahul Gupta Shrikanth S. Narayanan 75 0 0 27 Apr 2024
Any-Quantile Probabilistic Forecasting of Short-Term Electricity Demand Slawek Smyl Boris N. Oreshkin Paweł Pełka Grzegorz Dudek AI4TS 79 0 0 26 Apr 2024
LM-IGTD: a 2D image generator for low-dimensional and mixed-type tabular data to leverage the potential of convolutional neural networks Vanesa Gómez-Martínez F. J. Lara-Abelenda Pablo Peiro-Corbacho David Chushig-Muzo C. Granja C. Soguero-Ruíz LMTD 64 2 0 26 Apr 2024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder Yicheng Gu Xueyao Zhang Liumeng Xue Haizhou Li Zhizheng Wu 55 3 0 26 Apr 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges Badri N. Patro Vijay Srinivas Agneeswaran Mamba 116 45 0 24 Apr 2024
Music Style Transfer With Diffusion Model Hong Huang Yuyi Wang Luyao Li Jun Lin DiffM 55 0 0 23 Apr 2024
FlashSpeech: Efficient Zero-Shot Speech Synthesis Zhen Ye Zeqian Ju Haohe Liu Xu Tan Jianyi Chen ... Weizhen Bian Shulin He Qi-fei Liu Yi-Ting Guo Wei Xue 102 20 0 23 Apr 2024
LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search Jinyue Guo Anna-Maria Christodoulou Balint Laczko K. Glette 52 0 0 22 Apr 2024
Audio Anti-Spoofing Detection: A Survey Menglu Li Yasaman Ahmadiadli Xiao-Ping Zhang 104 25 0 22 Apr 2024
Large Language Models: From Notes to Musical Form Lilac Atassi 90 0 0 18 Apr 2024
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach Mir Rayat Imtiaz Hossain Mennatullah Siam Leonid Sigal James J. Little VLM 100 7 0 17 Apr 2024
Decoupled Weight Decay for Any $p$ Norm N. Outmezguine Noam Levi 86 3 0 16 Apr 2024
Hardware-aware training of models with synaptic delays for digital event-driven neuromorphic processors A. Patiño-Saucedo Roy Meijer Amirreza Yousefzadeh M. Gomony Federico Corradi Paul Detterer Laura Garrido-Regife B. Linares-Barranco Manolis Sifalakis 40 2 0 16 Apr 2024
Long-form music generation with latent diffusion Zach Evans Julian Parker CJ Carr Zack Zukowski Josiah Taylor Jordi Pons MGen DiffM 122 45 0 16 Apr 2024
A Survey on Deep Learning for Theorem Proving Zhaoyu Li Jialiang Sun Logan Murphy Qidong Su Zenan Li Xian Zhang Kaiyu Yang Xujie Si LRM 123 32 0 15 Apr 2024
High Significant Fault Detection in Azure Core Workload Insights Pranay Lohia Laurent Boué Sharath Ranganath Vijay Srinivas Agneeswaran AI4CE 28 2 0 14 Apr 2024
Foundational GPT Model for MEG Richard Csaky M. Es Oiwi Parker Jones M. Woolrich 67 2 0 14 Apr 2024
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping Kevin Zhang Luka Chkhetiani Francis McCann Ramirez Yash Khare Andrea Vanzo ... Ruben Bousbib Taufiquzzaman Peyash Michael Nguyen Dillon Pulliam Domenic Donato 61 4 0 10 Apr 2024
Adapting LLaMA Decoder to Vision Transformer Jiahao Wang Wenqi Shao Mengzhao Chen Chengyue Wu Yong Liu Taiqiang Wu Kaipeng Zhang Songyang Zhang Kai-xiang Chen Ping Luo MLLM 85 4 0 10 Apr 2024
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing Philip Anastassiou Zhenyu Tang Kainan Peng Dongya Jia Jiaxin Li Ming Tu Yuping Wang Yuxuan Wang Mingbo Ma 126 4 0 10 Apr 2024
TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis Zhiyu Liang Cheng Liang Zheng Liang Hongzhi Wang Bo Zheng 66 1 0 07 Apr 2024
A Novel Bi-LSTM And Transformer Architecture For Generating Tabla Music Roopa Mayya Vivekanand Venkataraman A. Paduri Narayana Darapaneni 45 0 0 06 Apr 2024
PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders Yu Pan Lei Ma Jianjun Zhao 99 4 0 03 Apr 2024
A Novel Audio Representation for Music Genre Identification in MIR Navin Kamuni Mayank Jindal Arpita Soni Sukender Reddy Mallreddy Sharath Chandra Macha VLM 69 7 0 01 Apr 2024