Cross-Scale Vector Quantization for Scalable Neural Speech Coding

7 July 2022

Papers citing "Cross-Scale Vector Quantization for Scalable Neural Speech Coding"

20 / 20 papers shown

Title
End-to-End Neural Speech Coding for Real-Time Communications Xue Jiang Xiulian Peng Chengyu Zheng Huaying Xue Yuan Zhang Yan Lu 63 29 0 24 Jan 2022
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding Darius Petermann Seungkwon Beack Minje Kim 42 15 0 22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec Neil Zeghidour Alejandro Luebs Ahmed Omran Jan Skoglund Marco Tagliasacchi AI4TS 105 792 0 07 Jul 2021
Generative Speech Coding with Predictive Variance Regularization W. Kleijn Andrew Storus Michael Chinen Tom Denton Felicia S. C. Lim Alejandro Luebs Jan Skoglund Hengchin Yeh 40 68 0 18 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders Jonah Casebeer Vinjai Vale Umut Isik J. Valin Ritwik Giri A. Krishnaswamy 81 19 0 12 Feb 2021
A Spectral Energy Distance for Parallel Speech Synthesis A. Gritsenko Tim Salimans Rianne van den Berg Jasper Snoek Nal Kalchbrenner 42 70 0 03 Aug 2020
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization Kai Zhen Mi Suk Lee Jongmo Sung Seungkwon Beack Minje Kim 67 20 0 13 Feb 2020
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder Cristina Garbacea Aaron van den Oord Yazhe Li Felicia S. C. Lim Alejandro Luebs Oriol Vinyals Thomas C. Walters 56 121 0 14 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations Alexei Baevski Steffen Schneider Michael Auli SSL 150 666 0 12 Oct 2019
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding Kai Zhen Jongmo Sung Mi Suk Lee Seungkwon Beack Minje Kim 56 40 0 18 Jun 2019
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis Jan Skoglund J. Valin 58 38 0 12 May 2019
Differentiable Consistency Constraints for Improved Deep Speech Enhancement Scott Wisdom J. Hershey K. Wilson J. Thorpe Michael Chinen Brian Patton Rif A. Saurous 38 119 0 20 Nov 2018
High-quality speech coding with SampleRNN Adam Conkey Per Hedelin Cong Zhou Tucker Hermans Lars Villemoes 47 59 0 07 Nov 2018
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction J. Valin Jan Skoglund 65 451 0 28 Oct 2018
Wavenet based low rate speech coding W. Kleijn Felicia S. C. Lim Alejandro Luebs Jan Skoglund Florian Stimberg Quan Wang Thomas C. Walters 40 143 0 01 Dec 2017
Neural Discrete Representation Learning Aaron van den Oord Oriol Vinyals Koray Kavukcuoglu BDL SSL OCL 226 5,008 0 02 Nov 2017
End-to-End Optimized Speech Coding with Deep Neural Networks Srihari Kankanahalli MQ 48 68 0 25 Oct 2017
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model Soroush Mehri Kundan Kumar Ishaan Gulrajani Rithesh Kumar Shubham Jain Jose M. R. Sotelo Aaron Courville Yoshua Bengio 106 599 0 22 Dec 2016
Least Squares Generative Adversarial Networks Xudong Mao Qing Li Haoran Xie Raymond Y. K. Lau Zhen Wang Stephen Paul Smolley GAN 329 4,573 0 13 Nov 2016
WaveNet: A Generative Model for Raw Audio Aaron van den Oord Sander Dieleman Heiga Zen Karen Simonyan Oriol Vinyals Alex Graves Nal Kalchbrenner A. Senior Koray Kavukcuoglu DiffM 401 7,391 0 12 Sep 2016