ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.03067
  4. Cited By
Cross-Scale Vector Quantization for Scalable Neural Speech Coding

Cross-Scale Vector Quantization for Scalable Neural Speech Coding

7 July 2022
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
    MQ
ArXivPDFHTML

Papers citing "Cross-Scale Vector Quantization for Scalable Neural Speech Coding"

20 / 20 papers shown
Title
End-to-End Neural Speech Coding for Real-Time Communications
End-to-End Neural Speech Coding for Real-Time Communications
Xue Jiang
Xiulian Peng
Chengyu Zheng
Huaying Xue
Yuan Zhang
Yan Lu
63
29
0
24 Jan 2022
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable
  Neural Audio Coding
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding
Darius Petermann
Seungkwon Beack
Minje Kim
42
15
0
22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
105
792
0
07 Jul 2021
Generative Speech Coding with Predictive Variance Regularization
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
40
68
0
18 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with
  Vector-Quantized Autoencoders
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
81
19
0
12 Feb 2021
A Spectral Energy Distance for Parallel Speech Synthesis
A Spectral Energy Distance for Parallel Speech Synthesis
A. Gritsenko
Tim Salimans
Rianne van den Berg
Jasper Snoek
Nal Kalchbrenner
42
70
0
03 Aug 2020
Efficient And Scalable Neural Residual Waveform Coding With
  Collaborative Quantization
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seungkwon Beack
Minje Kim
67
20
0
13 Feb 2020
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Cristina Garbacea
Aaron van den Oord
Yazhe Li
Felicia S. C. Lim
Alejandro Luebs
Oriol Vinyals
Thomas C. Walters
56
121
0
14 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
150
666
0
12 Oct 2019
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End
  Speech Coding
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seungkwon Beack
Minje Kim
56
40
0
18 Jun 2019
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Jan Skoglund
J. Valin
58
38
0
12 May 2019
Differentiable Consistency Constraints for Improved Deep Speech
  Enhancement
Differentiable Consistency Constraints for Improved Deep Speech Enhancement
Scott Wisdom
J. Hershey
K. Wilson
J. Thorpe
Michael Chinen
Brian Patton
Rif A. Saurous
38
119
0
20 Nov 2018
High-quality speech coding with SampleRNN
High-quality speech coding with SampleRNN
Adam Conkey
Per Hedelin
Cong Zhou
Tucker Hermans
Lars Villemoes
47
59
0
07 Nov 2018
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
J. Valin
Jan Skoglund
65
451
0
28 Oct 2018
Wavenet based low rate speech coding
Wavenet based low rate speech coding
W. Kleijn
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Florian Stimberg
Quan Wang
Thomas C. Walters
40
143
0
01 Dec 2017
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
226
5,008
0
02 Nov 2017
End-to-End Optimized Speech Coding with Deep Neural Networks
End-to-End Optimized Speech Coding with Deep Neural Networks
Srihari Kankanahalli
MQ
48
68
0
25 Oct 2017
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
106
599
0
22 Dec 2016
Least Squares Generative Adversarial Networks
Least Squares Generative Adversarial Networks
Xudong Mao
Qing Li
Haoran Xie
Raymond Y. K. Lau
Zhen Wang
Stephen Paul Smolley
GAN
329
4,573
0
13 Nov 2016
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
401
7,391
0
12 Sep 2016
1