Wavenet based low rate speech coding

1 December 2017
W. Kleijn
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Florian Stimberg
Quan Wang
Thomas C. Walters

Papers citing "Wavenet based low rate speech coding"

33 / 33 papers shown
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
38
0
0
22 Jan 2025
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
N. Pia
Martin Strauss
M. Multrus
B. Edler
42
0
0
26 Sep 2024
OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Jozef Coldenhoff
Niclas Granqvist
Milos Cernak
33
0
0
12 Sep 2024
MFCC-GAN Codec: A New AI-based Audio Coding
Mohammad Reza Hasanabadi
21
0
0
22 Oct 2023
Fewer-token Neural Speech Codec with Time-invariant Codes
Yong Ren
Tao Wang
Jiangyan Yi
Le Xu
Jianhua Tao
Chuyuan Zhang
Jun Zhou
22
33
0
15 Sep 2023
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Zhaohui Li
Haitao Wang
Xinghua Jiang
40
1
0
14 Aug 2023
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks
Darius Petermann
Inseon Jang
Minje Kim
16
1
0
14 Mar 2023
High Quality Audio Coding with MDCTNet
G. Davidson
M. Vinton
P. Ekstrand
Cong Zhou
Lars Villemoes
Lie Lu
MedIm
18
8
0
08 Dec 2022
Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding
Haici Yang
Wootaek Lim
Minje Kim
24
9
0
04 Nov 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Yuma Koizumi
Kohei Yatabe
Heiga Zen
M. Bacchiani
DiffM
49
29
0
03 Oct 2022
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Michael Chinen
Jan Skoglund
Chandan K. A. Reddy
Alessandro Ragano
Andrew Hines
13
9
0
14 Sep 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications
Deniz Gunduz
Zhijin Qin
Iñaki Estella Aguerri
Harpreet S. Dhillon
Zhaohui Yang
Aylin Yener
Kai‐Kit Wong
C. Chae
29
434
0
19 Jul 2022
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
32
8
0
08 Jul 2022
NESC: Robust Neural End-2-End Speech Coding with GANs
N. Pia
Kishan Gupta
Srikanth Korse
M. Multrus
Guillaume Fuchs
33
15
0
07 Jul 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
MQ
39
9
0
07 Jul 2022
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet
J. Valin
Umut Isik
Paris Smaragdis
A. Krishnaswamy
29
4
0
22 Feb 2022
COIN++: Neural Compression Across Modalities
Emilien Dupont
H. Loya
Milad Alizadeh
Adam Goliński
Yee Whye Teh
Arnaud Doucet
63
83
0
30 Jan 2022
End-to-End Neural Speech Coding for Real-Time Communications
Xue Jiang
Xiulian Peng
Chengyu Zheng
Huaying Xue
Yuan Zhang
Yan Lu
34
27
0
24 Jan 2022
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding
Darius Petermann
Seungkwon Beack
Minje Kim
30
14
0
22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
739
0
07 Jul 2021
WaveNet-Based Deep Neural Networks for the Characterization of Anomalous Diffusion (WADNet)
Dezhong Li
Qiujin Yao
Zihan Huang
DiffM
14
19
0
14 Jun 2021
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
29
67
0
18 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
54
18
0
12 Feb 2021
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
27
18
0
11 Jul 2020
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric
Michael Chinen
Felicia S. C. Lim
Jan Skoglund
Nikita Gureev
F. O'Gorman
Andrew Hines
13
132
0
20 Apr 2020
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seungkwon Beack
Minje Kim
38
20
0
13 Feb 2020
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Cristina Garbacea
Aaron van den Oord
Yazhe Li
Felicia S. C. Lim
Alejandro Luebs
Oriol Vinyals
Thomas C. Walters
27
121
0
14 Oct 2019
Speech bandwidth extension with WaveNet
Archit Gupta
Brendan Shillingford
Yannis Assael
Thomas C. Walters
27
28
0
05 Jul 2019
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seungkwon Beack
Minje Kim
35
39
0
18 Jun 2019
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Jan Skoglund
J. Valin
41
38
0
12 May 2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
J. Valin
Jan Skoglund
24
78
0
28 Mar 2019
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
T. Toda
15
25
0
30 Apr 2018
Speaker-independent raw waveform model for glottal excitation
Lauri Juvela
Vassilis Tsiaras
Bajibabu Bollepalli
Manu Airaksinen
Junichi Yamagishi
P. Alku
19
39
0
25 Apr 2018