ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.09064
  4. Cited By
End-to-End Optimized Speech Coding with Deep Neural Networks

End-to-End Optimized Speech Coding with Deep Neural Networks

25 October 2017
Srihari Kankanahalli
    MQ
ArXivPDFHTML

Papers citing "End-to-End Optimized Speech Coding with Deep Neural Networks"

19 / 19 papers shown
Title
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
N. Pia
Martin Strauss
M. Multrus
B. Edler
44
0
0
26 Sep 2024
Learning Source Disentanglement in Neural Audio Codec
Learning Source Disentanglement in Neural Audio Codec
Xiaoyu Bie
Xubo Liu
Gaël Richard
34
1
0
17 Sep 2024
OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Jozef Coldenhoff
Niclas Granqvist
Milos Cernak
35
0
0
12 Sep 2024
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction
  Propagation Networks
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks
Darius Petermann
Inseon Jang
Minje Kim
16
1
0
14 Mar 2023
Neural Feature Predictor and Discriminative Residual Coding for
  Low-Bitrate Speech Coding
Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding
Haici Yang
Wootaek Lim
Minje Kim
29
9
0
04 Nov 2022
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
73
575
0
07 Sep 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented
  Communications
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications
Deniz Gunduz
Zhijin Qin
Iñaki Estella Aguerri
Harpreet S. Dhillon
Zhaohui Yang
Aylin Yener
Kai‐Kit Wong
C. Chae
32
435
0
19 Jul 2022
NESC: Robust Neural End-2-End Speech Coding with GANs
NESC: Robust Neural End-2-End Speech Coding with GANs
N. Pia
Kishan Gupta
Srikanth Korse
M. Multrus
Guillaume Fuchs
38
15
0
07 Jul 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
MQ
44
9
0
07 Jul 2022
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement
  by Re-Synthesis
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
Karren D. Yang
Dejan Marković
Steven Krenn
Vasu Agrawal
Alexander Richard
VGen
20
32
0
31 Mar 2022
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable
  Neural Audio Coding
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding
Darius Petermann
Seungkwon Beack
Minje Kim
30
14
0
22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
744
0
07 Jul 2021
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End
  Neural Audio Coding
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seung-Wha Beack
Minje Kim
40
21
0
31 Dec 2020
Efficient And Scalable Neural Residual Waveform Coding With
  Collaborative Quantization
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seungkwon Beack
Minje Kim
38
20
0
13 Feb 2020
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Cristina Garbacea
Aaron van den Oord
Yazhe Li
Felicia S. C. Lim
Alejandro Luebs
Oriol Vinyals
Thomas C. Walters
27
121
0
14 Oct 2019
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End
  Speech Coding
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seungkwon Beack
Minje Kim
35
39
0
18 Jun 2019
Automatic Detection and Compression for Passive Acoustic Monitoring of
  the African Forest Elephant
Automatic Detection and Compression for Passive Acoustic Monitoring of the African Forest Elephant
Johan Bjorck
B. Rappazzo
Di Chen
Richard Bernstein
P. Wrege
Carla P. Gomes
19
32
0
25 Feb 2019
Deep Generative Models for Distribution-Preserving Lossy Compression
Deep Generative Models for Distribution-Preserving Lossy Compression
Michael Tschannen
E. Agustsson
Mario Lucic
16
130
0
28 May 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient
  Sub-Pixel Convolutional Neural Network
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
234
5,181
0
16 Sep 2016
1