End-to-End Optimized Speech Coding with Deep Neural Networks

25 October 2017

Papers citing "End-to-End Optimized Speech Coding with Deep Neural Networks"

19 / 19 papers shown

Title
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates N. Pia Martin Strauss M. Multrus B. Edler 44 0 0 26 Sep 2024
Learning Source Disentanglement in Neural Audio Codec Xiaoyu Bie Xubo Liu Gaël Richard 34 1 0 17 Sep 2024
OpenACE: An Open Benchmark for Evaluating Audio Coding Performance Jozef Coldenhoff Niclas Granqvist Milos Cernak 35 0 0 12 Sep 2024
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks Darius Petermann Inseon Jang Minje Kim 16 1 0 14 Mar 2023
Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding Haici Yang Wootaek Lim Minje Kim 29 9 0 04 Nov 2022
AudioLM: a Language Modeling Approach to Audio Generation Zalan Borsos Raphaël Marinier Damien Vincent Eugene Kharitonov Olivier Pietquin ... Dominik Roblek O. Teboul David Grangier Marco Tagliasacchi Neil Zeghidour AuLLM 73 575 0 07 Sep 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications Deniz Gunduz Zhijin Qin Iñaki Estella Aguerri Harpreet S. Dhillon Zhaohui Yang Aylin Yener Kai‐Kit Wong C. Chae 32 435 0 19 Jul 2022
NESC: Robust Neural End-2-End Speech Coding with GANs N. Pia Kishan Gupta Srikanth Korse M. Multrus Guillaume Fuchs 38 15 0 07 Jul 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding Xue Jiang Xiulian Peng Huaying Xue Yuan Zhang Yan Lu MQ 44 9 0 07 Jul 2022
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis Karren D. Yang Dejan Marković Steven Krenn Vasu Agrawal Alexander Richard VGen 20 32 0 31 Mar 2022
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding Darius Petermann Seungkwon Beack Minje Kim 30 14 0 22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec Neil Zeghidour Alejandro Luebs Ahmed Omran Jan Skoglund Marco Tagliasacchi AI4TS 43 744 0 07 Jul 2021
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding Kai Zhen Mi Suk Lee Jongmo Sung Seung-Wha Beack Minje Kim 40 21 0 31 Dec 2020
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization Kai Zhen Mi Suk Lee Jongmo Sung Seungkwon Beack Minje Kim 38 20 0 13 Feb 2020
Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder Cristina Garbacea Aaron van den Oord Yazhe Li Felicia S. C. Lim Alejandro Luebs Oriol Vinyals Thomas C. Walters 27 121 0 14 Oct 2019
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding Kai Zhen Jongmo Sung Mi Suk Lee Seungkwon Beack Minje Kim 35 39 0 18 Jun 2019
Automatic Detection and Compression for Passive Acoustic Monitoring of the African Forest Elephant Johan Bjorck B. Rappazzo Di Chen Richard Bernstein P. Wrege Carla P. Gomes 19 32 0 25 Feb 2019
Deep Generative Models for Distribution-Preserving Lossy Compression Michael Tschannen E. Agustsson Mario Lucic 16 130 0 28 May 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network Wenzhe Shi Jose Caballero Ferenc Huszár J. Totz Andrew P. Aitken Rob Bishop Daniel Rueckert Zehan Wang SupR 234 5,181 0 16 Sep 2016