LPCNet: Improving Neural Speech Synthesis Through Linear Prediction

28 October 2018

Papers citing "LPCNet: Improving Neural Speech Synthesis Through Linear Prediction"

50 / 80 papers shown

Title
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Bowen Zhang Congchao Guo Geng Yang Hang Yu Haozhe Zhang ... Yichen Xiao Yiying Zhou Yujie Zhang Yuan Lu Yucen He 26 0 0 12 May 2025
RADE: A Neural Codec for Transmitting Speech over HF Radio Channels David Rowe Jean-Marc Valin 26 0 0 10 May 2025
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection Lam Pham Phat Lam Dat Tran Hieu Tang Tin Nguyen Alexander Schindler Canh Vu Alexander Polonsky Canh Vu 56 3 0 23 Sep 2024
InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself Chang Zeng Chunhui Wang Xiaoxiao Miao Jian Zhao Zhonglin Jiang Yong Chen 41 0 0 10 Sep 2024
ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild Jiangyan Yi Chu Yuan Zhang Jianhua Tao Chenglong Wang Xinrui Yan Yong Ren Hao Gu Junzuo Zhou 52 1 0 09 Aug 2024
Differentiable All-pole Filters for Time-varying Audio Systems Chin-Yun Yu Christopher Mitcheltree Alistair Carson Stefan Bilbao Joshua D. Reiss Gyorgy Fazekas 40 2 0 11 Apr 2024
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model Yukiya Hono Kei Hashimoto Yoshihiko Nankaku Keiichi Tokuda DiffM 35 2 0 22 Feb 2024
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity Krishna Subramani J. Valin Jan Büthe Paris Smaragdis Mike Goodwin 27 3 0 25 Sep 2023
RAMP: Retrieval-Augmented MOS Prediction via Confidence-based Dynamic Weighting Haibo Wang Shiwan Zhao Xiguang Zheng Yong Qin 29 11 0 31 Aug 2023
APNet: An All-Frame-Level Neural Vocoder Incorporating Direct Prediction of Amplitude and Phase Spectra Yang Ai Zhenhua Ling 34 13 0 13 May 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings Wei Xue Yiwen Wang Qi-fei Liu Yi-Ting Guo 37 1 0 09 May 2023
Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis Chunyu Qiang Peng Yang Hao Che Ying Zhang Xiaorui Wang Zhong-ming Wang 46 9 0 14 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model Rui Xue Yanqing Liu Lei He Xuejiao Tan Linquan Liu Ed Lin Sheng Zhao 34 7 0 06 Mar 2023
Cross-domain Neural Pitch and Periodicity Estimation Max Morrison Caedon Hsieh Nathan Pruyne Bryan Pardo 18 17 0 28 Jan 2023
Emotion Selectable End-to-End Text-based Speech Editing Tao Wang Jiangyan Yi Ruibo Fu J. Tao Zhengqi Wen Chu Yuan Zhang 33 2 0 20 Dec 2022
Puffin: pitch-synchronous neural waveform generation for fullband speech on modest devices O. Watts Lovisa Wihlborg Cassia Valentini-Botinhao 30 3 0 25 Nov 2022
Efficient Incremental Text-to-Speech on GPUs Muyang Du Chuan Liu Jiaxing Qi Junjie Lai 24 1 0 25 Nov 2022
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System Takenori Yoshimura Shinji Takaki Kazuhiro Nakamura Keiichiro Oura Yukiya Hono Kei Hashimoto Yoshihiko Nankaku K. Tokuda 29 7 0 21 Nov 2022
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing J. Webber Cassia Valentini-Botinhao Evelyn Williams G. Henter Simon King 11 9 0 13 Nov 2022
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space Jihwan Lee Jaesung Bae Seongkyu Mun Heejin Choi Joun Yeop Lee Hoon-Young Cho Chanwoo Kim 26 2 0 06 Nov 2022
Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding Haici Yang Wootaek Lim Minje Kim 24 9 0 04 Nov 2022
Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis Karolos Nikitaras Konstantinos Klapsas Nikolaos Ellinas Georgia Maniati June Sig Sung Inchul Hwang S. Raptis Aimilios Chalamandaris Pirros Tsiakoulis 14 0 0 01 Nov 2022
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS Kun Song Jian Cong Xinsheng Wang Yongmao Zhang Linfu Xie Ning Jiang Haiying Wu 27 0 0 31 Oct 2022
The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection Daniele Mari Federica Latora Simone Milani 13 11 0 06 Oct 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration Yuma Koizumi Kohei Yatabe Heiga Zen M. Bacchiani DiffM 42 29 0 03 Oct 2022
ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS Liumeng Xue Frank Soong Shaofei Zhang Linfu Xie 27 23 0 14 Sep 2022
Fully Automated End-to-End Fake Audio Detection Chenglong Wang Jiangyan Yi J. Tao Haiyang Sun Xun Chen Zhengkun Tian Haoxin Ma Cunhang Fan Ruibo Fu 26 28 0 20 Aug 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding Xue Jiang Xiulian Peng Huaying Xue Yuan Zhang Yan Lu MQ 39 9 0 07 Jul 2022
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model J. Valin Ahmed Mustafa Christopher Montgomery Timothy B. Terriberry Michael Klingbeil Paris Smaragdis A. Krishnaswamy 24 18 0 11 May 2022
Fine-grained Noise Control for Multispeaker Speech Synthesis Karolos Nikitaras G. Vamvoukakis Nikolaos Ellinas Konstantinos Klapsas K. Markopoulos S. Raptis June Sig Sung Gunu Jho Aimilios Chalamandaris Pirros Tsiakoulis 29 4 0 11 Apr 2022
Self-supervised learning for robust voice cloning Konstantinos Klapsas Nikolaos Ellinas Karolos Nikitaras G. Vamvoukakis Panos Kakoulidis ... S. Raptis June Sig Sung Gunu Jho Aimilios Chalamandaris Pirros Tsiakoulis SSL 27 6 0 07 Apr 2022
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation Marc-Antoine Georges Julien Diard Laurent Girin J. Schwartz Thomas Hueber 11 7 0 05 Apr 2022
Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge Sangjun Park Kihyun Choo Joohyung Lee A. Porov Konstantin Osipov June Sig Sung 14 6 0 27 Mar 2022
WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary Losses Zewang Zhang Yibin Zheng Xinhui Li Li Lu 26 16 0 21 Mar 2022
Real time spectrogram inversion on mobile phone Oleg Rybakov Marco Tagliasacchi Yunpeng Li Liyang Jiang Xia Zhang Fadi Biadsy 21 4 0 01 Mar 2022
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation Krishna Subramani J. Valin Umut Isik Paris Smaragdis A. Krishnaswamy 29 11 0 23 Feb 2022
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet J. Valin Umut Isik Paris Smaragdis A. Krishnaswamy 29 4 0 22 Feb 2022
Wavebender GAN: An architecture for phonetically meaningful speech manipulation Gustavo Teodoro Döhler Beck Ulme Wennberg Zofia Malisz G. Henter AI4CE 27 8 0 22 Feb 2022
COIN++: Neural Compression Across Modalities Emilien Dupont H. Loya Milad Alizadeh Adam Goliñski Yee Whye Teh Arnaud Doucet 57 82 0 30 Jan 2022
Disentangling Style and Speaker Attributes for TTS Style Transfer Xiaochun An Frank Soong Lei Xie 68 18 0 24 Jan 2022
End-to-End Neural Speech Coding for Real-Time Communications Xue Jiang Xiulian Peng Chengyu Zheng Huaying Xue Yuan Zhang Yan Lu 29 27 0 24 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram Anastasia Natsiou Seán O'Leary 22 3 0 07 Jan 2022
VocBench: A Neural Vocoder Benchmark for Speech Synthesis Ehab A. AlBadawy Andrew Gibiansky Qing He Jilong Wu Ming-Ching Chang Siwei Lyu 22 12 0 06 Dec 2021
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis Alexandra Vioni Myrsini Christidou Nikolaos Ellinas G. Vamvoukakis Panos Kakoulidis Taehoon Kim June Sig Sung Hyoungmin Park Aimilios Chalamandaris Pirros Tsiakoulis 13 11 0 19 Nov 2021
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations Hyeong-Seok Choi Juheon Lee W. Kim Jie Hwan Lee Hoon Heo Kyogu Lee 37 150 0 27 Oct 2021
ViDA-MAN: Visual Dialog with Digital Humans T. Shen Jiawei Zuo Fan Shi Jin Zhang Liqin Jiang Meng Chen Zhengchen Zhang Wei Zhang Xiaodong He Tao Mei 33 6 0 26 Oct 2021
KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke Xiaobin Zhuang Huiran Yu Weifeng Zhao Tao Jiang Peng Hu 27 5 0 18 Oct 2021
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding Darius Petermann Seungkwon Beack Minje Kim 27 14 0 22 Jul 2021
SoundStream: An End-to-End Neural Audio Codec Neil Zeghidour Alejandro Luebs Ahmed Omran Jan Skoglund Marco Tagliasacchi AI4TS 43 731 0 07 Jul 2021
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion Daxin Tan Liqun Deng Y. Yeung Xin Jiang Xiao Chen Tan Lee 29 37 0 04 Jul 2021