ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.02702
  4. Cited By
PromptCodec: High-Fidelity Neural Speech Codec using Disentangled
  Representation Learning based Adaptive Feature-aware Prompt Encoders

PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders

3 April 2024
Yu Pan
Lei Ma
Jianjun Zhao
ArXivPDFHTML

Papers citing "PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders"

10 / 10 papers shown
Title
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Shengpeng Ji
Ziyue Jiang
Xize Cheng
Yifu Chen
Minghui Fang
...
Rongjie Huang
Yidi Jiang
Qian Chen
Zhou Zhao
Zhou Zhao
VLM
71
40
0
29 Aug 2024
CAM++: A Fast and Efficient Network for Speaker Verification Using
  Context-Aware Masking
CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking
Haibo Wang
Siqi Zheng
Yafeng Chen
Luyao Cheng
Qian Chen
70
80
0
01 Mar 2023
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with
  Natural Language Style Prompt
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Dongchao Yang
Songxiang Liu
Rongjie Huang
Chao Weng
Helen Meng
DiffM
VLM
49
88
0
31 Jan 2023
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
80
589
0
07 Sep 2022
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen
Viet Dang
R. Clark
Yu Zhang
Ron J. Weiss
Ye Jia
Zhiwen Chen
Yonghui Wu
55
933
0
05 Apr 2019
High-quality speech coding with SampleRNN
High-quality speech coding with SampleRNN
Adam Conkey
Per Hedelin
Cong Zhou
Tucker Hermans
Lars Villemoes
28
59
0
07 Nov 2018
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
Xingyuan Sun
Jiajun Wu
Xiuming Zhang
Zhoutong Zhang
Chengkai Zhang
Tianfan Xue
J. Tenenbaum
William T. Freeman
3DV
57
453
0
12 Apr 2018
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
137
4,928
0
02 Nov 2017
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
227
7,361
0
12 Sep 2016
High-Quality, Low-Delay Music Coding in the Opus Codec
High-Quality, Low-Delay Music Coding in the Opus Codec
J. Valin
Gregory Maxwell
Timothy B. Terriberry
Koen Vos
19
122
0
15 Feb 2016
1