Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.06484
Cited By
ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
14 September 2022
Liumeng Xue
Frank Soong
Shaofei Zhang
Linfu Xie
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS"
13 / 13 papers shown
Title
Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining
Jinlong Xue
Yayue Deng
Yingming Gao
Ya Li
RALM
VLM
34
4
0
06 Jun 2024
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model
Jinlong Xue
Yayue Deng
Yicheng Han
Yingming Gao
Ya Li
40
4
0
06 Jun 2024
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS
Dake Guo
Xinfa Zhu
Liumeng Xue
Tao Li
Yuanjun Lv
Yuepeng Jiang
Linfu Xie
6
1
0
25 Sep 2023
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis
Weiqin Li
Shunwei Lei
Qiaochu Huang
Yixuan Zhou
Zhiyong Wu
Shiyin Kang
Helen Meng
25
4
0
31 Aug 2023
Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Xuyuan Li
Zengqiang Shang
Peiyang Shi
Hua Hua
Jian Liu
Pengyuan Zhang
29
0
0
25 Aug 2023
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
H. Oh
Sang-Hoon Lee
Seong-Whan Lee
DiffM
15
14
0
31 Jul 2023
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
Helen Meng
22
7
0
29 Jul 2023
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Yujia Xiao
Shaofei Zhang
Xi Wang
Xuejiao Tan
Lei He
Sheng Zhao
Frank Soong
Tan Lee
19
5
0
03 Jul 2023
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer
Ammar Abbas
S. Karlapati
Bastian Schnell
Penny Karanasou
M. G. Moya
Amith Nagaraj
Ayman Boustati
Nicole Peinelt
Alexis Moinet
Thomas Drugman
25
3
0
20 Jun 2023
Context-aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Shiyin Kang
Helen Meng
25
6
0
13 Apr 2023
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Ya-Jie Zhang
Wei Song
Ya Yue
Zhengchen Zhang
Youzheng Wu
Xiaodong He
26
7
0
11 Nov 2022
Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Detai Xin
Sharath Adavanne
F. Ang
Ashish Kulkarni
Shinnosuke Takamichi
Hiroshi Saruwatari
26
13
0
04 Nov 2022
DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech
Keon Lee
Kyumin Park
Daeyoung Kim
LM&MA
16
42
0
03 Jul 2022
1