2409.10157
Cited By
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
16 September 2024
Xiaoxue Gao
Chen Zhang
Yiming Chen
Huayun Zhang
Nancy F. Chen
Papers citing
"Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization"
6 / 6 papers shown
Title
VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models
Heyang Liu
Yuhao Wang
Ziyang Cheng
Ronghua Wu
Qunshan Gu
Yanfeng Wang
Yu Wang
AuLLM
14
0
0
21 May 2025
Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment
Xueyao Zhang
Yansen Wang
Chaoren Wang
Zhiyu Li
Zhuo Chen
Zhizheng Wu
164
0
0
07 May 2025
EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting
Guanrou Yang
Chen Yang
Qian Chen
Ziyang Ma
Wenxi Chen
...
Fan Yu
Zhihao Du
Zhifu Gao
Shiliang Zhang
Xie Chen
AuLLM
57
0
0
17 Apr 2025
F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization
Xiaohui Sun
Ruitong Xiao
Jianye Mo
Bowen Wu
Qun Yu
Baoxun Wang
51
1
0
03 Apr 2025
OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Run Luo
Ting-En Lin
Jun Wang
Yuchuan Wu
Xiong Liu
...
Jiaming Li
Lei Zhang
Yushen Chen
Hamid Alinejad-Rokny
Fei Huang
AuLLM
VLM
83
0
0
08 Jan 2025
Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Yiming Chen
Xianghu Yue
Xiaoxue Gao
Chen Zhang
L. F. D’Haro
R. Tan
Haizhou Li
AuLLM
32
1
0
27 Sep 2024