Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.15907
Cited By
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation
28 January 2025
Haorui He
Zengqiang Shang
Chaoren Wang
Xuyuan Li
Yicheng Gu
Hua Hua
Liwei Liu
Chen Yang
Jiaqi Li
Peiyang Shi
Yansen Wang
Kai Chen
Pengyuan Zhang
Zhikai Wu
AuLLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation"
7 / 7 papers shown
Title
Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN
Yicheng Gu
Chaoren Wang
Zhizheng Wu
Lauri Juvela
129
1
0
21 May 2025
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
Shengpeng Ji
Tianle Liang
Yongqian Li
Jialong Zuo
Minghui Fang
...
Xize Cheng
Siqi Zheng
Jin Xu
Junyang Lin
Zhou Zhao
AuLLM
ALM
129
0
0
14 May 2025
Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment
Xueyao Zhang
Yijiao Wang
Chaoren Wang
Hui Yuan
Zhuo Chen
Zhizheng Wu
341
0
0
07 May 2025
Kimi-Audio Technical Report
KimiTeam
Ding Ding
Zeqian Ju
Yichong Leng
Shixuan Liu
...
Zhiyong Yang
Aoxiong Yin
Ruibin Yuan
Yanzhe Zhang
Zaida Zhou
AuLLM
VLM
198
13
0
25 Apr 2025
Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement
Xueyao Zhang
Xiaohui Zhang
Kainan Peng
Zhenyu Tang
Vimal Manohar
...
Yansen Wang
Julian Chan
Yuan Huang
Zhizheng Wu
Mingbo Ma
DiffM
235
6
0
11 Feb 2025
SF-Speech: Straightened Flow for Zero-Shot Voice Clone
Xuyuan Li
Zengqiang Shang
Hua Hua
Peiyang Shi
Chen Yang
Li Wang
Pengyuan Zhang
155
3
0
16 Oct 2024
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Yushen Chen
Zhikang Niu
Ziyang Ma
Keqi Deng
Chunhui Wang
Jian Zhao
Kai Yu
Xie Chen
164
92
0
09 Oct 2024
1