2505.13082
MultiActor-Audiobook: Zero-Shot Audiobook Generation with Faces and Voices of Multiple Speakers
19 May 2025
Kyeongman Park
Seongho Joo
Kyomin Jung
VGen
Papers citing
"MultiActor-Audiobook: Zero-Shot Audiobook Generation with Faces and Voices of Multiple Speakers"
7 / 7 papers shown
Audiobox: Unified Audio Generation with Natural Language Prompts
Apoorv Vyas
Bowen Shi
Matt Le
Andros Tjandra
Yi-Chiao Wu
...
Chris Summers
Carleigh Wood
Joshua Lane
Mary Williamson
Wei-Ning Hsu
124
94
0
25 Dec 2023
LongStory: Coherent, Complete and Length Controlled Long story Generation
Kyeongman Park
Nakyeong Yang
Kyomin Jung
114
5
0
26 Nov 2023
Prosody Analysis of Audiobooks
Charuta Pethe
Bryan Pham
Felix D Childress
Yunting Yin
Steven Skiena
58
1
0
10 Oct 2023
A Discourse-level Multi-scale Prosodic Model for Fine-grained Emotion Analysis
X. Wei
Jia Jia
Xiang Li
Zhiyong Wu
Ziyi Wang
62
1
0
21 Sep 2023
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech
Daria Diatlova
V. Shutov
93
9
0
28 Jun 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.5K
14,761
0
15 Mar 2023
Transformer-based Multimodal Information Fusion for Facial Expression Analysis
Wei Zhang
Feng Qiu
Suzhe Wang
Hao Zeng
Zhimeng Zhang
Rudong An
Bowen Ma
Yu-qiong Ding
CVBM
90
91
0
23 Mar 2022