ResearchTrend.AI
MultiActor-Audiobook: Zero-Shot Audiobook Generation with Faces and Voices of Multiple Speakers

19 May 2025
Kyeongman Park
Seongho Joo
Kyomin Jung
    VGen

Papers citing "MultiActor-Audiobook: Zero-Shot Audiobook Generation with Faces and Voices of Multiple Speakers"

7 / 7 papers shown
Title
Audiobox: Unified Audio Generation with Natural Language Prompts
Apoorv Vyas
Bowen Shi
Matt Le
Andros Tjandra
Yi-Chiao Wu
...
Chris Summers
Carleigh Wood
Joshua Lane
Mary Williamson
Wei-Ning Hsu
124
94
0
25 Dec 2023
LongStory: Coherent, Complete and Length Controlled Long story Generation
Kyeongman Park
Nakyeong Yang
Kyomin Jung
114
5
0
26 Nov 2023
Prosody Analysis of Audiobooks
Charuta Pethe
Bach Pham
Felix D Childress
Yunting Yin
Steven Skiena
58
1
0
10 Oct 2023
A Discourse-level Multi-scale Prosodic Model for Fine-grained Emotion Analysis
X. Wei
Jia Jia
Xiang Li
Zhiyong Wu
Ziyi Wang
62
1
0
21 Sep 2023
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech
Daria Diatlova
V. Shutov
93
9
0
28 Jun 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG, MLLM
1.5K
14,761
0
15 Mar 2023
Transformer-based Multimodal Information Fusion for Facial Expression Analysis
Wei Zhang
Feng Qiu
Suzhe Wang
Hao Zeng
Zhimeng Zhang
Rudong An
Bowen Ma
Yu-qiong Ding
CVBM
90
91
0
23 Mar 2022