ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.06457
  4. Cited By
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic
  Talking-head Generation

Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation

12 August 2023
Zhichao Wang
M. Dai
Keld Lundgaard
    VGen
    DiffM
ArXivPDFHTML

Papers citing "Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation"

8 / 8 papers shown
Title
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video
  Editing In the Wild
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
45
96
0
27 Nov 2022
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via
  Pre-trained StyleGAN
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
Fei Yin
Yong Zhang
Xiaodong Cun
Ming Cao
Yanbo Fan
Xuanxia Wang
Qingyan Bai
Baoyuan Wu
Jue Wang
Yujiu Yang
CVBM
81
172
0
08 Mar 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice
  Conversion for everyone
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
214
391
0
04 Dec 2021
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural
  Head Motion
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
Suzhe Wang
Lincheng Li
Yu-qiong Ding
Changjie Fan
Xin Yu
VGen
80
163
0
20 Jul 2021
Text2Video: Text-driven Talking-head Video Synthesis with Personalized
  Phoneme-Pose Dictionary
Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary
Sibo Zhang
Jiahong Yuan
Miao Liao
Liangjun Zhang
31
33
0
29 Apr 2021
ObamaNet: Photo-realistic lip-sync from text
ObamaNet: Photo-realistic lip-sync from text
Rithesh Kumar
Jose M. R. Sotelo
Kundan Kumar
A. D. Brébisson
Yoshua Bengio
41
118
0
06 Dec 2017
Tacotron: Towards End-to-End Speech Synthesis
Tacotron: Towards End-to-End Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
...
Samy Bengio
Quoc V. Le
Yannis Agiomyrgiannakis
R. Clark
Rif A. Saurous
150
1,817
0
29 Mar 2017
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
331
7,361
0
12 Sep 2016
1