ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.15230
  4. Cited By
GAIA: Zero-shot Talking Avatar Generation

GAIA: Zero-shot Talking Avatar Generation

26 November 2023
Tianyu He
Junliang Guo
Runyi Yu
Yuchi Wang
Jialiang Zhu
Kaikai An
Leyi Li
Xu Tan
Chunyu Wang
Han Hu
HsiangTao Wu
Sheng Zhao
Jiang Bian
ArXivPDFHTML

Papers citing "GAIA: Zero-shot Talking Avatar Generation"

23 / 23 papers shown
Title
SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity
SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity
Yihuan Huang
Jiajun Liu
Yanzhen Ren
Wuyang Liu
Juhua Tang
24
0
0
08 Apr 2025
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
Dingcheng Zhen
Shunshun Yin
Shiyang Qin
Hou Yi
Ziwei Zhang
Siyuan Liu
Gan Qi
Ming Tao
VGen
69
0
0
24 Mar 2025
3D Engine-ready Photorealistic Avatars via Dynamic Textures
3D Engine-ready Photorealistic Avatars via Dynamic Textures
Yifan Wang
Ivan Molodetskikh
Ondrej Texler
Dimitar Dinev
45
0
0
19 Mar 2025
Semantic Latent Motion for Portrait Video Generation
Qiyuan Zhang
Chenyu Wu
Wenzhang Sun
Huaize Liu
Donglin Di
Wei Chen
Changqing Zou
VGen
72
0
0
13 Mar 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM
VGen
183
11
0
03 Feb 2025
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li
Mingwang Xu
Yun Zhan
Shan Mu
Jiaye Li
...
Y. Chen
Tan Chen
Mao Ye
Jingdong Wang
Siyu Zhu
VGen
102
2
0
28 Nov 2024
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Xiaozhong Ji
Xiaobin Hu
Zhihong Xu
Junwei Zhu
Chuming Lin
...
Donghao Luo
Yi Chen
Qin Lin
Qinglin Lu
Chengjie Wang
VGen
73
4
0
25 Nov 2024
Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical
  and Landmark Loss Optimization
Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization
Bin Lin
Yanzhen Yu
Jianhao Ye
Ruitao Lv
Y. Yang
Ruoye Xie
Pan Yu
Hongbin Zhou
VGen
32
1
0
18 Oct 2024
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
Hanbo Cheng
Limin Lin
Chenyu Liu
Pengcheng Xia
Pengfei Hu
Jiefeng Ma
Jun Du
Jia Pan
DiffM
VGen
136
0
0
17 Oct 2024
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang
Gaojie Lin
Zhengkun Rong
Chao Liang
Yongming Zhu
Jiaqi Yang
Tianyun Zhong
3DH
87
8
0
08 Jul 2024
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image
  Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Mingwang Xu
Hui Li
Qingkun Su
Hanlin Shang
Liwei Zhang
Ce Liu
Jingdong Wang
Yao Yao
Siyu Zhu
VGen
31
68
0
13 Jun 2024
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with
  Motion and Appearance Disentanglement
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement
Runyi Yu
Tianyu He
Ailing Zhang
Yuchi Wang
Junliang Guo
Xu Tan
Chang Liu
Jie Chen
Jiang Bian
VGen
34
4
0
12 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait
  Animation
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Y. Ma
Hongyu Liu
Haozhao Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wei Liu
Qifeng Chen
33
52
0
04 Jun 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar
  Generation
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang
Junliang Guo
Jianhong Bai
Runyi Yu
Tianyu He
Xu Tan
Xu Sun
Jiang Bian
DiffM
35
9
0
24 May 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
45
87
0
16 Apr 2024
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity
  Talking Head Generation
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
Seyeon Kim
Siyoon Jin
Jihye Park
Kihong Kim
Jiyoung Kim
Jisu Nam
Seungryong Kim
DiffM
VGen
60
3
0
28 Mar 2024
Superior and Pragmatic Talking Face Generation with Teacher-Student
  Framework
Superior and Pragmatic Talking Face Generation with Teacher-Student Framework
Chao Liang
Jianwen Jiang
Tianyun Zhong
Gaojie Lin
Zhengkun Rong
Jiaqi Yang
Yongming Zhu
42
1
0
26 Mar 2024
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Yu Deng
Duomin Wang
Baoyuan Wang
45
21
0
20 Mar 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance
  Editing
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
31
26
0
20 Feb 2024
MagicStick: Controllable Video Editing via Control Handle
  Transformations
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
14
24
0
05 Dec 2023
One-shot Talking Face Generation from Single-speaker Audio-Visual
  Correlation Learning
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
66
117
0
06 Dec 2021
PIRenderer: Controllable Portrait Image Generation via Semantic Neural
  Rendering
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
Yurui Ren
Gezhong Li
Yuanqi Chen
Thomas H. Li
Shan Liu
DiffM
VGen
49
224
0
17 Sep 2021
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
1