From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

3 January 2024
Evonne Ng
Javier Romero
Timur M. Bagautdinov
Shaojie Bai
Trevor Darrell
Angjoo Kanazawa
Alexander Richard
    VGen

Papers citing "From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations"

23 / 23 papers shown
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion
Evgeniia Vu
Andrei Boiarov
Dmitry Vetrov
VGen
76
0
0
13 Mar 2025
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
Sichun Wu
Kazi Injamamul Haque
Zerrin Yumak
VGen
65
2
0
12 Sep 2024
DEGAS: Detailed Expressions on Full-Body Gaussian Avatars
Zhijing Shao
D. B. Wang
Qing-Yao Tian
Yao-Dong Yang
Hengyu Meng
Zeyu Cai
Bo Dong
Yu Zhang
Kang Zhang
Zhaoxiang Wang
3DGS
65
4
0
20 Aug 2024
Gaussian Eigen Models for Human Heads
Wojciech Zielonka
Timo Bolkart
Thabo Beeler
Justus Thies
3DGS
95
5
0
05 Jul 2024
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu
Zixin Yin
Deyu Zhou
Duomin Wang
Finn Wong
Baoyuan Wang
DiffM
57
37
0
07 Dec 2022
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
Chenxi Liu
69
235
0
19 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
63
170
0
17 Nov 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
260
753
0
29 Sep 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
171
3,882
0
26 Jul 2022
Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion
Evonne Ng
Hanbyul Joo
Liwen Hu
Hao Li
Trevor Darrell
Angjoo Kanazawa
Shiry Ginosar
VGen
43
94
0
18 Apr 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Haiyang Liu
Zihao Zhu
Naoya Iwamoto
Yichen Peng
Zhengqing Li
You Zhou
E. Bozkurt
Bo Zheng
SLR
CVBM
47
140
0
10 Mar 2022
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
Tiejun Zhao
Tao Mei
EGVM
45
46
0
27 Dec 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
66
199
0
16 Apr 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
297
3,671
0
18 Feb 2021
Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics
Evonne Ng
Shiry Ginosar
Trevor Darrell
Hanbyul Joo
SLR
3DH
55
45
0
23 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
241
5,774
0
20 Jun 2020
APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals
Jiangning Zhang
Lu Liu
Zhucun Xue
Yong Liu
CVBM
32
16
0
30 Apr 2020
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations
Chaitanya Ahuja
Shugao Ma
Louis-Philippe Morency
Yaser Sheikh
52
59
0
05 Oct 2019
Realistic Speech-Driven Facial Animation with GANs
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
120
294
0
14 Jun 2019
Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction
Hanbyul Joo
Tomas Simon
M. Cikara
Yaser Sheikh
47
94
0
10 Jun 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM
3DH
91
343
0
08 May 2019
Deep Appearance Models for Face Rendering
Stephen Lombardi
Jason M. Saragih
Tomas Simon
Yaser Sheikh
CVBM
3DH
59
283
0
01 Aug 2018
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
208
4,989
0
02 Nov 2017