Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.01885
Cited By
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
3 January 2024
Evonne Ng
Javier Romero
Timur M. Bagautdinov
Shaojie Bai
Trevor Darrell
Angjoo Kanazawa
Alexander Richard
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations"
23 / 23 papers shown
Title
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion
Evgeniia Vu
Andrei Boiarov
Dmitry Vetrov
VGen
76
0
0
13 Mar 2025
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
Sichun Wu
Kazi Injamamul Haque
Zerrin Yumak
VGen
65
2
0
12 Sep 2024
DEGAS: Detailed Expressions on Full-Body Gaussian Avatars
Zhijing Shao
D. B. Wang
Qing-Yao Tian
Yao-Dong Yang
Hengyu Meng
Zeyu Cai
Bo Dong
Yu Zhang
Kang Zhang
Zhaoxiang Wang
3DGS
65
4
0
20 Aug 2024
Gaussian Eigen Models for Human Heads
Wojciech Zielonka
Timo Bolkart
Thabo Beeler
Justus Thies
3DGS
95
5
0
05 Jul 2024
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu
Zixin Yin
Deyu Zhou
Duomin Wang
Finn Wong
Baoyuan Wang
DiffM
57
37
0
07 Dec 2022
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
Chenxi Liu
69
235
0
19 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
63
170
0
17 Nov 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
260
753
0
29 Sep 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
171
3,882
0
26 Jul 2022
Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion
Evonne Ng
Hanbyul Joo
Liwen Hu
Hao Li
Trevor Darrell
Angjoo Kanazawa
Shiry Ginosar
VGen
43
94
0
18 Apr 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Haiyang Liu
Zihao Zhu
Naoya Iwamoto
Yichen Peng
Zhengqing Li
You Zhou
E. Bozkurt
Bo Zheng
SLR
CVBM
47
140
0
10 Mar 2022
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
Tiejun Zhao
Tao Mei
EGVM
45
46
0
27 Dec 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
66
199
0
16 Apr 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
297
3,671
0
18 Feb 2021
Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics
Evonne Ng
Shiry Ginosar
Trevor Darrell
Hanbyul Joo
SLR
3DH
55
45
0
23 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
241
5,774
0
20 Jun 2020
APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals
Jiangning Zhang
Lu Liu
Zhucun Xue
Yong Liu
CVBM
32
16
0
30 Apr 2020
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations
Chaitanya Ahuja
Shugao Ma
Louis-Philippe Morency
Yaser Sheikh
52
59
0
05 Oct 2019
Realistic Speech-Driven Facial Animation with GANs
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
120
294
0
14 Jun 2019
Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction
Hanbyul Joo
Tomas Simon
M. Cikara
Yaser Sheikh
47
94
0
10 Jun 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM
3DH
91
343
0
08 May 2019
Deep Appearance Models for Face Rendering
Stephen Lombardi
Jason M. Saragih
Tomas Simon
Yaser Sheikh
CVBM
3DH
59
283
0
01 Aug 2018
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
208
4,989
0
02 Nov 2017
1