From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

3 January 2024

Papers citing "From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations"

Title | Authors | Tags | (three unlabeled counts) | Date
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion | Evgeniia Vu, Andrei Boiarov, Dmitry Vetrov | VGen | 76 0 0 | 13 Mar 2025
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE | Sichun Wu, Kazi Injamamul Haque, Zerrin Yumak | VGen | 65 2 0 | 12 Sep 2024
DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao, D. B. Wang, Qing-Yao Tian, Yao-Dong Yang, Hengyu Meng, Zeyu Cai, Bo Dong, Yu Zhang, Kang Zhang, Zhaoxiang Wang | 3DGS | 65 4 0 | 20 Aug 2024
Gaussian Eigen Models for Human Heads | Wojciech Zielonka, Timo Bolkart, Thabo Beeler, Justus Thies | 3DGS | 95 5 0 | 05 Jul 2024
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors | Zhentao Yu, Zixin Yin, Deyu Zhou, Duomin Wang, Finn Wong, Baoyuan Wang | DiffM | 57 37 0 | 07 Dec 2022
EDGE: Editable Dance Generation From Music | Jo-Han Tseng, Rodrigo Castellon, Chenxi Liu | - | 69 235 0 | 19 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models | Simon Alexanderson, Rajmund Nagy, Jonas Beskow, G. Henter | DiffM, VGen | 63 170 0 | 17 Nov 2022
Human Motion Diffusion Model | Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, Amit H. Bermano | DiffM, VGen | 260 753 0 | 29 Sep 2022
Classifier-Free Diffusion Guidance | Jonathan Ho, Tim Salimans | FaML | 171 3,882 0 | 26 Jul 2022
Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion | Evonne Ng, Hanbyul Joo, Liwen Hu, Hao Li, Trevor Darrell, Angjoo Kanazawa, Shiry Ginosar | VGen | 43 94 0 | 18 Apr 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, E. Bozkurt, Bo Zheng | SLR, CVBM | 47 140 0 | 10 Mar 2022
Responsive Listening Head Generation: A Benchmark Dataset and Baseline | Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei | EGVM | 45 46 0 | 27 Dec 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement | Alexander Richard, Michael Zollhoefer, Yandong Wen, Fernando de la Torre, Yaser Sheikh | CVBM | 66 199 0 | 16 Apr 2021
Improved Denoising Diffusion Probabilistic Models | Alex Nichol, Prafulla Dhariwal | DiffM | 297 3,671 0 | 18 Feb 2021
Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics | Evonne Ng, Shiry Ginosar, Trevor Darrell, Hanbyul Joo | SLR, 3DH | 55 45 0 | 23 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations | Alexei Baevski, Henry Zhou, Abdel-rahman Mohamed, Michael Auli | SSL | 241 5,774 0 | 20 Jun 2020
APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals | Jiangning Zhang, Lu Liu, Zhucun Xue, Yong Liu | CVBM | 32 16 0 | 30 Apr 2020
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations | Chaitanya Ahuja, Shugao Ma, Louis-Philippe Morency, Yaser Sheikh | - | 52 59 0 | 05 Oct 2019
Realistic Speech-Driven Facial Animation with GANs | Konstantinos Vougioukas, Stavros Petridis, Maja Pantic | - | 120 294 0 | 14 Jun 2019
Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction | Hanbyul Joo, Tomas Simon, M. Cikara, Yaser Sheikh | - | 47 94 0 | 10 Jun 2019
Capture, Learning, and Synthesis of 3D Speaking Styles | Daniel Cudeiro, Timo Bolkart, Cassidy Laidlaw, Anurag Ranjan, Michael J. Black | CVBM, 3DH | 91 343 0 | 08 May 2019
Deep Appearance Models for Face Rendering | Stephen Lombardi, Jason M. Saragih, Tomas Simon, Yaser Sheikh | CVBM, 3DH | 59 283 0 | 01 Aug 2018
Neural Discrete Representation Learning | Aaron van den Oord, Oriol Vinyals, Koray Kavukcuoglu | BDL, SSL, OCL | 208 4,989 0 | 02 Nov 2017