Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders

15 August 2021
Jing Li
Di Kang
Wenjie Pei
Xuefei Zhe
Ying Zhang
Zhenyu He
Linchao Bao
    SLR

Papers citing "Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders"

27 / 27 papers shown
Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Xingqun Qi
Yatian Wang
Hengyuan Zhang
J. Pan
Wei Xue
Shanghang Zhang
Wenhan Luo
Qifeng Liu
Yike Guo
SLR
66
0
0
03 May 2025
EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation
Xiangyue Zhang
Jianfang Li
Jiaxu Zhang
Jianqiang Ren
Liefeng Bo
Zhigang Tu
30
0
0
12 Apr 2025
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis
Xukun Zhou
Fengxin Li
Ming Chen
Yan Zhou
Pengfei Wan
Di Zhang
Yeying Jin
Zhaoxin Fan
Hongyan Liu
Jun He
DiffM
VGen
51
0
0
09 Mar 2025
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
Sichun Wu
Kazi Injamamul Haque
Zerrin Yumak
VGen
30
2
0
12 Sep 2024
DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Jisoo Kim
Jungbin Cho
Joonho Park
Soonmin Hwang
Da Eun Kim
Geon Kim
Youngjae Yu
57
1
0
12 Aug 2024
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion
Chencan Fu
Yabiao Wang
Jiangning Zhang
Zhengkai Jiang
Xiaofeng Mao
Jiafu Wu
Weijian Cao
Chengjie Wang
Yanhao Ge
Yong Liu
Mamba
43
2
0
29 Jul 2024
Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
Guanyu Hu
Jie Wei
Siyang Song
Dimitrios Kollias
Xinyu Yang
Zhonglin Sun
Odysseus Kaloidas
40
0
0
22 Jul 2024
Investigating the impact of 2D gesture representation on co-speech gesture generation
Teo Guichoux
Laure Soulier
Nicolas Obin
Catherine Pelachaud
SLR
19
0
0
21 Jun 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
46
25
0
14 Mar 2024
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Xingqun Qi
Jiahao Pan
Peng Li
Ruibin Yuan
Xiaowei Chi
...
Wenhan Luo
Wei Xue
Shanghang Zhang
Qi-fei Liu
Yi-Ting Guo
SLR
31
11
0
29 Nov 2023
SpeechAct: Towards Generating Whole-body Motion from Speech
Jinsong Zhang
Minjie Zhu
Yuxiang Zhang
Yebin Liu
Kun Li
28
0
0
29 Nov 2023
UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons
Sicheng Yang
Zehao Wang
Zhiyong Wu
Minglei Li
Zhensong Zhang
...
Lei Hao
Songcen Xu
Xiaofei Wu
Changpeng Yang
Zonghong Dai
DiffM
44
14
0
13 Sep 2023
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Anna Deichler
Shivam Mehta
Simon Alexanderson
Jonas Beskow
DiffM
17
23
0
11 Sep 2023
BodyFormer: Semantics-guided 3D Body Gesture Synthesis with Transformer
Kunkun Pang
Dafei Qin
Yingruo Fan
Julian Habekost
Takaaki Shiratori
Junichi Yamagishi
Taku Komura
SLR
ViT
21
19
0
07 Sep 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
29
0
0
11 Aug 2023
Human Motion Generation: A Survey
Wentao Zhu
Xiaoxuan Ma
Dongwoo Ro
Hai Ci
Jinlu Zhang
Jiaxin Shi
Feng Gao
Qi Tian
Yizhou Wang
VGen
47
53
0
20 Jul 2023
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Li-Ping Yin
Yijun Wang
Tianyu He
Jinming Liu
Wei Zhao
Bohan Li
Xin Jin
Jianxin Lin
DiffM
24
14
0
20 Jun 2023
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation
Sicheng Yang
Zhiyong Wu
Minglei Li
Zhensong Zhang
Lei Hao
Weihong Bao
Hao-Wen Zhuang
SLR
24
41
0
18 May 2023
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Lingting Zhu
Xian Liu
Xuanyu Liu
Rui Qian
Ziwei Liu
Lequan Yu
28
114
0
16 Mar 2023
Audio2Gestures: Generating Diverse Gestures from Audio
Jing Li
Di Kang
Wenjie Pei
Xuefei Zhe
Ying Zhang
Linchao Bao
Zhenyu He
DiffM
SLR
32
8
0
17 Jan 2023
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Simbarashe Nyatsanga
Taras Kucherenko
Chaitanya Ahuja
G. Henter
Michael Neff
SLR
33
90
0
13 Jan 2023
Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Anni Tang
Tianyu He
Xuejiao Tan
Jun Ling
Liang Song
CVBM
26
23
0
09 Dec 2022
Audio-Driven Co-Speech Gesture Video Generation
Xian Liu
Qianyi Wu
Hang Zhou
Yuanqi Du
Wayne Wu
Dahua Lin
Ziwei Liu
SLR
VGen
34
47
0
05 Dec 2022
Multimodal Transformer for Parallel Concatenated Variational Autoencoders
Stephen D. Liang
J. Mendel
ViT
27
5
0
28 Oct 2022
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings
Tenglong Ao
Qingzhe Gao
Yuke Lou
Baoquan Chen
Libin Liu
SLR
27
59
0
04 Oct 2022
NEURAL MARIONETTE: A Transformer-based Multi-action Human Motion Synthesis System
Weiqiang Wang
Xuefei Zhe
Huan Chen
Di Kang
Tingguang Li
Ruizhi Chen
Linchao Bao
46
5
0
27 Sep 2022
The ReprGesture entry to the GENEA Challenge 2022
Sicheng Yang
Zhiyong Wu
Minglei Li
Mengchen Zhao
Jiuxin Lin
Liyang Chen
Weihong Bao
30
11
0
25 Aug 2022