ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.18159
  4. Cited By
DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation

DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation

23 March 2025
Peng Chen
Xiaobao Wei
Ming Lu
Hui Chen
Feng Tian
ArXiv (abs)PDFHTML

Papers citing "DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation"

18 / 18 papers shown
Title
ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance
ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance
Ying Li
Xiaobao Wei
Xiaowei Chi
Yiming Li
Zhongyu Zhao
Hao Wang
Ningning MA
Ming Lu
Shanghang Zhang
VGen
72
0
0
23 Apr 2025
DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion
  Transformer
DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
Zhiyuan Ma
Xiangyu Zhu
Guojun Qi
Chen Qian
Zhaoxiang Zhang
Zhen Lei
50
10
0
08 Feb 2024
FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using
  Diffusion
FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion
Stefan Stan
Kazi Injamamul Haque
Zerrin Yumak
DiffM
91
57
0
20 Sep 2023
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend
  3D Talking Faces
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces
Ziqiao Peng
Yihao Luo
Yue Shi
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
99
44
0
19 Jun 2023
Anti-DreamBooth: Protecting users from personalized text-to-image
  synthesis
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
89
100
0
27 Mar 2023
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
Ziqiao Peng
Hao Wu
Zhenbo Song
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
CVBM
94
108
0
20 Mar 2023
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Jinbo Xing
Menghan Xia
Yuechen Zhang
Xiaodong Cun
Jue Wang
T. Wong
87
149
0
06 Jan 2023
Multi-Concept Customization of Text-to-Image Diffusion
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
165
874
0
08 Dec 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,888
0
25 Aug 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for
  Conversational Gestures Synthesis
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Haiyang Liu
Zihao Zhu
Naoya Iwamoto
Yichen Peng
Zhengqing Li
You Zhou
E. Bozkurt
Bo Zheng
SLRCVBM
69
142
0
10 Mar 2022
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
CVBM
90
204
0
10 Dec 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
184
2,993
0
14 Jun 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality
  Disentanglement
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
66
200
0
16 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
969
29,810
0
26 Feb 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
299
5,837
0
20 Jun 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
712
18,310
0
19 Jun 2020
Capture, Learning, and Synthesis of 3D Speaking Styles
Capture, Learning, and Synthesis of 3D Speaking Styles
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM3DH
95
343
0
08 May 2019
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
1