ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.04427
  4. Cited By
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency

FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency

6 April 2025
Shiyan Liu
Rui Qu
Yan Jin
ArXivPDFHTML

Papers citing "FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency"

15 / 15 papers shown
Title
Seeing What You Said: Talking Face Generation Guided by a Lip Reading
  Expert
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
Jiadong Wang
Xinyuan Qian
Malu Zhang
R. Tan
Haizhou Li
EGVM
48
96
0
29 Mar 2023
Make-A-Video: Text-to-Video Generation without Text-Video Data
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
74
1,399
0
29 Sep 2022
Diffusion-GAN: Training GANs with Diffusion
Diffusion-GAN: Training GANs with Diffusion
Zhendong Wang
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
56
230
0
05 Jun 2022
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster
  Prediction
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
Bowen Shi
Wei-Ning Hsu
Kushal Lakhotia
Abdel-rahman Mohamed
SSL
86
315
0
05 Jan 2022
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
181
7,765
0
11 May 2021
Text2Video: Text-driven Talking-head Video Synthesis with Personalized
  Phoneme-Pose Dictionary
Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary
Sibo Zhang
Jiahong Yuan
Miao Liao
Liangjun Zhang
47
34
0
29 Apr 2021
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The
  Wild
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
96
777
0
23 Aug 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
505
17,888
0
19 Jun 2020
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
Zachary Teed
Jia Deng
MDE
211
2,612
0
26 Mar 2020
Towards Automatic Face-to-Face Translation
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
89
174
0
01 Mar 2020
Everybody's Talkin': Let Me Talk as You Want
Everybody's Talkin': Let Me Talk as You Want
Linsen Song
Wayne Wu
Chao Qian
Ran He
Chen Change Loy
DiffM
VGen
73
144
0
15 Jan 2020
Bridging Stereo Matching and Optical Flow via Spatiotemporal
  Correspondence
Bridging Stereo Matching and Optical Flow via Spatiotemporal Correspondence
Hsueh-Ying Lai
Yi-Hsuan Tsai
Wei-Chen Chiu
48
80
0
22 May 2019
Talking Face Generation by Conditional Recurrent Adversarial Network
Talking Face Generation by Conditional Recurrent Adversarial Network
Yang Song
Jingwen Zhu
Dawei Li
Xiaolong Wang
Hairong Qi
CVBM
120
194
0
13 Apr 2018
ObamaNet: Photo-realistic lip-sync from text
ObamaNet: Photo-realistic lip-sync from text
Rithesh Kumar
Jose M. R. Sotelo
Kundan Kumar
A. D. Brébisson
Yoshua Bengio
46
120
0
06 Dec 2017
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Eddy Ilg
N. Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
3DPC
242
3,077
0
06 Dec 2016
1