ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.14717
  4. Cited By
CelebV-Text: A Large-Scale Facial Text-Video Dataset

CelebV-Text: A Large-Scale Facial Text-Video Dataset

26 March 2023
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
ArXivPDFHTML

Papers citing "CelebV-Text: A Large-Scale Facial Text-Video Dataset"

50 / 50 papers shown
Title
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
52
0
0
01 May 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
31
0
0
19 Apr 2025
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Yang Shi
Jiaheng Liu
Yushuo Guan
Z. Wu
Y. Zhang
...
Bohan Zeng
W. Zhang
Fuzheng Zhang
Wenjing Yang
Di Zhang
VGen
VLM
71
0
0
14 Apr 2025
FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment
FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment
Sijing Wu
Yunhao Li
Ziwen Xu
Yixuan Gao
Huiyu Duan
Wei Sun
Guangtao Zhai
66
1
0
12 Apr 2025
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
Kim Sung-Bin
Jeongsoo Choi
Puyuan Peng
Joon Son Chung
Tae-Hyun Oh
David F. Harwath
VGen
45
1
0
03 Apr 2025
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Fa-Ting Hong
Zunnan Xu
Zixiang Zhou
Jun Zhou
Xiu Li
Qin Lin
Qinglin Lu
D. Xu
DiffM
VGen
57
2
0
03 Apr 2025
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Kun Liu
Qi Liu
Xinchen Liu
Jie Li
Yongdong Zhang
Jiebo Luo
Xiaodong He
Wu Liu
VGen
35
0
0
31 Mar 2025
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
Yukang Lin
Hokit Fung
Jianjin Xu
Zeping Ren
Adela S.M. Lau
Guosheng Yin
Xiu Li
VGen
42
5
0
25 Mar 2025
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Hao Kang
Xin Lu
43
1
0
20 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Visual Persona: Foundation Model for Full-Body Human Customization
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
46
0
0
19 Mar 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
W. Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
79
2
0
04 Mar 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
M. Pantic
DiffM
VGen
65
2
0
03 Mar 2025
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
Lingzhou Mu
Baiji Liu
Ruonan Zhang
Guiming Mo
Jiawei Jin
Kai Zhang
Haozhi Huang
DiffM
VGen
56
1
0
26 Feb 2025
PERSE: Personalized 3D Generative Avatars from A Single Portrait
PERSE: Personalized 3D Generative Avatars from A Single Portrait
Hyunsoo Cha
Inhee Lee
Hanbyul Joo
3DGS
41
1
0
31 Dec 2024
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Guocheng Qian
Kuan-Chieh Jackson Wang
Or Patashnik
Negin Heravi
Daniil Ostashev
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
82
4
0
12 Dec 2024
HiFiVFS: High Fidelity Video Face Swapping
HiFiVFS: High Fidelity Video Face Swapping
Xu Chen
Keke He
Junwei Zhu
Yanhao Ge
Wei Li
Chengjie Wang
VGen
DiffM
78
1
0
27 Nov 2024
MotionCharacter: Identity-Preserving and Motion Controllable Human Video
  Generation
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation
Haopeng Fang
Di Qiu
Binjie Mao
Pengfei Yan
He Tang
VGen
DiffM
70
4
0
27 Nov 2024
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Xiaozhong Ji
Xiaobin Hu
Zhihong Xu
Junwei Zhu
Chuming Lin
...
Donghao Luo
Yi Chen
Qin Lin
Qinglin Lu
Chengjie Wang
VGen
73
4
0
25 Nov 2024
HumanVLM: Foundation for Human-Scene Vision-Language Model
HumanVLM: Foundation for Human-Scene Vision-Language Model
Dawei Dai
Xu Long
Li Yutang
Zhang YuanHui
Shuyin Xia
VLM
MLLM
37
1
0
05 Nov 2024
Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions
Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions
Malte Prinzler
Egor Zakharov
V. Sklyarova
Berna Kabadayi
Justus Thies
DiffM
26
5
0
21 Oct 2024
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Sijing Wu
Yunhao Li
Yichao Yan
Huiyu Duan
Ziwei Liu
Guangtao Zhai
3DH
VGen
34
4
0
10 Oct 2024
Face Forgery Detection with Elaborate Backbone
Face Forgery Detection with Elaborate Backbone
Zonghui Guo
Y. Liu
Jie Zhang
Haiyong Zheng
Shiguang Shan
AAML
CVBM
28
1
0
25 Sep 2024
FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video
  Dataset
FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset
Donglin Di
H. Feng
Wenzhang Sun
Yongjia Ma
Hao Li
Wei Chen
Xiaofei Gou
Tonghua Su
Xun Yang
CVBM
46
2
0
23 Sep 2024
InstantDrag: Improving Interactivity in Drag-based Image Editing
InstantDrag: Improving Interactivity in Drag-based Image Editing
Joonghyuk Shin
Daehyeon Choi
Jaesik Park
DiffM
44
6
0
13 Sep 2024
What to Preserve and What to Transfer: Faithful, Identity-Preserving
  Diffusion-based Hairstyle Transfer
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
Chaeyeon Chung
Sunghyun Park
J. Kim
Jaegul Choo
DiffM
36
0
0
29 Aug 2024
15M Multimodal Facial Image-Text Dataset
15M Multimodal Facial Image-Text Dataset
Dawei Dai
Yutang Li
Yingge Liu
Mingming Jia
Zhang YuanHui
Guoyin Wang
VLM
28
7
0
11 Jul 2024
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
Kepan Nan
Rui Xie
Penghao Zhou
Tiehan Fan
Zhenheng Yang
Zhijie Chen
Xiang Li
Jian Yang
Ying Tai
78
68
0
02 Jul 2024
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with
  Multilingual Video Dataset
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset
Kim Sung-Bin
Lee Chae-Yeon
Gihun Son
Oh Hyun-Bin
Janghoon Ju
Suekyeong Nam
Tae-Hyun Oh
34
11
0
20 Jun 2024
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
Rotem Shalev-Arkushin
Aharon Azulay
Tavi Halperin
Eitan Richardson
Amit H. Bermano
Ohad Fried
DiffM
44
0
0
20 Jun 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
20
0
17 May 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
33
39
0
23 Apr 2024
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
Felix Taubner
Prashant Raina
Mathieu Tuli
Eu Wern Teh
Chul Lee
Jinmiao Huang
3DH
CVBM
38
4
0
15 Apr 2024
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Yu Deng
Duomin Wang
Baoyuan Wang
40
21
0
20 Mar 2024
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video
  Diffusion Models
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
Wenhao Wang
Yi Yang
VGen
DiffM
33
31
0
10 Mar 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Eric Wang
X. Li
Luisa Verdoliva
Shu Hu
86
56
0
22 Jan 2024
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved
  Personalization
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Xu Peng
Junwei Zhu
Boyuan Jiang
Ying Tai
Donghao Luo
Jiangning Zhang
Wei Lin
Taisong Jin
Chengjie Wang
Rongrong Ji
DiffM
27
54
0
11 Dec 2023
FT2TF: First-Person Statement Text-To-Talking Face Generation
FT2TF: First-Person Statement Text-To-Talking Face Generation
Xingjian Diao
Ming Cheng
Wayner Barrios
SouYoung Jin
38
11
0
09 Dec 2023
AgentAvatar: Disentangling Planning, Driving and Rendering for
  Photorealistic Avatar Agents
AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents
Duomin Wang
Bin Dai
Yu Deng
Baoyuan Wang
VGen
37
4
0
29 Nov 2023
FLAIR: A Conditional Diffusion Framework with Applications to Face Video
  Restoration
FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration
Zihao Zou
Jiaming Liu
S. Shoushtari
Yubo Wang
Weijie Gan
Ulugbek S. Kamilov
VGen
DiffM
28
2
0
26 Nov 2023
LaughTalk: Expressive 3D Talking Head Generation with Laughter
LaughTalk: Expressive 3D Talking Head Generation with Laughter
Kim Sung-Bin
Lee Hyun
Da Hye Hong
Suekyeong Nam
Janghoon Ju
Tae-Hyun Oh
20
20
0
02 Nov 2023
A Survey on Video Diffusion Models
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
57
116
0
16 Oct 2023
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer
  Vision
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
Jianning Li
Zongwei Zhou
Jiancheng Yang
Antonio Pepe
Christina Schwarz-Gsaxner
...
B. Puladi
Pascal Fua
Alan L. Yuille
Jens Kleesiek
Jan Egger
MedIm
3DH
26
32
0
30 Aug 2023
AutoDecoding Latent 3D Diffusion Models
AutoDecoding Latent 3D Diffusion Models
Evangelos Ntavelis
Aliaksandr Siarohin
Kyle Olszewski
Chao-Yuan Wang
Luc Van Gool
Sergey Tulyakov
DiffM
39
43
0
07 Jul 2023
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified
  Visual Modalities
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities
Kangfu Mei
Mo Zhou
Vishal M. Patel
DiffM
18
1
0
24 May 2023
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking
  Styles
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles
Yifeng Ma
Suzhe Wang
Yu-qiong Ding
Lincheng Li
Bowen Ma
Tangjie Lv
Changjie Fan
Zhipeng Hu
Zhidong Deng
Xin Yu
CLIP
27
21
0
01 Apr 2023
PoseScript: Linking 3D Human Poses and Natural Language
PoseScript: Linking 3D Human Poses and Natural Language
Ginger Delmas
Philippe Weinzaepfel
Thomas Lucas
Francesc Moreno-Noguer
Grégory Rogez
3DH
30
1
0
21 Oct 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
254
565
0
29 May 2022
Flexible Diffusion Modeling of Long Videos
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian Weilbach
Frank D. Wood
DiffM
BDL
VGen
176
285
0
23 May 2022
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
219
2,233
0
14 Jun 2018
Imagine This! Scripts to Compositions to Videos
Imagine This! Scripts to Compositions to Videos
Tanmay Gupta
Dustin Schwenk
Ali Farhadi
Derek Hoiem
Aniruddha Kembhavi
CoGe
VGen
111
87
0
10 Apr 2018
1