Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.20255
Cited By
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models
26 May 2025
Muyao Niu
Mingdeng Cao
Yifan Zhan
Qingtian Zhu
Mingze Ma
Jiancheng Zhao
Yanhong Zeng
Zhihang Zhong
Xiao Sun
Yinqiang Zheng
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models"
34 / 34 papers shown
Title
AnimateAnywhere: Rouse the Background in Human Image Animation
Xiaoyu Liu
Mingshuai Yao
Y. Zhang
Xianhui Lin
Peiran Ren
Xiaochen Li
Ming-Yu Liu
W. Zuo
3DH
DiffM
99
1
0
28 Apr 2025
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Xinyu Wang
Shiwei Zhang
Longxiang Tang
Yuanxing Zhang
Changxin Gao
Yuehuan Wang
Nong Sang
VGen
57
6
0
15 Apr 2025
LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Jing Wen
Alexander Schwing
Shenlong Wang
3DGS
3DH
145
1
0
13 Feb 2025
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu
Guangyuan Wang
Zhen Shen
Xin Gao
Dechao Meng
Lian Zhuo
Peng Zhang
Bang Zhang
Liefeng Bo
DiffM
VGen
152
19
0
10 Feb 2025
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Jinbo Xing
Long Mai
Cusuh Ham
Jiahui Huang
Aniruddha Mahapatra
Chi-Wing Fu
T. Wong
Feng Liu
DiffM
VGen
229
5
0
06 Feb 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM
VGen
326
29
0
03 Feb 2025
LTX-Video: Realtime Video Latent Diffusion
Yoav HaCohen
Nisan Chiprut
Benny Brazowski
Daniel Shalem
Dudu Moshe
...
Sapir Weissbuch
Victor Kulikov
Yaki Bitterman
Zeev Melumian
Ofir Bibi
VGen
132
67
0
03 Jan 2025
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Shuwei Shi
Biao Gong
Xi Chen
Dandan Zheng
Shuai Tan
...
Jingwen He
Kecheng Zheng
Jingdong Chen
Ming-Hsuan Yang
Yinqiang Zheng
VGen
DiffM
84
4
0
08 Dec 2024
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
86
9
0
28 Oct 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
116
29
0
24 Sep 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM
VGen
237
558
0
12 Aug 2024
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang
Jiaxi Gu
Li-Wen Wang
Han Wang
Junqi Cheng
Yuefeng Zhu
Fangyuan Zou
VGen
133
85
0
28 Jun 2024
Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation
Changgu Chen
Junwei Shu
Lianggangxu Chen
Gaoqi He
Changbo Wang
VGen
66
16
0
18 Jan 2024
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
54
15
0
05 Dec 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
70
309
0
30 Oct 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
88
117
0
12 Oct 2023
ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou
Chongyi Li
Kelvin C. K. Chan
Chen Change Loy
ViT
95
105
0
07 Sep 2023
Effective Whole-body Pose Estimation with Two-stages Distillation
Zhendong Yang
Ailing Zeng
Chun Yuan
Yu Li
106
180
0
29 Jul 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
569
4,910
0
17 Apr 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
74
574
0
23 Mar 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
118
2,418
0
19 Dec 2022
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yin-Yin He
Tianyu Yang
Yong Zhang
Ying Shan
Qifeng Chen
DiffM
VGen
95
238
0
23 Nov 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
314
629
0
29 May 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
209
1,638
0
07 Apr 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
493
15,734
0
20 Dec 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
967
29,810
0
26 Feb 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
289
7,469
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
694
18,310
0
19 Jun 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
481
20,317
0
23 Oct 2019
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
Georgios Pavlakos
Vasileios Choutas
N. Ghorbani
Timo Bolkart
Ahmed A. A. Osman
Dimitrios Tzionas
Michael J. Black
3DH
58
1,725
0
11 Apr 2019
DensePose: Dense Human Pose Estimation In The Wild
R. Güler
Natalia Neverova
Iasonas Kokkinos
3DH
288
1,407
0
01 Feb 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
384
11,905
0
11 Jan 2018
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
230
5,071
0
02 Nov 2017
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
1.9K
77,378
0
18 May 2015
1