Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.16160
Cited By
v1
v2 (latest)
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
24 September 2024
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
45 / 45 papers shown
Title
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
Yuxuan Luo
Zhengkun Rong
Lizhen Wang
Longhao Zhang
Tianshu Hu
Yongming Zhu
VGen
421
8
0
02 Apr 2025
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
86
9
0
28 Oct 2024
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Yuanxing Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen
DiffM
107
1
0
30 Sep 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
146
917
0
01 Aug 2024
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang
Jiaxi Gu
Li-Wen Wang
Han Wang
Junqi Cheng
Yuefeng Zhu
Fangyuan Zou
VGen
130
84
0
28 Jun 2024
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Shenhao Zhu
Junming Leo Chen
Zuozhuo Dai
Qingkun Su
Yinghui Xu
Xun Cao
Yao Yao
Hao Zhu
Siyu Zhu
3DH
VGen
106
124
0
21 Mar 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
225
809
0
19 Jan 2024
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Yifang Men
Biwen Lei
Yuan Yao
Miaomiao Cui
Zhouhui Lian
Xuansong Xie
SyDa
3DH
63
7
0
02 Jan 2024
DreaMoving: A Human Video Generation Framework based on Diffusion Models
Mengyang Feng
Jinlin Liu
Kai Yu
Yuan Yao
Zheng Hui
...
Xiaoyang Kang
Biwen Lei
Miaomiao Cui
Peiran Ren
Xuansong Xie
VGen
46
28
0
08 Dec 2023
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Liangxiao Hu
Hongwen Zhang
Yuxiang Zhang
Boyao Zhou
Boning Liu
Shengping Zhang
Liqiang Nie
3DGS
57
112
0
04 Dec 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffM
VGen
94
387
0
28 Nov 2023
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Zhongcong Xu
Jianfeng Zhang
Jun Hao Liew
Hanshu Yan
Jia-Wei Liu
Chenxu Zhang
Jiashi Feng
Mike Zheng Shou
VGen
DiffM
97
200
0
27 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
110
227
0
07 Nov 2023
ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou
Chongyi Li
Kelvin C. K. Chan
Chen Change Loy
ViT
90
104
0
07 Sep 2023
TADA! Text to Animatable Digital Avatars
Tingting Liao
Hongwei Yi
Yuliang Xiu
Jiaxaing Tang
Yangyi Huang
Justus Thies
Michael J. Black
136
100
0
21 Aug 2023
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans
Yangyi Huang
Hongwei Yi
Yuliang Xiu
Tingting Liao
Jiaxiang Tang
Deng Cai
Justus Thies
DiffM
87
86
0
16 Aug 2023
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Bernhard Kerbl
Georgios Kopanas
Thomas Leimkuehler
G. Drettakis
3DGS
231
3,770
0
08 Aug 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
107
867
0
10 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
94
85
0
30 Jun 2023
Humans in 4D: Reconstructing and Tracking Humans with Transformers
Shubham Goel
Georgios Pavlakos
Jathushan Rajasegaran
Angjoo Kanazawa
Jitendra Malik
3DH
83
190
0
31 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
68
33
0
15 May 2023
AG3D: Learning to Generate 3D Avatars from 2D Image Collections
Taehee Kim
Xu Chen
Jinlong Yang
Michael J. Black
Otmar Hilliges
Andreas Geiger
3DH
139
52
0
03 May 2023
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
Jia-Wei Liu
Yan-Pei Cao
Tianyu Yang
Eric Z. Xu
Jussi Keppo
Ying Shan
Xiaohu Qie
Mike Zheng Shou
3DH
71
28
0
24 Apr 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
196
1,092
0
18 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffM
VGen
75
145
0
12 Apr 2023
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos
Yue Ma
Yin-Yin He
Xiaodong Cun
Xintao Wang
Siran Chen
Ying Shan
Xiu Li
Qifeng Chen
DiffM
VGen
75
191
0
03 Apr 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
177
4,146
1
10 Feb 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
116
737
0
22 Dec 2022
3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping
Zhuoqian Yang
Shikai Li
Wayne Wu
Bo Dai
3DH
71
12
0
14 Dec 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
311
627
0
29 May 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
204
1,626
0
07 Apr 2022
NeuMan: Neural Human Radiance Field from a Single Video
Wei Jiang
K. M. Yi
Golnoosh Samei
Oncel Tuzel
Anurag Ranjan
3DH
73
225
0
23 Mar 2022
SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video
Boyi Jiang
Yang Hong
Hujun Bao
Juyong Zhang
3DH
155
161
0
30 Jan 2022
HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video
Chung-Yi Weng
Brian L. Curless
Pratul P. Srinivasan
Jonathan T. Barron
Ira Kemelmacher-Shlizerman
3DH
77
471
0
11 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
463
15,665
0
20 Dec 2021
Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies
Sida Peng
Junting Dong
Qianqian Wang
Shangzhan Zhang
Qing Shuai
Xiaowei Zhou
Hujun Bao
3DH
AI4CE
125
378
0
06 May 2021
A-NeRF: Articulated Neural Radiance Fields for Learning Human Shape, Appearance, and Pose
Shih-Yang Su
Frank Yu
Michael Zollhoefer
Helge Rhodin
3DH
195
256
0
11 Feb 2021
Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans
Sida Peng
Yuanqing Zhang
Yinghao Xu
Qianqian Wang
Qing Shuai
Hujun Bao
Xiaowei Zhou
3DH
281
702
0
31 Dec 2020
Modular Primitives for High-Performance Differentiable Rendering
S. Laine
Janne Hellsten
Tero Karras
Yeongho Seol
J. Lehtinen
Timo Aila
67
453
0
06 Nov 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
669
18,276
0
19 Jun 2020
ARCH: Animatable Reconstruction of Clothed Humans
Zeng Huang
Yuanlu Xu
Christoph Lassner
Hao Li
Tony Tung
3DH
75
332
0
08 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
455
10,591
0
17 Feb 2020
AMASS: Archive of Motion Capture as Surface Shapes
Naureen Mahmood
N. Ghorbani
N. Troje
Gerard Pons-Moll
Michael J. Black
3DH
48
1,259
0
05 Apr 2019
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
91
737
0
03 Dec 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
377
11,877
0
11 Jan 2018
1