Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.09414
Cited By
Vlogger: Make Your Dream A Vlog
17 January 2024
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vlogger: Make Your Dream A Vlog"
32 / 32 papers shown
Title
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation
Panwen Hu
Jiehui Huang
Qiang Sun
Xiaodan Liang
DiffM
VGen
28
0
0
11 May 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Bingjie Gao
Xinyu Gao
Xiaoxue Wu
Yujie Zhou
Yu Qiao
Li Niu
Xinyuan Chen
Yaohui Wang
76
0
0
16 Apr 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
52
0
0
12 Mar 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang
Yiwei Guo
Yanbo Ding
Kunchang Li
Xinyuan Chen
Yaohui Wang
Fangyikang Wang
Ying Zhang
Chen Li
Y. Wang
45
0
0
10 Mar 2025
Text2Story: Advancing Video Storytelling with Text Guidance
Taewon Kang
D. Kothandaraman
Ming C. Lin
DiffM
VGen
59
0
0
08 Mar 2025
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Zhipeng Huang
Shaobin Zhuang
Canmiao Fu
Binxin Yang
Ying Zhang
Chong Sun
Zhizheng Zhang
Yali Wang
Chen Li
Zheng-Jun Zha
DiffM
69
1
0
03 Mar 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
48
0
0
15 Jan 2025
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao
Feng Cheng
Lu Qi
Liangke Gui
Jiepeng Cen
Zhibei Ma
Alan L. Yuille
Lu Jiang
VGen
58
2
0
10 Jan 2025
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
X. Li
Zeliang Zhang
Chenliang Xu
VGen
88
7
0
08 Jan 2025
CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image
Wonseok Roh
Hwanhee Jung
Jong Wook Kim
S. Lee
Innfarn Yoo
Andreas Lugmayr
Seunggeun Chi
K. Ramani
Sangpil Kim
3DGS
87
2
0
17 Dec 2024
Motion Dreamer: Boundary Conditional Motion Reasoning for Physically Coherent Video Generation
Tianshuo Xu
Zhifei Chen
Leyi Wu
Hao Lu
Yuying Chen
Lihui Jiang
Bingbing Liu
Yingcong Chen
VGen
73
2
0
30 Nov 2024
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Rang Meng
Xingyu Zhang
Yuming Li
Chenguang Ma
28
5
0
15 Nov 2024
Progressive Autoregressive Video Diffusion Models
Desai Xie
Zhan Xu
Yicong Hong
Hao Tan
Difan Liu
Feng Liu
Arie E. Kaufman
Yang Zhou
VGen
DiffM
58
10
0
10 Oct 2024
Compositional 3D-aware Video Generation with LLM Director
Hanxin Zhu
Tianyu He
Anni Tang
Junliang Guo
Zhibo Chen
Jiang Bian
DiffM
VGen
33
7
0
31 Aug 2024
AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Minheng Ni
Chenfei Wu
Huaying Yuan
Zhengyuan Yang
Ming Gong
Lijuan Wang
Zicheng Liu
Wangmeng Zuo
Nan Duan
VGen
33
0
0
21 Aug 2024
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Yunxin Li
Haoyuan Shi
Baotian Hu
Longyue Wang
Jiashun Zhu
Jinyi Xu
Zhen Zhao
Min Zhang
VGen
40
5
0
19 Aug 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffM
VGen
90
3
0
19 Aug 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu
Yuanzhi Liang
Linchao Zhu
Yi Yang
DiffM
VGen
41
27
0
29 Jul 2024
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
Xingrui Wang
Xin Li
Zhibo Chen
DiffM
47
1
0
07 Jun 2024
VideoTetris: Towards Compositional Text-to-Video Generation
Ye Tian
Ling Yang
Haotian Yang
Yuan Gao
Yufan Deng
...
Zhaochen Yu
Xin Tao
Pengfei Wan
Di Zhang
Bin Cui
DiffM
VGen
84
15
0
06 Jun 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-cong Chen
ObjD
MLLM
VLM
32
4
0
27 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
20
0
17 May 2024
A Survey on Long Video Generation: Challenges, Methods, and Prospects
Chengxuan Li
Di Huang
Zeyu Lu
Yang Xiao
Qingqi Pei
Lei Bai
EGVM
42
19
0
25 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
53
27
0
20 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
36
7
0
12 Mar 2024
VideoMamba: State Space Model for Efficient Video Understanding
Kunchang Li
Xinhao Li
Yi Wang
Yinan He
Yali Wang
Limin Wang
Yu Qiao
Mamba
37
180
0
11 Mar 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
75
257
0
27 Feb 2024
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
250
1,073
0
05 Oct 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
254
565
0
29 May 2022
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
245
484
0
20 Apr 2021
InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation
Yaohui Wang
F. Brémond
A. Dantcheva
VGen
GAN
142
25
0
08 Jan 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
294
75,800
0
18 May 2015
1