Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.10248
Cited By
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
14 February 2025
Guoqing Ma
Haoyang Huang
K. Yan
L. Chen
Nan Duan
S. Yin
Changyi Wan
Ranchen Ming
Xiaoniu Song
Xing Chen
Yu Zhou
Deshan Sun
Deyu Zhou
Jian Zhou
Kaijun Tan
Kang An
Mei Chen
Wei Ji
Qiling Wu
Wen Sun
Xin Han
Y. X. Wei
Zheng Ge
Aojie Li
Bin Wang
Bizhu Huang
Bo Wang
B. Li
Changxing Miao
C. Xu
Chenfei Wu
C. Yu
Dapeng Shi
Dingyuan Hu
Enle Liu
Gang Yu
Ge Yang
Guanzhe Huang
Gulin Yan
H. Feng
Hao Nie
Haonan Jia
Hanpeng Hu
H. Chen
Haolong Yan
H. Wang
Hongcheng Guo
Huilin Xiong
Huixin Xiong
Jiahao Gong
Jianchang Wu
J. Wu
Jie Wu
Jie Yang
J. Liu
J. Li
Jingyang Zhang
J. Guo
Junzhe Lin
K. Li
Lei Liu
Lei Xia
Liang Zhao
Liguo Tan
L. Huang
Liying Shi
Ming Li
M. Li
Muhua Cheng
Na Wang
Qiaohui Chen
Q. He
Qiuyan Liang
Quan Sun
R.-H. Sun
Rui Wang
Shaoliang Pang
S. M. I. Simon X. Yang
S. Liu
Siqi Liu
Shuli Gao
Tiancheng Cao
T. Wang
Weipeng Ming
Wenqing He
Xu Zhao
X. Zhang
Xianfang Zeng
X. Liu
X. Yang
Yaqi Dai
Yanbo Yu
Yang Li
Y. Deng
Y. Wang
Y. Wang
Yuanwei Lu
Yu-Cheng Chen
Yu-Juan Luo
Y. Luo
DiffM
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model"
14 / 14 papers shown
Title
STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives
Bo Wang
Haoyang Huang
Zhiyin Lu
F. Liu
Guoqing Ma
Jianlong Yuan
Y. Zhang
Nan Duan
VGen
29
0
0
13 May 2025
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
Weiyu Li
X. Zhang
Zheng Sun
Di Qi
H. Li
...
Zeming Li
Gang Yu
Xiangyu Zhang
Daxin Jiang
Ping Tan
36
0
0
12 May 2025
Generative Pre-trained Autoregressive Diffusion Transformer
Yuan Zhang
Jiacheng Jiang
Guoqing Ma
Zhiying Lu
Haoyang Huang
Jianlong Yuan
Nan Duan
VGen
40
1
0
12 May 2025
BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation
Ruotong Wang
Mingli Zhu
Jiarong Ou
R. J. Chen
Xin Tao
Pengfei Wan
Baoyuan Wu
DiffM
AAML
VGen
51
0
0
23 Apr 2025
DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation
Weijie He
Mushui Liu
Yunlong Yu
Zhao Wang
Chao Wu
DiffM
VGen
64
0
0
21 Apr 2025
H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models
Yushu Wu
Yanyu Li
Ivan Skorokhodov
Anil Kag
Willi Menapace
Sharath Girish
Aliaksandr Siarohin
Yanzhi Wang
Sergey Tulyakov
DiffM
VGen
37
0
0
14 Apr 2025
Aligning Anime Video Generation with Human Feedback
Bingwen Zhu
Yudong Jiang
Baohan Xu
Siqian Yang
Mingyu Yin
Yidi Wu
Huyang Sun
Zuxuan Wu
EGVM
VGen
48
0
0
14 Apr 2025
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Team Seawead
Ceyuan Yang
Zhijie Lin
Yang Zhao
Shanchuan Lin
...
Zuquan Song
Zhenheng Yang
Jiashi Feng
Jianchao Yang
Lu Jiang
DiffM
83
1
0
11 Apr 2025
On Data Synthesis and Post-training for Visual Abstract Reasoning
Ke Zhu
Y. Wang
Jiangjiang Liu
Qunyi Xie
Shanshan Liu
Gang Zhang
SyDa
LRM
47
0
0
02 Apr 2025
Video-T1: Test-Time Scaling for Video Generation
F. Liu
Hanyang Wang
Yimo Cai
Kaiyan Zhang
Xiaohang Zhan
Yueqi Duan
DiffM
VGen
78
1
0
24 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
J. Zhang
Lu Qi
X. Li
Yunhai Tong
48
0
0
21 Mar 2025
FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Minghan Li
C. Xie
Y. Wu
Lei Zhang
M. Wang
DiffM
VGen
57
0
0
17 Mar 2025
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He
Ceyuan Yang
Shanchuan Lin
Yinghao Xu
Meng Wei
Liangke Gui
Qi Zhao
Gordon Wetzstein
Lu Jiang
Hongsheng Li
DiffM
VGen
99
5
0
13 Mar 2025
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation
Jing Wang
Ao Ma
Ke Cao
Jun Zheng
Zhanjie Zhang
...
Yuhang Ma
Bo Cheng
Dawei Leng
Yuhui Yin
Xiaodan Liang
VGen
87
3
0
11 Mar 2025
1