
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Papers citing "VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing"
37 / 37 papers shown
Title |
---|
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer. Zhuoyi Yang, Jiayan Teng, Wendi Zheng, Ming Ding, Shiyu Huang, ..., Weihan Wang, Yean Cheng, Xiaotao Gu, Yuxiao Dong, Jie Tang |