ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.12035
  4. Cited By
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency,
  Controllability and Compatibility

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

18 March 2024
Bojia Zi
Shihao Zhao
Xianbiao Qi
Jianan Wang
Yukai Shi
Qianyu Chen
Bin Liang
Kam-Fai Wong
Lei Zhang
    DiffM
    VGen
ArXivPDFHTML

Papers citing "CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility"

17 / 17 papers shown
Title
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Jiaxin Huang
Sheng Miao
BangBnag Yang
Yuewen Ma
Yiyi Liao
VGen
MDE
33
0
0
15 Apr 2025
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
34
0
0
04 Apr 2025
Enabling Versatile Controls for Video Diffusion Models
Enabling Versatile Controls for Video Diffusion Models
Xu Zhang
Hao Zhou
Haoming Qin
Xiaobin Lu
Jiaxing Yan
Guanzhong Wang
Zeyu Chen
Yi Liu
DiffM
VGen
65
0
0
21 Mar 2025
MTV-Inpaint: Multi-Task Long Video Inpainting
Shiyuan Yang
Zheng Gu
Liang Hou
Xin Tao
Pengfei Wan
Xiaodong Chen
Jing Liao
DiffM
63
0
0
14 Mar 2025
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Hao Wang
Zhaoyang Zhang
Xuan Ju
Mingdeng Cao
Liangbin Xie
Ying Shan
Qiang Xu
VGen
DiffM
73
1
0
07 Mar 2025
Track Anything Behind Everything: Zero-Shot Amodal Video Object
  Segmentation
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation
Finlay G. C. Hudson
W. Smith
VOS
VLM
78
0
0
28 Nov 2024
Generative Omnimatte: Learning to Decompose Video into Layers
Generative Omnimatte: Learning to Decompose Video into Layers
Yao-Chih Lee
Erika Lu
Sarah Rumbley
Michal Geyer
Jia-Bin Huang
Tali Dekel
Forrester Cole
DiffM
VGen
105
5
0
25 Nov 2024
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video
  Local Editing
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Jiahao Hu
Tianxiong Zhong
Xuebo Wang
Boyuan Jiang
Xingye Tian
Fei Yang
Pengfei Wan
Di Zhang
VGen
74
2
0
22 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
58
3
0
17 Nov 2024
Video Diffusion Models are Strong Video Inpainter
Video Diffusion Models are Strong Video Inpainter
Minhyeok Lee
Suhwan Cho
Chajin Shin
Jungho Lee
Sunghun Yang
Sangyoun Lee
VGen
DiffM
47
7
0
21 Aug 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video
  Diffusion Models
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
126
277
0
17 Jan 2024
AVID: Any-Length Video Inpainting with Diffusion Model
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
42
33
0
06 Dec 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
180
1,019
0
25 Nov 2023
Control-A-Video: Controllable Text-to-Video Generation with Diffusion
  Models
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
121
6
0
23 May 2023
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
256
567
0
29 May 2022
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,796
0
24 Feb 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
372
75,888
0
18 May 2015
1