Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.04761
Cited By
Video-P2P: Video Editing with Cross-attention Control
8 March 2023
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe-nan Lin
Jiaya Jia
DiffM
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video-P2P: Video Editing with Cross-attention Control"
44 / 44 papers shown
Title
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Jiangtong Tan
Hu Yu
Jie Huang
Jie Xiao
Feng Zhao
67
1
0
02 May 2025
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
Minkyu Choi
Sundar Sripada V. S.
Harsh Goel
Sahil Shah
Sandeep P. Chinchali
DiffM
VGen
86
0
0
24 Apr 2025
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
64
0
0
16 Apr 2025
Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations
Tharun Anand
Siva Sankar
Pravin Nair
AAML
45
0
0
28 Mar 2025
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
H. Iqbal
Nazmul Karim
Umar Khalid
Azib Farooq
Z. Zhong
Jing Hua
Chen Chen
DiffM
3DGS
VGen
45
0
0
14 Mar 2025
Get In Video: Add Anything You Want to the Video
Shaobin Zhuang
Zhipeng Huang
Binxin Yang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Chong Sun
Zheng-Jun Zha
Chen Li
Y. Wang
DiffM
VGen
51
0
0
08 Mar 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
50
0
0
13 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
X. Li
Tao Zhang
Zilong Huang
Shilin Xu
S. Ji
Yunhai Tong
Lu Qi
Jiashi Feng
Ming Yang
VLM
94
11
0
07 Jan 2025
VideoDirector: Precise Video Editing via Text-to-Video Models
Yukun Wang
Longguang Wang
Zhiyuan Ma
Qibin Hu
Kai Xu
Yulan Guo
VGen
DiffM
86
0
0
26 Nov 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
43
3
0
14 Oct 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
31
2
0
19 Sep 2024
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models
Yupeng Chen
Penglin Chen
Xiaoyu Zhang
Yixian Huang
Qian Xie
DiffM
41
1
0
15 Sep 2024
Training-free Camera Control for Video Generation
Chen Hou
Guoqiang Wei
VGen
DiffM
70
30
0
14 Jun 2024
FRAG: Frequency Adapting Group for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Geonwoo Kim
Chang D. Yoo
DiffM
37
5
0
10 Jun 2024
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen
DiffM
39
11
0
26 May 2024
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong
Min Zhao
Zebin You
Xiaofeng Yu
Changwang Zhang
Chongxuan Li
DiffM
29
6
0
23 May 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
34
7
0
29 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffM
VGen
27
13
0
18 Mar 2024
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
36
4
0
26 Feb 2024
Human Video Translation via Query Warping
Haiming Zhu
Yangyang Xu
Shengfeng He
DiffM
36
0
0
19 Feb 2024
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Xiaoyu Shi
Zhaoyang Huang
Fu-Yun Wang
Weikang Bian
Dasong Li
...
Ka Chun Cheung
Simon See
Hongwei Qin
Jifeng Da
Hongsheng Li
VGen
DiffM
37
80
0
29 Jan 2024
Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya
Adil Karjauv
Davide Abati
Fatih Porikli
Yuki M. Asano
A. Habibian
VGen
32
12
0
11 Jan 2024
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
Shutong Jin
Ruiyu Wang
Florian T. Pokorny
DiffM
VGen
81
1
0
19 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
11
89
0
07 Dec 2023
StoryGPT-V: Large Language Models as Consistent Story Visualizers
Xiaoqian Shen
Mohamed Elhoseiny
VLM
90
10
0
04 Dec 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
37
17
0
30 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
27
10
0
29 Nov 2023
Sketch Video Synthesis
Yudian Zheng
Xiaodong Cun
Menghan Xia
Chi-Man Pun
VGen
DiffM
19
2
0
26 Nov 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
34
43
0
07 Sep 2023
MagicEdit: High-Fidelity and Temporally Coherent Video Editing
Jun Hao Liew
Hanshu Yan
Jianfeng Zhang
Zhongcong Xu
Jiashi Feng
VGen
DiffM
25
52
0
28 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
26
13
0
19 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu
Dong Xu
DiffM
VGen
23
64
0
26 Jul 2023
Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields
Shangzhan Zhang
Sida Peng
Yinji ShenTu
Qing Shuai
Tianrun Chen
Kaicheng Yu
Hujun Bao
Xiaowei Zhou
69
7
0
24 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
24
21
0
04 Jul 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
33
88
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
28
31
0
29 May 2023
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Zicheng Zhang
Bonan Li
Xuecheng Nie
Congying Han
Tiande Guo
Luoqi Liu
DiffM
18
24
0
27 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
25
32
0
15 May 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
77
79
0
13 Apr 2023
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
Wen Wang
Yan Jiang
K. Xie
Zide Liu
Hao Chen
Yue Cao
Xinlong Wang
Chunhua Shen
DiffM
VGen
29
112
0
30 Mar 2023
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
245
565
0
29 May 2022
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
303
5,773
0
29 Apr 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,774
0
24 Feb 2021
1