ResearchTrend.AI
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

22 December 2022
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
W. Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
    VGen
arXiv: 2212.11565

Papers citing "Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation"

50 / 565 papers shown
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
89
52
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
57
116
0
16 Oct 2023
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu
Yan-Pei Cao
Jay Zhangjie Wu
Weijia Mao
Yuchao Gu
Rui Zhao
Jussi Keppo
Ying Shan
Mike Zheng Shou
VGen
DiffM
39
15
0
16 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
38
7
0
15 Oct 2023
R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
Jiayu Xiao
Henglei Lv
Liang Li
Shuhui Wang
Qingming Huang
DiffM
35
20
0
13 Oct 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
25
103
0
12 Oct 2023
Consistent123: Improve Consistency for One Image to 3D Object Synthesis
Haohan Weng
Tianyu Yang
Jianan Wang
Yu Li
Tong Zhang
Cheng Chen
Lei Zhang
DiffM
23
72
0
12 Oct 2023
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
Bo Peng
Xinyuan Chen
Yaohui Wang
Chaochao Lu
Yu Qiao
DiffM
VGen
27
7
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
33
102
0
11 Oct 2023
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Wangbo Yu
Li-ming Yuan
Yan-Pei Cao
Xiangjun Gao
Xiaoyu Li
Wenbo Hu
Long Quan
Ying Shan
Yonghong Tian
DiffM
42
29
0
10 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
33
74
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
32
1
0
09 Oct 2023
MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao
Kai Chen
Enze Xie
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
Qiang Xu
DiffM
36
103
0
04 Oct 2023
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Hyeonho Jeong
Jong Chul Ye
DiffM
VGen
35
41
0
02 Oct 2023
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
39
391
0
30 Sep 2023
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam
A. Miron
Xiaohui Liu
Yongmin Li
DiffM
26
3
0
29 Sep 2023
LLM-grounded Video Diffusion Models
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
22
54
0
29 Sep 2023
RealFill: Reference-Driven Generation for Authentic Image Completion
Luming Tang
Nataniel Ruiz
Qinghao Chu
Yuanzhen Li
Aleksander Holynski
...
Bharath Hariharan
Yael Pritch
Neal Wadhwa
Kfir Aberman
Michael Rubinstein
DiffM
18
43
0
28 Sep 2023
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Danfeng Hong
Wenming Weng
Hao Li
Yuhui Yuan
Jing Yao
Chong Luo
Zhibo Chen
Baining Guo
DiffM
VGen
21
42
0
28 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffM
VGen
27
214
0
27 Sep 2023
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
Hanzhuo Huang
Yufan Feng
Cheng Shi
Lan Xu
Jingyi Yu
Sibei Yang
DiffM
VGen
23
64
0
25 Sep 2023
PIE: Simulating Disease Progression via Progressive Image Editing
Kaizhao Liang
Xu Cao
Kuei-Da Liao
Tianren Gao
Wenqian Ye
Zhengyu Chen
Jianguo Cao
Tejas Nama
Jimeng Sun
MedIm
AI4CE
21
5
0
21 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
45
132
0
20 Sep 2023
Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Kashun Shum
Jaeyeon Kim
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
3DH
AI4CE
21
7
0
20 Sep 2023
A Generative Framework for Self-Supervised Facial Representation Learning
Ruian He
Zhen Xing
Weimin Tan
Bo Yan
DiffM
26
0
0
15 Sep 2023
Measuring the Quality of Text-to-Video Model Outputs: Metrics and Dataset
Iya Chivileva
Philip Lynch
Tomás E. Ward
Alan F. Smeaton
EGVM
21
15
0
14 Sep 2023
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
46
457
0
11 Sep 2023
Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang
Wenhao Chai
Jiayi Ye
Dapeng Tao
Yibing Zhan
Gaoang Wang
DiffM
20
15
0
07 Sep 2023
MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation
Hanshu Yan
Jun Hao Liew
Long Mai
Shanchuan Lin
Jiashi Feng
VGen
DiffM
28
14
0
02 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Qi Zhang
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
45
52
0
01 Sep 2023
MagicEdit: High-Fidelity and Temporally Coherent Video Editing
Jun Hao Liew
Hanshu Yan
Jianfeng Zhang
Zhongcong Xu
Jiashi Feng
VGen
DiffM
38
52
0
28 Aug 2023
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
Hao Fei
Shengqiong Wu
Wei Ji
Hanwang Zhang
Tat-Seng Chua
VGen
DiffM
21
32
0
26 Aug 2023
EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Yutao Chen
Xingning Dong
Tian Gan
Chunluan Zhou
Ming Yang
Qingpei Guo
DiffM
25
5
0
21 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
34
13
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
35
81
0
18 Aug 2023
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai
Xun Guo
Gaoang Wang
Yang Lu
VGen
DiffM
27
147
0
18 Aug 2023
Edit Temporal-Consistent Videos with Image Diffusion Model
Yuan-Zheng Wang
Yong Li
Xiaoya Zhang
Xin Liu
Anbo Dai
Antoni B. Chan
Zhen Cui
DiffM
33
6
0
17 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
46
81
0
15 Aug 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin
Wentao Ye
Qifan Yu
Siliang Tang
Yueting Zhuang
DiffM
VGen
29
13
0
15 Aug 2023
DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis
Zhongjie Duan
Lizhou You
Chengyu Wang
Cen Chen
Ziheng Wu
Weining Qian
Jun Huang
DiffM
29
8
0
07 Aug 2023
BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Kairui Yang
Enhui Ma
Jibing Peng
Qing-Wu Guo
Di Lin
Kaicheng Yu
DiffM
31
57
0
03 Aug 2023
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu
Dong Xu
DiffM
VGen
41
65
0
26 Jul 2023
InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing
Anant Khandelwal
DiffM
VGen
31
14
0
22 Jul 2023
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer
Omer Bar-Tal
Shai Bagon
Tali Dekel
VGen
DiffM
20
250
0
19 Jul 2023
Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis
Lingting Zhu
Zeyue Xue
Zhenchao Jin
Xian Liu
Jingzhen He
Ziwei Liu
Lequan Yu
DiffM
MedIm
32
33
0
19 Jul 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Yi Wang
Yinan He
Yizhuo Li
Kunchang Li
Jiashuo Yu
...
Ping Luo
Ziwei Liu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
33
244
0
13 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
25
782
0
10 Jul 2023
A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision
Zhonghan Zhao
Wenhao Chai
Shengyu Hao
Wenhao Hu
Guanhong Wang
Shidong Cao
Min-Gyoo Song
Lei Li
Gaoang Wang
35
17
0
07 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
47
141
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
29
21
0
04 Jul 2023