ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.11565
  4. Cited By
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for
  Text-to-Video Generation

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

22 December 2022
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
W. Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
    VGen
ArXivPDFHTML

Papers citing "Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation"

50 / 565 papers shown
Title
DragAnything: Motion Control for Anything using Entity Representation
DragAnything: Motion Control for Anything using Entity Representation
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Tingting Gao
Di Zhang
VGen
84
51
0
12 Mar 2024
Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions
Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions
Lan Wang
Vishnu Naresh Boddeti
Sernam Lim
VGen
DiffM
42
0
0
11 Mar 2024
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention
  Regulation in Diffusion Models
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
Yang Zhang
Teoh Tze Tzun
Lim Wei Hern
Tiviatis Sim
Kenji Kawaguchi
DiffM
32
9
0
11 Mar 2024
Video Generation with Consistency Tuning
Video Generation with Consistency Tuning
Chaoyi Wang
Yaozhe Song
Yafeng Zhang
Jun Pei
Lijie Xia
Jianpo Liu
30
1
0
11 Mar 2024
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video
  Editing
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing
Youyuan Zhang
Xuan Ju
James J. Clark
VGen
DiffM
37
6
0
10 Mar 2024
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video
  Deflickering
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering
Xin Qiu
Congying Han
Zicheng Zhang
Bonan li
Tiande Guo
Pingyu Wang
Xuecheng Nie
47
0
0
10 Mar 2024
VideoElevator: Elevating Video Generation Quality with Versatile
  Text-to-Image Diffusion Models
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang
Yuxiang Wei
Xianhui Lin
Zheng Hui
Peiran Ren
Xuansong Xie
Xiangyang Ji
Wangmeng Zuo
VGen
43
6
0
08 Mar 2024
StereoDiffusion: Training-Free Stereo Image Generation Using Latent
  Diffusion Models
StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models
Lezhong Wang
J. Frisvad
Mark Bo Jensen
Siavash Bigdeli
DiffM
34
10
0
08 Mar 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces
  from Disentangled Audio
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Chao Xu
Yang Liu
Jiazheng Xing
Weida Wang
Mingze Sun
...
Tianxin Huang
Siyuan Li
Zhi-Qi Cheng
Ying Tai
Baigui Sun
CVBM
54
11
0
04 Mar 2024
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Lukas Höllein
Aljavz Bovzivc
Norman Muller
David Novotny
Hung-Yu Tseng
Christian Richardt
Michael Zollhöfer
Matthias Nießner
DiffM
49
39
0
04 Mar 2024
Abductive Ego-View Accident Video Understanding for Safe Driving
  Perception
Abductive Ego-View Accident Video Understanding for Safe Driving Perception
Jianwu Fang
Lei-lei Li
Junfei Zhou
Junbin Xiao
Hongkai Yu
Chen Lv
Jianru Xue
Tat-Seng Chua
37
14
0
01 Mar 2024
Trajectory Consistency Distillation: Improved Latent Consistency
  Distillation by Semi-Linear Consistency Function with Trajectory Mapping
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping
Jianbin Zheng
Minghui Hu
Zhongyi Fan
Chaoyue Wang
Changxing Ding
Dacheng Tao
Tat-Jen Cham
43
27
0
29 Feb 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities
  of Large Vision Models
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
75
260
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
66
86
0
27 Feb 2024
Contextualized Diffusion Models for Text-Guided Image and Video
  Generation
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
44
4
0
26 Feb 2024
ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion
  Models against Stochastic Perturbation
ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang
Yun Tang
Wenjie Ruan
Xiaowei Huang
Siddartha Khastgir
P. Jennings
Xingyu Zhao
AAML
35
4
0
23 Feb 2024
Customize-A-Video: One-Shot Motion Customization of Text-to-Video
  Diffusion Models
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren
Yang Zhou
Jimei Yang
Jing Shi
Difan Liu
Feng Liu
Mingi Kwon
Abhinav Shrivastava
DiffM
VGen
96
34
0
22 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance
  Editing
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
31
26
0
20 Feb 2024
VGMShield: Mitigating Misuse of Video Generative Models
VGMShield: Mitigating Misuse of Video Generative Models
Yan Pang
Yang Zhang
Tianhao Wang
42
3
0
20 Feb 2024
Visual Style Prompting with Swapping Self-Attention
Visual Style Prompting with Swapping Self-Attention
Jaeseok Jeong
Junho Kim
Yunjey Choi
Gayoung Lee
Youngjung Uh
DiffM
40
40
0
20 Feb 2024
Human Video Translation via Query Warping
Human Video Translation via Query Warping
Haiming Zhu
Yangyang Xu
Shengfeng He
DiffM
49
0
0
19 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for
  Higher-Resolution Adaptation
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
43
23
0
16 Feb 2024
Magic-Me: Identity-Specific Video Customized Diffusion
Magic-Me: Identity-Specific Video Customized Diffusion
Ze Ma
Daquan Zhou
Chun-Hsiao Yeh
Xue-She Wang
Xiuyu Li
Huanrui Yang
Zhen Dong
Kurt Keutzer
Jiashi Feng
VGen
DiffM
40
31
0
14 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffM
VGen
93
54
0
06 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera
  Movement and Object Motion
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGen
DiffM
66
77
0
05 Feb 2024
NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties
NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties
Jingyuan Sun
Mingxiao Li
Zijiao Chen
Marie-Francine Moens
VGen
36
7
0
02 Feb 2024
A Survey on Generative AI and LLM for Video Generation, Understanding,
  and Streaming
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
46
26
0
30 Jan 2024
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion
  Models
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
Feihong He
Gang Li
Mengyuan Zhang
Leilei Yan
Hui Xiong
Fanzhang Li
Li Shen
DiffM
26
15
0
28 Jan 2024
Do You Guys Want to Dance: Zero-Shot Compositional Human Dance
  Generation with Multiple Persons
Do You Guys Want to Dance: Zero-Shot Compositional Human Dance Generation with Multiple Persons
Zhe Xu
Kun-Juan Wei
Xu Yang
Cheng Deng
DiffM
28
4
0
24 Jan 2024
ActAnywhere: Subject-Aware Video Background Generation
ActAnywhere: Subject-Aware Video Background Generation
Boxiao Pan
Zhan Xu
Chun-Hao Paul Huang
Krishna Kumar Singh
Yang Zhou
Leonidas J. Guibas
Jimei Yang
VGen
DiffM
29
3
0
19 Jan 2024
Inflation with Diffusion: Efficient Temporal Adaptation for
  Text-to-Video Super-Resolution
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Xin Yuan
Jinoo Baek
Keyang Xu
Omer Tov
Hongliang Fei
VGen
37
3
0
18 Jan 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language
  Models
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu
Xiangtai Li
Chenyang Si
Shangchen Zhou
Jingkang Yang
...
Yining Li
Kai Chen
Yunhai Tong
Ziwei Liu
Chen Change Loy
VGen
DiffM
MLLM
41
17
0
18 Jan 2024
Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation
Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation
Changgu Chen
Junwei Shu
Lianggangxu Chen
Gaoqi He
Changbo Wang
VGen
22
14
0
18 Jan 2024
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Zhao Wang
Aoxue Li
Lingting Zhu
Yong Guo
Qi Dou
Zhenguo Li
VGen
DiffM
35
40
0
18 Jan 2024
Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in
  High-Resolution RS Imagery
Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery
Jia Jia
Geunho Lee
Zhibo Wang
Zhi Lyu
Yuchu He
DiffM
37
7
0
17 Jan 2024
UniVG: Towards UNIfied-modal Video Generation
UniVG: Towards UNIfied-modal Video Generation
Ludan Ruan
Lei Tian
Chuanwei Huang
Xu Zhang
Xinyan Xiao
VGen
DiffM
34
3
0
17 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
34
5
0
16 Jan 2024
Towards A Better Metric for Text-to-Video Generation
Towards A Better Metric for Text-to-Video Generation
Jay Zhangjie Wu
Guian Fang
Haoning Wu
Xintao Wang
Yixiao Ge
...
Rui Zhao
Weisi Lin
Wynne Hsu
Ying Shan
Mike Zheng Shou
VGen
37
34
0
15 Jan 2024
360DVD: Controllable Panorama Video Generation with 360-Degree Video
  Diffusion Model
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Qian Wang
Weiqi Li
Chong Mou
Xinhua Cheng
Jian Zhang
VGen
53
17
0
12 Jan 2024
Object-Centric Diffusion for Efficient Video Editing
Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya
Adil Karjauv
Davide Abati
Fatih Porikli
Yuki M. Asano
A. Habibian
VGen
40
12
0
11 Jan 2024
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
E. Peruzzo
Vidit Goel
Dejia Xu
Xingqian Xu
Yi Ding
Zhangyang Wang
Humphrey Shi
N. Sebe
LM&Ro
VGen
DiffM
64
9
0
04 Jan 2024
Preserving Image Properties Through Initializations in Diffusion Models
Preserving Image Properties Through Initializations in Diffusion Models
Jeffrey Zhang
Shao-Yu Chang
Kedan Li
David Forsyth
DiffM
14
6
0
04 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with
  Multimodal Conditions
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
22
23
0
03 Jan 2024
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated
  by AI
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI
Fanda Fan
Chunjie Luo
Wanling Gao
Jianfeng Zhan
85
15
0
03 Jan 2024
GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for
  One-shot Generalizable Neural Radiance Fields
GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields
X. Pan
Zongxin Yang
Shuai Bai
Yi Yang
DiffM
OffRL
30
1
0
01 Jan 2024
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
W. Ma
J. P. Lewis
W. Kleijn
DiffM
VGen
24
34
0
31 Dec 2023
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video
  Synthesis
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Feng Liang
Bichen Wu
Jialiang Wang
Licheng Yu
Kunpeng Li
...
Ishan Misra
Jia-Bin Huang
Peizhao Zhang
Peter Vajda
Diana Marculescu
VGen
DiffM
40
32
0
29 Dec 2023
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
Yuyang Yin
Dejia Xu
Zhangyang Wang
Yao-Min Zhao
Yunchao Wei
3DGS
57
72
0
28 Dec 2023
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
DiffM
VGen
33
26
0
25 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in
  Text-to-Image Models
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
36
27
0
21 Dec 2023
Previous
123...678...101112
Next