ResearchTrend.AI
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

29 January 2024
Xiaoyu Shi
Zhaoyang Huang
Fu-Yun Wang
Weikang Bian
Dasong Li
Yuanhang Zhang
Manyuan Zhang
Ka Chun Cheung
Simon See
Hongwei Qin
Jifeng Dai
Hongsheng Li
    VGen
    DiffM

Papers citing "Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling"

36 / 36 papers shown
Title
Temporal Differential Fields for 4D Motion Modeling via Image-to-Video Synthesis
Xin You
Minghui Zhang
Hanxiao Zhang
J. Yang
Nassir Navab
DiffM
VGen
MedIm
181
0
0
22 May 2025
MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition
Takahiro Shirakawa
Tomoyuki Suzuki
Daichi Haraguchi
VGen
74
0
0
03 Apr 2025
SketchVideo: Sketch-based Video Generation and Editing
Feng-Lin Liu
Hongbo Fu
Xintao Wang
Weicai Ye
Pengfei Wan
Di Zhang
Lin Gao
DiffM
VGen
91
0
0
30 Mar 2025
VidTwin: Video VAE with Decoupled Structure and Dynamics
Yuchi Wang
Junliang Guo
Xinyi Xie
Tianyu He
Xu Sun
Li Zhao
DRL
VGen
115
3
0
23 Dec 2024
OSV: One Step is Enough for High-Quality Image to Video Generation
Xiaofeng Mao
Zhengkai Jiang
Fu-Yun Wang
Wenbing Zhu
Hao Chen
Mingmin Chi
Yabiao Wang
Wenhan Luo
DiffM
VGen
96
10
0
17 Sep 2024
Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation
Changgu Chen
Junwei Shu
Lianggangxu Chen
Gaoqi He
Changbo Wang
VGen
31
16
0
18 Jan 2024
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
102
223
0
07 Nov 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
75
151
0
05 Jul 2023
A Unified Conditional Framework for Diffusion-based Image Restoration
Yuanhang Zhang
Xiaoyu Shi
Dasong Li
Xiaogang Wang
Jian Wang
Hongsheng Li
DiffM
65
23
0
31 May 2023
Common Diffusion Noise Schedules and Sample Steps are Flawed
Shanchuan Lin
Bingchen Liu
Jiashi Li
Xiao Yang
DiffM
63
214
0
15 May 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffM
VGen
71
142
0
12 Apr 2023
Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Haomiao Ni
Changhao Shi
Kaican Li
Sharon X. Huang
Martin Renqiang Min
VGen
DiffM
61
173
0
24 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
177
210
0
08 Mar 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
103
4,074
1
10 Feb 2023
TAP-Vid: A Benchmark for Tracking Any Point in a Video
Carl Doersch
Ankush Gupta
L. Markeeva
Adrià Recasens
Lucas Smaira
Y. Aytar
João Carreira
Andrew Zisserman
Yi Yang
62
162
0
07 Nov 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
100
1,518
0
05 Oct 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
74
1,399
0
29 Sep 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
339
6,830
0
13 Apr 2022
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories
Adam W. Harley
Zhaoyuan Fang
Katerina Fragkiadaki
54
165
0
08 Apr 2022
FlowFormer: A Transformer Architecture for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Ka Chun Cheung
Hongwei Qin
Jifeng Dai
Hongsheng Li
ViT
74
279
0
30 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
345
15,373
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
282
3,571
0
20 Dec 2021
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
241
1,035
0
09 Oct 2021
Motion Representations for Articulated Animation
Aliaksandr Siarohin
Oliver J. Woodford
Jian Ren
Menglei Chai
Sergey Tulyakov
OCL
138
268
0
22 Apr 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
VGen
133
1,172
0
01 Apr 2021
Animating Pictures with Eulerian Motion Fields
Aleksander Holynski
Brian L. Curless
S. M. Seitz
Richard Szeliski
55
61
0
30 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
208
7,294
0
06 Oct 2020
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
Zachary Teed
Jia Deng
MDE
211
2,612
0
26 Mar 2020
Softmax Splatting for Video Frame Interpolation
Simon Niklaus
Feng Liu
130
387
0
11 Mar 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
367
20,053
0
23 Oct 2019
SinGAN: Learning a Generative Model from a Single Natural Image
Tamar Rott Shaham
Tali Dekel
T. Michaeli
GAN
VLM
90
840
0
02 May 2019
A Lightweight Optical Flow CNN - Revisiting Data Fidelity and Regularization
Tak-Wai Hui
Xiaoou Tang
Chen Change Loy
3DPC
59
179
0
15 Mar 2019
Photo Wake-Up: 3D Character Animation from a Single Photo
Chung-Yi Weng
Brian L. Curless
Ira Kemelmacher-Shlizerman
3DH
47
135
0
05 Dec 2018
Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks
Wei Xiong
Wenhan Luo
Lin Ma
Wen Liu
Jiebo Luo
GAN
49
182
0
22 Sep 2017
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
Deqing Sun
Xiaodong Yang
Ming-Yuan Liu
Jan Kautz
3DPC
244
2,441
0
07 Sep 2017
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Eddy Ilg
N. Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
3DPC
221
3,077
0
06 Dec 2016