ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.03458
  4. Cited By
Video Diffusion Models

Video Diffusion Models

7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
    DiffM
    VGen
ArXivPDFHTML

Papers citing "Video Diffusion Models"

50 / 1,186 papers shown
Title
SceneDiffuser: Efficient and Controllable Driving Simulation
  Initialization and Rollout
SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout
C. Jiang
Yijing Bai
Andre Cornman
Christopher Davis
Xiukun Huang
...
Carlos Fuertes
Chang Yuan
Mingxing Tan
Yin Zhou
Dragomir Anguelov
82
14
0
05 Dec 2024
Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Yuyang Wang
Anurag Ranjan
J. Susskind
Miguel Angel Bautista
3DPC
81
0
0
05 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
He Zhang
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen
DiffM
78
0
0
04 Dec 2024
Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression
Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression
Junjie Wen
Minjie Zhu
Yichen Zhu
Zhibin Tang
Jinming Li
...
Chengmeng Li
Xiaoyu Liu
Yaxin Peng
Chaomin Shen
Feifei Feng
88
15
0
04 Dec 2024
World-consistent Video Diffusion with Explicit 3D Modeling
World-consistent Video Diffusion with Explicit 3D Modeling
Qihang Zhang
Shuangfei Zhai
Miguel Angel Bautista
Kevin Miao
Alexander Toshev
J. Susskind
Jiatao Gu
VGen
83
8
0
02 Dec 2024
LoyalDiffusion: A Diffusion Model Guarding Against Data Replication
LoyalDiffusion: A Diffusion Model Guarding Against Data Replication
Chenghao Li
Yuke Zhang
Dake Chen
Jingqi Xu
P. Beerel
71
0
0
02 Dec 2024
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation
Zilyu Ye
Zhiyang Chen
Tiancheng Li
Zemin Huang
Weijian Luo
Guo-jun Qi
DiffM
83
5
0
02 Dec 2024
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
Yatian Pang
Bin Zhu
Bin Lin
Mingzhe Zheng
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
VGen
3DH
79
4
0
30 Nov 2024
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via
  Online Restoration
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Chaojun Ni
Guosheng Zhao
Xiaofeng Wang
Zheng Hua Zhu
Wenkang Qin
...
Kun Zhan
Peng Jia
Xianpeng Lang
Xingang Wang
Wenjun Mei
VGen
154
6
0
29 Nov 2024
Deepfake Media Generation and Detection in the Generative AI Era: A
  Survey and Outlook
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Florinel-Alin Croitoru
Andrei Iulian Hiji
Vlad Hondru
Nicolae-Cătălin Ristea
Paul Irofti
Marius Popescu
Cristian Rusu
Radu Tudor Ionescu
Fahad Shahbaz Khan
Mubarak Shah
89
3
0
29 Nov 2024
AerialGo: Walking-through City View Generation from Aerial Perspectives
Fuqiang Zhao
Yijing Guo
Siyuan Yang
Xi Chen
Luo Wang
Lan Xu
Yuyao Zhang
Yujiao Shi
Jingyi Yu
74
0
0
29 Nov 2024
Motion Modes: What Could Happen Next?
Karran Pandey
Matheus Gadelha
Yannick Hold-Geoffroy
Karan Singh
Niloy J. Mitra
Paul Guerrero
VGen
DiffM
88
2
0
29 Nov 2024
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
Rongkun Xue
Jinouwen Zhang
Yazhe Niu
Dazhong Shen
Bingqi Ma
Yu Liu
Jing Yang
80
0
0
29 Nov 2024
Track Anything Behind Everything: Zero-Shot Amodal Video Object
  Segmentation
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation
Finlay G. C. Hudson
W. Smith
VOS
VLM
76
0
0
28 Nov 2024
SPAgent: Adaptive Task Decomposition and Model Selection for General
  Video Generation and Editing
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
Rong-Cheng Tu
Wenhao Sun
Zhao Jin
Jingyi Liao
Jiaxing Huang
Dacheng Tao
VGen
DiffM
103
3
0
28 Nov 2024
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li
Mingwang Xu
Yun Zhan
Shan Mu
Jiaye Li
...
Y. Chen
Tan Chen
Mao Ye
Jingdong Wang
Siyu Zhu
VGen
106
2
0
28 Nov 2024
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Feng Liu
Shiwei Zhang
Xiaofeng Wang
Yujie Wei
Haonan Qiu
Yuzhong Zhao
Yingya Zhang
Qixiang Ye
Fang Wan
VGen
AI4TS
99
11
0
28 Nov 2024
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling
J. Hyung
Kinam Kim
Susung Hong
M. Kim
Jaegul Choo
VGen
90
3
0
27 Nov 2024
Individual Content and Motion Dynamics Preserved Pruning for Video
  Diffusion Models
Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Yiming Wu
Huan Wang
Zhenghao Chen
Dong Xu
DiffM
VGen
84
1
0
27 Nov 2024
MotionCharacter: Identity-Preserving and Motion Controllable Human Video
  Generation
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation
Haopeng Fang
Di Qiu
Binjie Mao
Pengfei Yan
He Tang
VGen
DiffM
72
4
0
27 Nov 2024
Scene Co-pilot: Procedural Text to Video Generation with Human in the
  Loop
Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop
Zhaofang Qian
Abolfazl Sharifi
Tucker Carroll
Ser-Nam Lim
VGen
79
0
0
26 Nov 2024
Privacy Protection in Personalized Diffusion Models via Targeted
  Cross-Attention Adversarial Attack
Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack
Xide Xu
Muhammad Atif Butt
Sandesh Kamath
Bogdan Raducanu
DiffM
AAML
77
1
0
25 Nov 2024
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Xiaozhong Ji
Xiaobin Hu
Zhihong Xu
Junwei Zhu
Chuming Lin
...
Donghao Luo
Yi Chen
Qin Lin
Qinglin Lu
Chengjie Wang
VGen
81
4
0
25 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
122
1
0
25 Nov 2024
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Chenjie Cao
Chaohui Yu
Shang Liu
Fan Wang
Xiangyang Xue
Yanwei Fu
94
1
0
25 Nov 2024
Frequency-Guided Posterior Sampling for Diffusion-Based Image
  Restoration
Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration
D. Thaker
Abhishek Goyal
René Vidal
DiffM
68
0
0
22 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
91
0
0
22 Nov 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
122
1
0
22 Nov 2024
Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge
Qinglong Cao
Ding Wang
Xirui Li
Yuntian Chen
Chao Ma
Xiaokang Yang
DiffM
VGen
118
2
0
18 Nov 2024
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv
Yangqi Long
Congzhentao Huang
Cao Li
Chengfei Lv
Hao Ren
Dian Zheng
DiffM
VGen
MDE
114
5
0
18 Nov 2024
Jailbreak Attacks and Defenses against Multimodal Generative Models: A
  Survey
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
Xuannan Liu
Xing Cui
Peipei Li
Zekun Li
Huaibo Huang
Shuhan Xia
Miaoxuan Zhang
Yueying Zou
Ran He
AAML
67
8
0
14 Nov 2024
Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply
  Better Samples
Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples
Noël Vouitsis
Rasa Hosseinzadeh
Brendan Leigh Ross
Valentin Villecroze
S. Gorti
Jesse C. Cresswell
G. Loaiza-Ganem
DiffM
48
0
0
13 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
63
1
0
12 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian
  Diffusion Models
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia
:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Shuran Song
Fangyin Wei
Yu Zeng
Qinsheng Zhang
58
6
0
11 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
65
13
0
07 Nov 2024
Generating Synthetic Electronic Health Record (EHR) Data: A Review with
  Benchmarking
Generating Synthetic Electronic Health Record (EHR) Data: A Review with Benchmarking
Xingran Chen
Zhenke Wu
Xu Shi
Hyunghoon Cho
Bhramar Mukherjee
SyDa
33
1
0
06 Nov 2024
Boosting Latent Diffusion with Perceptual Objectives
Boosting Latent Diffusion with Perceptual Objectives
Tariq Berrada
Pietro Astolfi
Jakob Verbeek
Melissa Hall
Marton Havasi
M. Drozdzal
Yohann Benchetrit
Adriana Romero Soriano
Karteek Alahari
48
0
0
06 Nov 2024
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation
Hao Phung
Quan Dao
T. Dao
Hoang Phan
Dimitris Metaxas
Anh Tran
Mamba
67
4
0
06 Nov 2024
Pre-trained Visual Dynamics Representations for Efficient Policy
  Learning
Pre-trained Visual Dynamics Representations for Efficient Policy Learning
Hao Luo
Bohan Zhou
Zongqing Lu
30
1
0
05 Nov 2024
Exploring the Interplay Between Video Generation and World Models in
  Autonomous Driving: A Survey
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Yuqing Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
59
2
0
05 Nov 2024
Bridge-IF: Learning Inverse Protein Folding with Markov Bridges
Bridge-IF: Learning Inverse Protein Folding with Markov Bridges
Yiheng Zhu
Jialu Wu
Yue Liu
Jiahuan Yan
Mingze Yin
Wei Wu
Mingyang Li
Jieping Ye
Zehua Wang
Jian Wu
41
4
0
04 Nov 2024
Optical Flow Representation Alignment Mamba Diffusion Model for Medical
  Video Generation
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
Zhenwei Zhang
VGen
MedIm
62
1
0
03 Nov 2024
Denoising Fisher Training For Neural Implicit Samplers
Denoising Fisher Training For Neural Implicit Samplers
Weijian Luo
Wei Deng
36
0
0
03 Nov 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
Zheng Zhan
Yushu Wu
Yifan Gong
Zichong Meng
Zhenglun Kong
Changdi Yang
Geng Yuan
Pu Zhao
Wei Niu
Yanzhi Wang
VGen
44
4
0
02 Nov 2024
Conditional Synthesis of 3D Molecules with Time Correction Sampler
Conditional Synthesis of 3D Molecules with Time Correction Sampler
Hojung Jung
Youngrok Park
Laura Schmid
Jaehyeong Jo
Dongkyu Lee
Bongsang Kim
Se-Young Yun
Jinwoo Shin
DiffM
43
4
0
01 Nov 2024
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding
  and Conditioning
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Penghui Ruan
Pichao Wang
Divya Saxena
Jiannong Cao
Yuhui Shi
DiffM
VGen
36
78
0
31 Oct 2024
Accelerated AI Inference via Dynamic Execution Methods
Accelerated AI Inference via Dynamic Execution Methods
Haim Barad
Jascha Achterberg
Tien Pei Chou
Jean Yu
31
0
0
30 Oct 2024
Video prediction using score-based conditional density estimation
Video prediction using score-based conditional density estimation
P. Fiquet
Eero P. Simoncelli
AI4TS
21
0
0
30 Oct 2024
Investigating Memorization in Video Diffusion Models
Investigating Memorization in Video Diffusion Models
Chong Chen
Enhuai Liu
Daochang Liu
M. Shah
Chang Xu
VGen
DiffM
83
1
0
29 Oct 2024
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
Hanyu Wang
Saksham Suri
Yixuan Ren
Hao Chen
Abhinav Shrivastava
VGen
31
10
0
28 Oct 2024
Previous
123...567...222324
Next