Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.02612
Cited By
v1
v2
v3 (latest)
Generating Videos with Scene Dynamics
8 September 2016
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
GAN
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generating Videos with Scene Dynamics"
50 / 746 papers shown
Title
RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Dong Chen
Mohammad Altillawi
Z. Liu
Gitta Kutyniok
VGen
39
0
0
10 Jun 2025
Motion aware video generative model
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
DiffM
VGen
29
0
0
02 Jun 2025
Fostering Video Reasoning via Next-Event Prediction
Haonan Wang
Hongfu Liu
Xiangyan Liu
C. Du
Kenji Kawaguchi
Ye Wang
Tianyu Pang
AI4TS
LRM
85
0
0
28 May 2025
gen2seg: Generative Models Enable Generalizable Instance Segmentation
Om Khangaonkar
Hamed Pirsiavash
DiffM
VLM
149
0
0
21 May 2025
The Spotlight Resonance Method: Resolving the Alignment of Embedded Activations
George Bird
76
0
0
09 May 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
154
1
0
10 Apr 2025
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
478
0
0
08 Apr 2025
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Zhengcong Fei
Didong Li
Di Qiu
Jiadong Wang
Yikun Dou
...
Jinfeng Xu
Mingyuan Fan
Guibin Chen
Yang Li
Yahui Zhou
DiffM
VGen
138
10
0
03 Apr 2025
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
Chi-Pin Huang
Yen-Siang Wu
Hung-Kai Chung
Kai-Po Chang
Fu-En Yang
Yu-Jie Wang
DiffM
VGen
110
1
0
27 Mar 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
T. Liu
Z. Huang
Zhaoxi Chen
Guangcong Wang
Shoukang Hu
Liao Shen
Huiqiang Sun
Z. Cao
Wei Li
Ziwei Liu
VGen
3DGS
131
1
0
26 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
110
4
0
25 Mar 2025
Joint Self-Supervised Video Alignment and Action Segmentation
Ali Shah Ali
Syed Ahmed Mahmood
Mubin Saeed
Andrey Konin
M. Zia
Quoc-Huy Tran
OT
108
0
0
21 Mar 2025
WonderVerse: Extendable 3D Scene Generation with Video Generative Models
Hao Feng
Zhi Zuo
Jia-Hui Pan
Ka-Hei Hui
Yihua Shao
Qi Dou
Wei Xie
Zhengzhe Liu
VGen
158
1
0
12 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
119
0
0
18 Feb 2025
Object-Centric Image to Video Generation with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
DiffM
VGen
OCL
257
1
0
17 Feb 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
Yuanyang Yin
Yaqi Zhao
Mingwu Zheng
Ke Lin
Jiarong Ou
...
Pengfei Wan
Di Zhang
Baoqun Yin
Wentao Zhang
Kun Gai
205
3
0
03 Jan 2025
DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network
Xiangtian Li
Xiaobo Wang
Zhen Qi
Han Cao
Zhaoyang Zhang
Ao Xiang
GAN
TTA
117
2
0
22 Dec 2024
Do Language Models Understand Time?
Xi Ding
Lei Wang
348
2
0
18 Dec 2024
DINO-Foresight
\texttt{DINO-Foresight}
DINO-Foresight
: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
158
3
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
109
0
0
16 Dec 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xuelong Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGen
DiffM
183
1
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
425
3
0
14 Dec 2024
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder
Maheswar Bora
Saurabh Atreya
Aritra Mukherjee
Abhijit Das
140
0
0
19 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
183
0
0
12 Nov 2024
Asymptotic Analysis of Sample-averaged Q-learning
Saunak Kumar Panda
Ruiqi Liu
Yisha Xiang
OnRL
134
0
0
14 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
165
35
0
03 Oct 2024
Advancing Video Quality Assessment for AIGC
Xinli Yue
Jianhui Sun
Han Kong
Liangchao Yao
Tianyi Wang
...
Jing Lv
Fan Xia
Yuetang Deng
Qian Wang
Lingchen Zhao
VGen
EGVM
82
0
0
23 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
163
9
0
23 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
66
0
0
21 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen
3DV
49
1
0
14 Sep 2024
Real-Time Video Generation with Pyramid Attention Broadcast
Xuanlei Zhao
Xiaolong Jin
Kai Wang
Yang You
VGen
DiffM
190
45
0
22 Aug 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen
EGVM
69
15
0
23 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
122
47
0
08 Jul 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
121
25
0
26 Jun 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
94
29
0
24 Jun 2024
FacEnhance: Facial Expression Enhancing with Recurrent DDPMs
Hamza Bouzid
Lahoucine Ballihi
DiffM
85
1
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
147
2
0
13 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
104
3
0
11 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
47
0
0
05 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Peng Jia
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
114
28
0
03 Jun 2024
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
Zhengang Li
Yan Kang
Yuchen Liu
Difan Liu
Tobias Hinz
Feng Liu
Yanzhi Wang
DiffM
65
1
0
31 May 2024
GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis
Boming Zhao
Yuan Li
Ziyu Sun
Lin Zeng
Yujun Shen
Rui-ya Ma
Yinda Zhang
Hujun Bao
Zhaopeng Cui
3DGS
3DV
73
8
0
30 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
171
103
0
27 May 2024
Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
Hanwen Liang
Yuyang Yin
Dejia Xu
Hanxue Liang
Zhangyang Wang
Konstantinos N. Plataniotis
Yao Zhao
Yunchao Wei
VGen
107
44
0
26 May 2024
Review of Deep Representation Learning Techniques for Brain-Computer Interfaces and Recommendations
Pierre Guetschel
Sara Ahmadi
Michael Tangermann
109
0
0
17 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
136
20
0
17 May 2024
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
Hritik Bansal
Yonatan Bitton
Michal Yarom
Idan Szpektor
Aditya Grover
Kai-Wei Chang
DiffM
111
12
0
07 May 2024
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
162
17
0
05 May 2024
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap
Bowen Qu
Xiaoyu Liang
Shangkun Sun
Wei-Nan Gao
EGVM
126
8
0
21 Apr 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
90
20
0
18 Apr 2024
1
2
3
4
...
13
14
15
Next