Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.14683
Cited By
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
29 December 2021
Ivan Skorokhodov
Sergey Tulyakov
Mohamed Elhoseiny
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2"
50 / 230 papers shown
Title
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffM
VGen
62
0
0
19 Aug 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu
Yuanzhi Liang
Linchao Zhu
Yi Yang
DiffM
VGen
44
27
0
29 Jul 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen
EGVM
52
12
0
23 Jul 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang
Haoxin Chen
Yong Zhang
Menghan Xia
Xiaodong Cun
Zhixun Su
Ying Shan
DiffM
35
1
0
14 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
69
39
0
08 Jul 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
66
22
0
26 Jun 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
37
0
0
23 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
64
2
0
19 Jun 2024
ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models
Kaifeng Gao
Jiaxin Shi
Hanwang Zhang
Chunping Wang
Jun Xiao
DiffM
VGen
67
12
0
16 Jun 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang
Yi-Xin Jiang
Zehuan Yuan
Binyue Peng
Zuxuan Wu
Yu-Gang Jiang
ViT
VGen
78
36
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
72
2
0
13 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
48
10
0
12 Jun 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGen
DiffM
41
9
0
11 Jun 2024
SF-V: Single Forward Video Generation Model
Zhixing Zhang
Yanyu Li
Yushu Wu
Yanwu Xu
Anil Kag
...
Aliaksandr Siarohin
Junli Cao
Dimitris N. Metaxas
Sergey Tulyakov
Jian Ren
DiffM
VGen
45
9
0
06 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Chenfanfu Jiang
Ningyu Zhang
Kai-Wei Chang
Aditya Grover
EGVM
VGen
40
36
0
05 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
36
0
0
05 Jun 2024
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
Zhengang Li
Yan Kang
Yuchen Liu
Difan Liu
Tobias Hinz
Feng Liu
Yanzhi Wang
DiffM
32
1
0
31 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Bo Liu
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen
3DGS
37
13
0
28 May 2024
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation
Shentong Mo
Yapeng Tian
Mamba
76
16
0
24 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
Franccois Fleuret
51
40
0
20 May 2024
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Jihwan Kim
Junoh Kang
Jinyoung Choi
Bohyung Han
DiffM
VGen
69
24
0
19 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
20
0
17 May 2024
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
72
12
0
05 May 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
50
15
0
18 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
93
10
0
18 Apr 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
45
87
0
16 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
TC4D: Trajectory-Conditioned Text-to-4D Generation
Sherwin Bahmani
Xian Liu
Yifan Wang
Ivan Skorokhodov
Victor Rong
...
Jeong Joon Park
Sergey Tulyakov
Gordon Wetzstein
Andrea Tagliasacchi
David B. Lindell
97
35
0
26 Mar 2024
A Survey on Long Video Generation: Challenges, Methods, and Prospects
Chengxuan Li
Di Huang
Zeyu Lu
Yang Xiao
Qingqi Pei
Lei Bai
EGVM
42
20
0
25 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen
DiffM
34
15
0
21 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Zhen Chen
Jing Shao
Yixuan Yuan
VGen
MedIm
80
34
0
17 Mar 2024
Intention-driven Ego-to-Exo Video Generation
Hongcheng Luo
Kai Zhu
Wei Zhai
Yang Cao
DiffM
VGen
40
4
0
14 Mar 2024
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering
Xin Qiu
Congying Han
Zicheng Zhang
Bonan Li
Tiande Guo
Pingyu Wang
Xuecheng Nie
47
0
0
10 Mar 2024
An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Yudong Yang
Rongfeng Su
Xiaokang Liu
Nan Yan
Lan Wang
MedIm
DiffM
19
1
0
09 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGen
DiffM
34
7
0
04 Mar 2024
Boosting Neural Representations for Videos with a Conditional Decoder
Xinjie Zhang
Ren Yang
Dailan He
Xingtong Ge
Tongda Xu
Yan Wang
Hongwei Qin
Jun Zhang
36
15
0
28 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
42
57
0
22 Feb 2024
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
Sungjun Ahn
Hyun-Jeong Yim
Youngwan Lee
Sung-Ik Park
VGen
41
4
0
19 Feb 2024
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen
Chenfei Wu
Xiao Liu
Sheng-Siang Yin
Yixuan Pei
Jinglong Yang
Qifeng Chen
Nan Duan
Jianguo Zhang
60
3
0
16 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffM
VGen
90
54
0
06 Feb 2024
One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM
3DH
45
8
0
05 Feb 2024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Yiyuan Zhang
Yuhao Kang
Zhixin Zhang
Xiaohan Ding
Sanyuan Zhao
Xiangyu Yue
VGen
60
4
0
05 Feb 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
41
26
0
30 Jan 2024
DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations
Dogyun Park
S. Kim
Sojin Lee
Hyunwoo J. Kim
DiffM
38
7
0
23 Jan 2024
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Zhao Wang
Aoxue Li
Lingting Zhu
Yong Guo
Qi Dou
Zhenguo Li
VGen
DiffM
35
40
0
18 Jan 2024
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
35
35
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
120
275
0
17 Jan 2024
RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks
Partha Ghosh
Soubhik Sanyal
Cordelia Schmid
Bernhard Scholkopf
VGen
44
1
0
11 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
125
233
0
05 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
22
23
0
03 Jan 2024
Previous
1
2
3
4
5
Next