Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.04993
Cited By
v1
v2 (latest)
MoCoGAN: Decomposing Motion and Content for Video Generation
17 July 2017
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MoCoGAN: Decomposing Motion and Content for Video Generation"
50 / 647 papers shown
Title
4D Facial Expression Diffusion Model
K. Zou
S. Faisan
Boyang Yu
S. Valette
Hyewon Seo
81
12
0
29 Mar 2023
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation
Jiawei Liu
Weining Wang
Sihan Chen
Xinxin Zhu
Qingbin Liu
DiffM
VGen
84
14
0
29 Mar 2023
Information-Theoretic GAN Compression with Variational Energy-based Model
Minsoo Kang
Hyewon Yoo
Eunhee Kang
Sehwan Ki
Hyong-Euk Lee
Bohyung Han
GAN
72
3
0
28 Mar 2023
Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu
Chuan Wen
Weirui Ye
Jiaming Song
Yang Gao
DiffM
VGen
64
43
0
27 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
54
1
0
24 Mar 2023
Persistent Nature: A Generative Model of Unbounded 3D Worlds
Lucy Chai
Richard Tucker
Zhengqi Li
Phillip Isola
Noah Snavely
VGen
93
31
0
23 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffM
VGen
146
262
0
22 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
83
137
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
186
170
0
21 Mar 2023
CoopInit: Initializing Generative Adversarial Networks via Cooperative Learning
Yang Zhao
Jianwen Xie
Ping Li
GAN
115
3
0
21 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
77
3
0
20 Mar 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Zhengxiong Luo
Dayou Chen
Yingya Zhang
Yan Huang
Liangsheng Wang
Yujun Shen
Deli Zhao
Jinren Zhou
Tien-Ping Tan
DiffM
VGen
220
322
0
15 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model
Wendong Zhang
Wendong Zhang
Geng Chen
Siyu Gao
Yunbo Wang
Xiaokang Yang
Xiaokang Yang
CLL
95
3
0
12 Mar 2023
Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE
Yucheng Xu
Nanbo Li
A. Goel
Zijian Guo
Zonghai Yao
Hamidreza Kasaei
Mohammad-Sajad Kasaei
Zhibin Li
116
5
0
09 Mar 2023
Pedestrian Attribute Editing for Gait Recognition and Anonymization
Jingzhe Ma
Dingqiang Ye
Chao Fan
Shiqi Yu
CVBM
97
5
0
09 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
213
221
0
08 Mar 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
M. Sun
Weining Wang
Xinxin Zhu
Jing Liu
95
14
0
07 Mar 2023
MotionVideoGAN: A Novel Video Generator Based on the Motion Space Learned from Image Pairs
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
VGen
62
4
0
06 Mar 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
109
4
0
26 Feb 2023
Video Probabilistic Diffusion Models in Projected Latent Space
Sihyun Yu
Kihyuk Sohn
Subin Kim
Jinwoo Shin
VGen
DiffM
103
172
0
15 Feb 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM
VGen
191
539
0
06 Feb 2023
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
Yuhta Takida
Masaaki Imaizumi
Takashi Shibuya
Chieh-Hsin Lai
Toshimitsu Uesaka
Naoki Murata
Yuki Mitsufuji
GAN
109
13
0
30 Jan 2023
Audio2Gestures: Generating Diverse Gestures from Audio
Jing Li
Di Kang
Wenjie Pei
Xuefei Zhe
Ying Zhang
Linchao Bao
Zhenyu He
DiffM
SLR
84
8
0
17 Jan 2023
T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations
Jianrong Zhang
Yangsong Zhang
Xiaodong Cun
Shaoli Huang
Yong Zhang
Hongwei Zhao
Hongtao Lu
Xiaodong Shen
149
358
0
15 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
104
36
0
10 Jan 2023
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Michal Stypulkowski
Konstantinos Vougioukas
Sen He
Maciej Ziȩba
Stavros Petridis
Maja Pantic
DiffM
85
127
0
06 Jan 2023
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction
Chaofan Ling
Junpei Zhong
Wei-Hong Li
78
3
0
22 Dec 2022
Face Generation and Editing with StyleGAN: A Survey
Andrew Melnik
Maksim Miasayedzenkau
Dzianis Makaravets
Dzianis Pirshtuk
Eren Akbulut
Dennis Holzmann
Tarek Renusch
Gustav Reichert
Helge J. Ritter
CVBM
81
43
0
18 Dec 2022
Towards Smooth Video Composition
Qihang Zhang
Ceyuan Yang
Yujun Shen
Yinghao Xu
Bolei Zhou
VGen
87
14
0
14 Dec 2022
PV3D: A 3D Generative Model for Portrait Video Generation
Eric Xu
Jianfeng Zhang
Jun Hao Liew
Wenqing Zhang
Song Bai
Jiashi Feng
Mike Zheng Shou
VGen
84
21
0
13 Dec 2022
Video Prediction by Efficient Transformers
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
100
35
0
12 Dec 2022
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection
Junke Wang
Zhenxin Li
Chao Zhang
Jingjing Chen
Zuxuan Wu
Larry S. Davis
Yueping Jiang
AAML
78
6
0
12 Dec 2022
MAGVIT: Masked Generative Video Transformer
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffM
VGen
121
248
0
10 Dec 2022
MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Rishabh Dabral
Muhammad Hamza Mughal
Vladislav Golyanik
Christian Theobalt
DiffM
VGen
111
183
0
08 Dec 2022
VIDM: Video Implicit Diffusion Models
Kangfu Mei
Vishal M. Patel
DiffM
VGen
104
82
0
01 Dec 2022
CLIP2GAN: Towards Bridging Text with the Latent Space of GANs
Yixuan Wang
Wen-gang Zhou
Jianmin Bao
Weilun Wang
Li Li
Houqiang Li
GAN
CLIP
62
6
0
28 Nov 2022
Deep Fake Detection, Deterrence and Response: Challenges and Opportunities
Amin Azmoodeh
Ali Dehghantanha
83
3
0
26 Nov 2022
Efficient Video Prediction via Sparsely Conditioned Flow Matching
A. Davtyan
Sepehr Sameni
Paolo Favaro
VGen
DiffM
102
31
0
26 Nov 2022
WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction
G. L. Moing
Jean Ponce
Cordelia Schmid
76
6
0
25 Nov 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
132
71
0
23 Nov 2022
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yin-Yin He
Tianyu Yang
Yong Zhang
Ying Shan
Qifeng Chen
DiffM
VGen
112
243
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-Jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
148
38
0
23 Nov 2022
SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Siyuan Li
Stan Z. Li
VLM
AI4TS
100
3
0
22 Nov 2022
SinFusion: Training Diffusion Models on a Single Image or Video
Yaniv Nikankin
Niv Haim
Michal Irani
VGen
106
71
0
21 Nov 2022
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM
VGen
131
390
0
20 Nov 2022
Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
75
23
0
14 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
62
12
0
14 Nov 2022
SSGVS: Semantic Scene Graph-to-Video Synthesis
Yuren Cong
Jinhui Yi
Bodo Rosenhahn
M. Yang
135
8
0
11 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
90
1
0
05 Nov 2022
Autoregressive GAN for Semantic Unconditional Head Motion Generation
Louis Airale
Xavier Alameda-Pineda
Stéphane Lathuilière
Dominique Vaufreydaz
61
3
0
02 Nov 2022
Previous
1
2
3
4
5
6
...
11
12
13
Next