Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.04993
Cited By
v1
v2 (latest)
MoCoGAN: Decomposing Motion and Content for Video Generation
17 July 2017
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MoCoGAN: Decomposing Motion and Content for Video Generation"
50 / 647 papers shown
Title
RT-GAN: Recurrent Temporal GAN for Adding Lightweight Temporal Consistency to Frame-Based Domain Translation Approaches
Shawn Mathew
Saad Nadeem
Alvin C. Goh
Arie Kaufman
MedIm
118
0
0
02 Oct 2023
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam
A. Miron
Xiaohui Liu
Yongmin Li
DiffM
67
3
0
29 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
130
252
0
29 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffM
VGen
129
223
0
27 Sep 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang
Xinyuan Chen
Xin Ma
Shangchen Zhou
Ziqi Huang
...
Chen Change Loy
Bo Dai
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
DiffM
112
231
0
26 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
59
3
0
23 Sep 2023
What is the Best Automated Metric for Text to Motion Generation?
Jordan Voas
Yili Wang
Qixing Huang
Raymond Mooney
EGVM
125
14
0
19 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
116
167
0
18 Sep 2023
Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model
Yin Wang
Zhiying Leng
Frederick W. B. Li
Shun-cheng Wu
Xiaohui Liang
DiffM
55
60
0
12 Sep 2023
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
Jiapeng Zhu
Ceyuan Yang
Kecheng Zheng
Yinghao Xu
Zifan Shi
Yujun Shen
MoE
97
8
0
07 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
82
48
0
07 Sep 2023
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation
Yuhan Wang
Liming Jiang
Chen Change Loy
VGen
93
15
0
31 Aug 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Ran He
87
12
0
31 Aug 2023
Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art
Tanujit Chakraborty
Ujjwal Reddy K S
Shraddha M. Naik
Madhurima Panja
B. Manvitha
112
72
0
30 Aug 2023
Learning Modulated Transformation in GANs
Ceyuan Yang
Qihang Zhang
Yinghao Xu
Jiapeng Zhu
Yujun Shen
Bo Dai
49
1
0
29 Aug 2023
LAC: Latent Action Composition for Skeleton-based Action Segmentation
Di Yang
Yaohui Wang
A. Dantcheva
Quan Kong
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
81
9
0
28 Aug 2023
Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong
Kehong Gong
Dongze Lian
Michael Bi Mi
Xinchao Wang
DiffM
119
55
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
270
31
0
27 Aug 2023
Hamiltonian GAN
Christine Allen-Blanchette
GAN
AI4CE
66
1
0
22 Aug 2023
Language-guided Human Motion Synthesis with Atomic Actions
Yuanhao Zhai
Mingzhen Huang
Tianyu Luan
Lu Dong
Ifeoma Nwogu
Siwei Lyu
David Doermann
Junsong Yuan
79
13
0
18 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
84
5
0
17 Aug 2023
Real-Time Neural Video Recovery and Enhancement on Mobile Devices
Zhaoyuan He
Yifan Yang
Lili Qiu
Kyoungjun Park
38
2
0
22 Jul 2023
Enabling Real-time Neural Recovery for Cloud Gaming on Mobile Devices
Zhaoyuan He
Yifan Yang
Shuozhe Li
Diyuan Dai
Lili Qiu
Yuqing Yang
80
0
0
15 Jul 2023
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Yin-Yin He
Menghan Xia
Haoxin Chen
Xiaodong Cun
Yuan Gong
...
Yong Zhang
Xintao Wang
Chao-Liang Weng
Ying Shan
Qifeng Chen
DiffM
VGen
63
79
0
13 Jul 2023
GD-VDM: Generated Depth for better Diffusion-based Video Generation
Ariel Lapid
Idan Achituve
Lior Bracha
Ethan Fetaya
DiffM
VGen
130
9
0
19 Jun 2023
Learning Joint Latent Space EBM Prior Model for Multi-layer Generator
Jiali Cui
Ying Nian Wu
Tian Han
79
8
0
10 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
115
30
0
09 Jun 2023
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
A. Davtyan
Paolo Favaro
VGen
56
4
0
06 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen
DiffM
40
14
0
05 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
121
341
0
03 Jun 2023
We never go out of Style: Motion Disentanglement by Subspace Decomposition of Latent Space
Rishubh Parihar
Raghav Magazine
P. Tiwari
R. Venkatesh Babu
DRL
95
1
0
01 Jun 2023
Sample and Predict Your Latent: Modality-free Sequential Disentanglement via Contrastive Estimation
Ilana D Naiman
Nimrod Berman
Omri Azencot
DRL
99
6
0
25 May 2023
DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Tae-Jung Yeom
Minhyeok Lee
DiffM
52
7
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
117
7
0
24 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
77
68
0
22 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffM
VGen
155
117
0
18 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yuan Liu
Yogesh Balaji
DiffM
VGen
125
263
0
17 May 2023
LEO: Generative Latent Image Animator for Human Video Synthesis
Yaohui Wang
Xin Ma
Xinyuan Chen
A. Dantcheva
Bo Dai
Yu Qiao
DiffM
183
33
0
06 May 2023
Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Théophile Cabannes
Shreya Ghosh
Raphaël Marinier
Tom Gedeon
Alexandre M. Bayen
Munawar Hayat
159
29
0
03 May 2023
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Sumith Kulal
Tim Brooks
A. Aiken
Jiajun Wu
Jimei Yang
Jingwan Lu
Alexei A. Efros
Krishna Kumar Singh
DiffM
118
44
0
27 Apr 2023
Motion-Conditioned Diffusion Model for Controllable Video Synthesis
Tsai-Shien Chen
C. Lin
Hung-Yu Tseng
Nayeon Lee
Ming-Hsuan Yang
DiffM
VGen
141
67
0
27 Apr 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
66
10
0
18 Apr 2023
Text2Performer: Text-Driven Human Video Generation
Yuming Jiang
Shuai Yang
Tong Liang Koh
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
VGen
98
52
0
17 Apr 2023
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction Domain
Zhifeng Ma
Hao Zhang
Jie Liu
129
7
0
16 Apr 2023
Video Generation Beyond a Single Clip
Hsin-Ping Huang
Yu-Chuan Su
Ming-Hsuan Yang
VLM
DiffM
VGen
93
3
0
15 Apr 2023
VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs
Moayed Haji-Ali
Andrew Bond
Tolga Birdal
Duygu Ceylan
Levent Karacan
Erkut Erdem
Aykut Erdem
VGen
DiffM
219
2
0
12 Apr 2023
MoStGAN-V: Video Generation with Temporal Motion Styles
Xiaoqian Shen
Xiang Li
Mohamed Elhoseiny
VGen
75
32
0
05 Apr 2023
TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration
Kehong Gong
Dongze Lian
Heng Chang
Chuan Guo
Zihang Jiang
Wei Ji
Michael Bi Mi
Xinchao Wang
114
66
0
05 Apr 2023
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
Mingyuan Zhang
Xinying Guo
Liang Pan
Zhongang Cai
Fangzhou Hong
Huirong Li
Lei Yang
Ziwei Liu
DiffM
VGen
139
172
0
03 Apr 2023
Multifactor Sequential Disentanglement via Structured Koopman Autoencoders
Nimrod Berman
Ilana D Naiman
Omri Azencot
CoGe
89
24
0
30 Mar 2023
Previous
1
2
3
4
5
...
11
12
13
Next