arXiv: 2409.10090
MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior
16 September 2024
Weijing Tao
Xiaofeng Yang
Miaomiao Cui
Guosheng Lin
DiffM
Papers citing "MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior"
17 / 17 papers shown
DivAvatar: Diverse 3D Avatar Generation with a Single Prompt
Weijing Tao
Biwen Lei
Kunhao Liu
Shijian Lu
Miaomiao Cui
Xuansong Xie
Chunyan Miao
DiffM
59
1
0
27 Feb 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
89
133
0
22 Jan 2024
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
Cong Wang
Jiaxi Gu
Panwen Hu
Songcen Xu
Hang Xu
Xiaodan Liang
VGen
59
16
0
05 Dec 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
108
227
0
07 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
60
300
0
30 Oct 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
92
82
0
30 May 2023
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
131
871
0
08 Dec 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
81
1,409
0
29 Sep 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,861
0
25 Aug 2022
Harmonizer: Learning to Perform White-Box Image and Video Harmonization
Zhanghan Ke
Chunyi Sun
Lei Zhu
Ke Xu
Rynson W. H. Lau
47
72
0
04 Jul 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
419
15,515
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
341
3,605
0
20 Dec 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
224
7,857
0
11 May 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
927
29,436
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
397
4,953
0
24 Feb 2021
Deep Image Compositing
He Zhang
Jianming Zhang
Federico Perazzi
Zhe Lin
Vishal M. Patel
48
36
0
04 Nov 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
774
42,055
0
28 May 2020