ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.07549
  4. Cited By
Make-A-Storyboard: A General Framework for Storyboard with Disentangled
  and Merged Control

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

6 December 2023
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control"

46 / 46 papers shown
Title
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
Shilin Lu
Yanzhu Liu
A. Kong
109
101
0
24 Jul 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation
  without Test-time Fine-tuning
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
62
150
0
21 Jul 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
103
44
0
01 Jun 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
78
49
0
29 May 2023
Improved Visual Story Generation with Adaptive Context Modeling
Improved Visual Story Generation with Adaptive Context Modeling
Zhangyin Feng
Yuchen Ren
Xinmiao Yu
Xiaocheng Feng
Duyu Tang
Shuming Shi
Bing Qin
DiffM
69
15
0
26 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized
  Attention
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGenDiffM
145
254
0
17 May 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe Lin
H. J. Jung
DiffM
168
294
0
06 Apr 2023
Segment Anything
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLMVLM
377
7,405
0
05 Apr 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set
  Object Detection
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
191
2,023
0
09 Mar 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized
  Text-to-Image Generation
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei
Yabo Zhang
Zhilong Ji
Jinfeng Bai
Lei Zhang
W. Zuo
DiffM
94
328
0
27 Feb 2023
Zero-shot Generation of Coherent Storybook from Plain Text Story using
  Diffusion Models
Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models
Hyeonho Jeong
Gihyun Kwon
Jong Chul Ye
69
23
0
08 Feb 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image
  Diffusion Models
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
119
516
0
31 Jan 2023
Training-Free Structured Diffusion Guidance for Compositional
  Text-to-Image Synthesis
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Xinze Wang
William Yang Wang
CoGe
148
318
0
09 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
165
875
0
08 Dec 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
104
70
0
23 Nov 2022
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Binxin Yang
Shuyang Gu
Bo Zhang
Ting Zhang
Xuejin Chen
Xiaoyan Sun
Dong Chen
Fang Wen
DiffM
95
426
0
23 Nov 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
84
65
0
20 Nov 2022
Character-Centric Story Visualization via Visual Planning and Token
  Alignment
Character-Centric Story Visualization via Visual Planning and Token Alignment
Hong Chen
Rujun Han
Te-Lin Wu
Hideki Nakayama
Nanyun Peng
DiffMVGen
72
32
0
16 Oct 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story
  Continuation
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
DiffM
101
81
0
13 Sep 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
282
2,891
0
25 Aug 2022
Word-Level Fine-Grained Story Visualization
Word-Level Fine-Grained Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM3DH
91
26
0
03 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
168
1,897
0
02 Aug 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
200
1,133
0
22 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
136
390
0
06 Jun 2022
Compositional Visual Generation with Composable Diffusion Models
Compositional Visual Generation with Composable Diffusion Models
Nan Liu
Shuang Li
Yilun Du
Antonio Torralba
J. Tenenbaum
DiffMCoGe
198
529
0
03 Jun 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
466
6,077
0
23 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
413
6,916
0
13 Apr 2022
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
298
30,152
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
502
15,788
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
364
3,630
0
20 Dec 2021
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
169
799
0
29 Nov 2021
Blended Diffusion for Text-driven Editing of Natural Images
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
135
954
0
29 Nov 2021
Integrating Visuospatial, Linguistic and Commonsense Structure into
  Story Visualization
Integrating Visuospatial, Linguistic and Commonsense Structure into Story Visualization
A. Maharana
Joey Tianyi Zhou
77
62
0
21 Oct 2021
Alias-Free Generative Adversarial Networks
Alias-Free Generative Adversarial Networks
Tero Karras
M. Aittala
S. Laine
Erik Härkönen
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
177
1,601
0
23 Jun 2021
Improving Generation and Evaluation of Visual Stories via Semantic
  Consistency
Improving Generation and Evaluation of Visual Stories via Semantic Consistency
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
EGVM
78
62
0
20 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
271
7,958
0
11 May 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
169
1,588
0
18 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
996
29,871
0
26 Feb 2021
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
739
18,364
0
19 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
530
10,591
0
17 Feb 2020
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
619
10,595
0
12 Dec 2018
StoryGAN: A Sequential Conditional GAN for Story Visualization
StoryGAN: A Sequential Conditional GAN for Story Visualization
Yitong Li
Zhe Gan
Yelong Shen
Jingjing Liu
Yu Cheng
Yuexin Wu
Lawrence Carin
David Carlson
Jianfeng Gao
100
233
0
06 Dec 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
274
5,404
0
28 Sep 2018
Imagine This! Scripts to Compositions to Videos
Imagine This! Scripts to Compositions to Videos
Tanmay Gupta
Dustin Schwenk
Ali Farhadi
Derek Hoiem
Aniruddha Kembhavi
CoGeVGen
148
91
0
10 Apr 2018
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDLSSLOCL
233
5,079
0
02 Nov 2017
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDaDiffM
312
7,031
0
12 Mar 2015
1