Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.00973
Cited By
v1
v2
v3 (latest)
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
1 June 2023
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models"
33 / 33 papers shown
Title
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
Yukang Feng
Jianwen Sun
Chuanhao Li
Zizhen Li
Jiaxin Ai
...
Yifan Chang
Sizhuo Zhou
Shenglin Zhang
Yu Dai
Kaipeng Zhang
MLLM
EGVM
92
0
0
11 Jun 2025
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
Cailin Zhuang
Ailin Huang
Wei Cheng
J. Wu
Yaoqi Hu
...
Hengyuan Xu
Xuanyang Zhang
Xianfang Zeng
Gang Yu
Fangqiu Yi
CoGe
70
2
0
30 May 2025
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling
Jaskirat Singh
Junshen Kevin Chen
Jonas Kohler
Michael Cohen
DiffM
VGen
88
1
0
08 Apr 2025
One-Minute Video Generation with Test-Time Training
Karan Dalal
Daniel Koceja
Gashon Hussein
Jiarui Xu
Yue Zhao
...
Tatsunori Hashimoto
Sanmi Koyejo
Yejin Choi
Yu Sun
Xiaolong Wang
ViT
194
13
0
07 Apr 2025
Consistent Subject Generation via Contrastive Instantiated Concepts
Lee Hsin-Ying
Kelvin Chan
Ming-Hsuan Yang
DiffM
157
0
0
31 Mar 2025
Object Isolated Attention for Consistent Story Visualization
Xiangyang Luo
Junhao Cheng
Yifan Xie
Xin Zhang
Tao Feng
Ziqiang Liu
Fei Ma
Fei Richard Yu
DiffM
110
6
0
30 Mar 2025
Latent Beam Diffusion Models for Decoding Image Sequences
Guilherme Fernandes
Vasco Ramos
Regev Cohen
Idan Szpektor
João Magalhães
168
1
0
26 Mar 2025
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
Haiguang Wang
Daqi Liu
Hongwei Xie
Haisong Liu
Enhui Ma
Kaicheng Yu
Limin Wang
Bing Wang
VGen
129
2
0
20 Mar 2025
Automated Movie Generation via Multi-Agent CoT Planning
Weijia Wu
Zeyu Zhu
Mike Zheng Shou
VGen
155
7
0
10 Mar 2025
VisAgent: Narrative-Preserving Story Visualization Framework
Seungkwon Kim
GyuTae Park
Sangyeon Kim
Seung-Hun Nam
93
1
0
04 Mar 2025
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
235
3
0
22 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGen
DiffM
132
6
0
07 Nov 2024
KAHANI: Culturally-Nuanced Visual Storytelling Tool for Non-Western Cultures
Hamna
Deepthi Sudharsan
Agrima Seth
Ritvik Budhiraja
Deepika Khullar
Vyshak Jain
Kalika Bali
Aditya Vashistha
Sameer Segal
DiffM
65
0
0
25 Oct 2024
Unbounded: A Generative Infinite Game of Character Life Simulation
Jialu Li
Yuanzhen Li
Neal Wadhwa
Yael Pritch
David E. Jacobs
Michael Rubinstein
Joey Tianyi Zhou
Nataniel Ruiz
VGen
AI4CE
86
6
0
24 Oct 2024
M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes
Sixu Yan
Zeyu Zhang
Muzhi Han
Zaijin Wang
Qi Xie
Zhitian Li
Zhehan Li
Hangxin Liu
Xinggang Wang
Song-Chun Zhu
122
8
0
15 Oct 2024
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization
Jiawei Mao
Xiaoke Huang
Yunfei Xie
Yuanqi Chang
Mude Hui
Bingjie Xu
Yuyin Zhou
VGen
DiffM
121
4
0
08 Oct 2024
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
Lin Li
Guikun Chen
Hanrong Shi
Jun Xiao
Long Chen
128
11
0
21 Sep 2024
CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion
Yiran Chen
Anyi Rao
Xuekun Jiang
Shishi Xiao
Ruiqing Ma
Zeyu Wang
Hui Xiong
Bo Dai
VGen
DiffM
71
1
0
30 Aug 2024
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu
Shaocheng Shen
Qiang Hu
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
114
11
0
20 Aug 2024
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling
Zilyu Ye
Yu Lei
Ruotian Peng
Jinjin Cao
Zhiyang Chen
...
Mingyuan Zhou
Xiaoqian Shen
Mohamed Elhoseiny
Nan Zhuang
Guo-Jun Qi
VGen
VLM
76
1
0
07 Aug 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
200
17
0
17 Jul 2024
SEED-Story: Multimodal Long Story Generation with Large Language Model
Shuai Yang
Yuying Ge
Yang Li
Yukang Chen
Yixiao Ge
Ying Shan
Yingcong Chen
VGen
DiffM
146
32
0
11 Jul 2024
StoryDiffusion: How to Support UX Storyboarding With Generative-AI
Zhaohui Liang
Xiaoyu Zhang
Kevin Ma
Zhao Liu
Xipei Ren
K. Goucher-Lambert
Can Liu
DiffM
76
7
0
10 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
190
10
0
07 Jul 2024
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Wei Chen
Lin Li
Yongqi Yang
Bin Wen
Fan Yang
Tingting Gao
Yu Wu
Long Chen
VLM
VGen
145
11
0
15 Jun 2024
Binarized Diffusion Model for Image Super-Resolution
Zheng Chen
Haotong Qin
Yong Guo
Xiongfei Su
Xin Yuan
Linghe Kong
Yulun Zhang
DiffM
88
9
0
09 Jun 2024
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng
Xi Lu
Hanhui Li
Khun Loun Zai
Baiqiao Yin
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
133
11
0
03 Jun 2024
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
84
2
0
27 May 2024
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
Ming Tao
Bing-Kun Bao
Hao Tang
Yaowei Wang
Changsheng Xu
DiffM
95
8
0
09 Apr 2024
Many-to-many Image Generation with Auto-regressive Diffusion Models
Ying Shen
Yizhe Zhang
Shuangfei Zhai
Lifu Huang
J. Susskind
Jiatao Gu
135
6
0
03 Apr 2024
WonderJourney: Going from Anywhere to Everywhere
Hong-Xing Yu
Haoyi Duan
Junhwa Hur
Kyle Sargent
Michael Rubinstein
...
Forrester Cole
Deqing Sun
Noah Snavely
Jiajun Wu
Charles Herrmann
VGen
116
57
0
06 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
72
3
0
06 Dec 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
82
10
0
11 Oct 2023
1