ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.01434
  4. Cited By
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video
  Generation

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

2 May 2024
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
    DiffM
    VGen
ArXivPDFHTML

Papers citing "StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation"

22 / 72 papers shown
Title
StoryMaker: Towards Holistic Consistent Characters in Text-to-image
  Generation
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Zhengguang Zhou
Jing Li
Huaxia Li
Nemo Chen
Xu Tang
DiffM
VGen
38
8
0
19 Sep 2024
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
Yuhang Jia
Yang Chen
Jinghua Zhao
Shiwan Zhao
Wenjia Zeng
Yong Chen
Yong Qin
DiffM
36
1
0
19 Sep 2024
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Jiahao Wang
Caixia Yan
Weizhan Zhang
Haonan Lin
Mengmeng Wang
Guang Dai
Tieliang Gong
Hao Sun
Jingdong Wang
DiffM
29
2
0
07 Sep 2024
SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
Yang Zhang
Rui Zhang
Xuecheng Nie
Haochen Li
Jikun Chen
Yifan Hao
Xin Zhang
Luoqi Liu
Ling Li
43
0
0
02 Sep 2024
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Yitong Yang
Yinglin Wang
Jing Wang
Tian Zhang
DiffM
40
1
0
24 Aug 2024
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's
  Impact on Spatio-Temporal Cross-Attentions
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Zinuo Li
Hamid Laga
F. Boussaïd
VGen
79
2
0
27 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
62
12
0
17 Jul 2024
Contrastive Sequential-Diffusion Learning: An approach to Multi-Scene
  Instructional Video Synthesis
Contrastive Sequential-Diffusion Learning: An approach to Multi-Scene Instructional Video Synthesis
Vasco Ramos
Yonatan Bitton
Michal Yarom
Idan Szpektor
João Magalhães
DiffM
35
0
0
16 Jul 2024
Improving Visual Storytelling with Multimodal Large Language Models
Improving Visual Storytelling with Multimodal Large Language Models
Xiaochuan Lin
Xiangyong Chen
39
0
0
02 Jul 2024
MotionBooth: Motion-Aware Customized Text-to-Video Generation
MotionBooth: Motion-Aware Customized Text-to-Video Generation
Jianzong Wu
Xiangtai Li
Yanhong Zeng
J. J. Zhang
Qianyu Zhou
Yining Li
Yunhai Tong
Kai Chen
DiffM
VGen
75
40
0
25 Jun 2024
Video-Infinity: Distributed Long Video Generation
Video-Infinity: Distributed Long Video Generation
Zhenxiong Tan
Xingyi Yang
Songhua Liu
Xinchao Wang
VGen
43
19
0
24 Jun 2024
Coherent Zero-Shot Visual Instruction Generation
Coherent Zero-Shot Visual Instruction Generation
Quynh Phung
Songwei Ge
Jia-Bin Huang
57
2
0
06 Jun 2024
ORACLE: Leveraging Mutual Information for Consistent Character
  Generation with LoRAs in Diffusion Models
ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models
Kiymet Akdemir
Pinar Yanardag
DiffM
41
1
0
04 Jun 2024
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image
  Generation
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng
Xi Lu
Hanhui Li
Khun Loun Zai
Baiqiao Yin
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
40
10
0
03 Jun 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo
  Benchmark
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
...
Jun Lan
Huijia Zhu
Jianfu Zhang
Weiqiang Wang
Huaxiong Li
Mamba
83
14
0
30 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
M. Tomizuka
Kurt Keutzer
Diana Marculescu
DiffM
VGen
70
7
0
24 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion
  Models
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
42
6
0
24 Apr 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
VGen
101
77
0
21 Mar 2024
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
164
1,016
0
25 Nov 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
89
52
0
16 Oct 2023
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr
Martin Danelljan
Andrés Romero
F. I. F. Richard Yu
Radu Timofte
Luc Van Gool
DiffM
233
1,355
0
24 Jan 2022
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
321
75,834
0
18 May 2015
Previous
12