NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

20 July 2022
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
    VGen

Papers citing "NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis"

50 / 53 papers shown
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
55
0
0
06 Mar 2025
ASurvey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
J. Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVM
VGen
AI4TS
62
0
0
25 Feb 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
75
0
0
25 Feb 2025
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGen
DiffM
81
11
0
10 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
63
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
DiffM
66
0
0
06 Dec 2024
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
92
1
0
04 Dec 2024
Playable Game Generation
Mingyu Yang
Junyou Li
Zhongbin Fang
Sheng Chen
Yangbin Yu
Qiang Fu
Wei Yang
Deheng Ye
VGen
76
9
0
01 Dec 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
117
1
0
25 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
M. Zhang
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
48
9
0
08 Nov 2024
Supervised Chain of Thought
Xiang Zhang
Dujian Ding
LRM
AI4CE
26
1
0
18 Oct 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
47
5
0
31 Jul 2024
Zero-shot Text-guided Infinite Image Synthesis with LLM guidance
Soyeong Kwon
Taegyeong Lee
Taehwan Kim
DiffM
21
2
0
17 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
35
7
0
02 Jul 2024
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
Haonan Qiu
Zhaoxi Chen
Zhouxia Wang
Yingqing He
Menghan Xia
Ziwei Liu
VGen
DiffM
36
17
0
24 Jun 2024
Learning Images Across Scales Using Adversarial Training
Krzysztof Wolski
Adarsh Djeacoumar
Alireza Javanmardi
Hans-Peter Seidel
Christian Theobalt
Guillaume Cordonnier
K. Myszkowski
G. Drettakis
Xingang Pan
Thomas Leimkuhler
43
2
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
69
2
0
13 Jun 2024
Controllable Longer Image Animation with Diffusion Models
Qiang Wang
Minghua Liu
Junjun Hu
Fan Jiang
Mu Xu
VGen
30
0
0
27 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
20
0
17 May 2024
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Daniel Winter
Matan Cohen
Shlomi Fruchter
Yael Pritch
Alex Rav Acha
Yedid Hoshen
DiffM
40
26
0
27 Mar 2024
A Survey on Long Video Generation: Challenges, Methods, and Prospects
Chengxuan Li
Di Huang
Zeyu Lu
Yang Xiao
Qingqi Pei
Lei Bai
EGVM
42
19
0
25 Mar 2024
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Runhu Huang
Kaixin Cai
Jianhua Han
Xiaodan Liang
Renjing Pei
Guansong Lu
Songcen Xu
Wei Zhang
Hang Xu
DiffM
31
3
0
18 Mar 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
33
7
0
16 Mar 2024
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
Xinjie Zhang
Shenyuan Gao
Zhening Liu
Jiawei Shao
Xingtong Ge
Dailan He
Tongda Xu
Yan Wang
Jun Zhang
42
1
0
13 Mar 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
38
26
0
30 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
115
274
0
17 Jan 2024
MEVG: Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh
Jaehwan Jeong
Sieun Kim
Wonmin Byeon
Jinkyu Kim
Sungwoong Kim
Sangpil Kim
VGen
DiffM
33
20
0
07 Dec 2023
RealFill: Reference-Driven Generation for Authentic Image Completion
Luming Tang
Nataniel Ruiz
Qinghao Chu
Yuanzhen Li
Aleksander Holynski
...
Bharath Hariharan
Yael Pritch
Neal Wadhwa
Kfir Aberman
Michael Rubinstein
DiffM
11
43
0
28 Sep 2023
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Han Lin
Abhaysinh Zala
Jaemin Cho
Mohit Bansal
LM&Ro
VGen
DiffM
43
74
0
26 Sep 2023
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Emanuele Bugliarello
Hernan Moraldo
Ruben Villegas
Mohammad Babaeizadeh
M. Saffar
Han Zhang
D. Erhan
V. Ferrari
Pieter-Jan Kindermans
P. Voigtlaender
VGen
33
10
0
22 Aug 2023
Metrics to Quantify Global Consistency in Synthetic Medical Images
Daniel Scholz
Benedikt Wiestler
Daniel Rueckert
M. Menten
MedIm
11
1
0
01 Aug 2023
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
42
29
0
09 Jun 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
45
88
0
29 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
30
5
0
24 May 2023
i-Code Studio: A Configurable and Composable Framework for Integrative AI
Yuwei Fang
Mahmoud Khademi
Chenguang Zhu
Ziyi Yang
Reid Pryzant
...
Yao Qian
Takuya Yoshioka
Lu Yuan
Michael Zeng
Xuedong Huang
30
2
0
23 May 2023
MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis
Jinsheng Zheng
Daqing Liu
Chaoyue Wang
Minghui Hu
Zuopeng Yang
Changxing Ding
Dacheng Tao
31
1
0
10 May 2023
Generative Disco: Text-to-Video Generation for Music Visualization
Vivian Liu
Tao Long
Nathan Raw
Lydia B. Chilton
VGen
11
33
0
17 Apr 2023
Soundini: Sound-Guided Diffusion for Natural Video Editing
Seung Hyun Lee
Si-Yeol Kim
Innfarn Yoo
Feng Yang
Donghyeon Cho
Youngseo Kim
Huiwen Chang
Jinkyu Kim
Sangpil Kim
VGen
DiffM
35
15
0
13 Apr 2023
DiffCollage: Parallel Generation of Large Content with Diffusion Models
Qinsheng Zhang
Jiaming Song
Xun Huang
Yongxin Chen
Ming-Yu Liu
DiffM
29
82
0
30 Mar 2023
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
22
56
0
26 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
15
125
0
22 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
38
19
0
17 Mar 2023
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Xiaodong Wang
Chenfei Wu
S. Yin
Minheng Ni
Jianfeng Wang
...
Fan Yang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
DiffM
23
7
0
21 Feb 2023
InfiniCity: Infinite-Scale City Synthesis
C. Lin
Hsin-Ying Lee
Willi Menapace
Menglei Chai
Aliaksandr Siarohin
Ming-Hsuan Yang
Sergey Tulyakov
VGen
26
52
0
23 Jan 2023
TeViS: Translating Text Synopses to Video Storyboards
Xu Gu
Yuchong Sun
Feiyue Ni
Shizhe Chen
Xihua Wang
Ruihua Song
B. Li
Xiang Cao
DiffM
23
4
0
31 Dec 2022
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
Wenbo Li
Xin Yu
Kun Zhou
Yibing Song
Zhe-nan Lin
Jiaya Jia
DiffM
32
11
0
06 Dec 2022
High-Resolution Image Editing via Multi-Stage Blended Diffusion
J. Ackermann
Minjun Li
DiffM
14
15
0
24 Oct 2022
Large-scale Text-to-Image Generation Models for Visual Artists' Creative Works
Hyung-Kwon Ko
Gwanmo Park
Hyeon Jeon
Jaemin Jo
Juho Kim
Jinwook Seo
27
138
0
16 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
54
371
0
05 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
81
4
0
05 Oct 2022