ResearchTrend.AI
Conditional Panoramic Image Generation via Masked Autoregressive Modeling


22 May 2025
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
    DiffM

Papers citing "Conditional Panoramic Image Generation via Masked Autoregressive Modeling"

33 / 33 papers shown
Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model
Liu Yang
Huiyu Duan
Yucheng Zhu
Xiaohong Liu
Lu Liu
Zitong Xu
Guangji Ma
Xiongkuo Min
Guangtao Zhai
P. Callet
VLM
VGen
351
2
0
15 Apr 2025
Panorama Generation From NFoV Image Done Right
Dian Zheng
Cheng Zhang
Xiao-Ming Wu
Cao Li
Chengfei Lv
Jian-Fang Hu
Wei-Shi Zheng
DiffM
93
2
0
24 Mar 2025
UMC: Unified Resilient Controller for Legged Robots with Joint Malfunctions
Yu Qiu
X. Lin
Jingbo Wang
Xianrui Li
Lu Qi
Ming-Hsuan Yang
51
1
0
05 Feb 2025
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
Xiaokang Chen
Zhiyu Wu
Xingchao Liu
Zizheng Pan
Wen Liu
Zhenda Xie
X. Yu
Chong Ruan
AI4TS
68
126
0
29 Jan 2025
VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Zhaoliang Wan
Yonggen Ling
Senlin Yi
Lu Qi
Wangwei Lee
...
Xiao Teng
Peng Lu
Xu Yang
Ming-Hsuan Yang
Hui Cheng
77
5
0
31 Dec 2024
Imagine360: Immersive 360 Video Generation from Perspective Anchor
Jing Tan
Shuai Yang
Tong Wu
Jingwen He
Yuwei Guo
Ziqiang Liu
Dahua Lin
VGen
88
3
0
04 Dec 2024
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou
Xiaoyu Zhang
Yongchuan Tang
MLLM
DiffM
133
1
0
24 Nov 2024
Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation
Xiaoyu Zhang
Teng Zhou
Xinlong Zhang
Jia Wei
Yongchuan Tang
67
2
0
24 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
Junwei Zhou
Xueting Li
Lu Qi
Ming-Hsuan Yang
DiffM
64
4
0
20 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
80
26
0
03 Oct 2024
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Kuan-Chih Huang
Xiangtai Li
Lu Qi
Shuicheng Yan
Ming-Hsuan Yang
LRM
108
12
0
27 May 2024
PanoDiffusion: 360-degree Panorama Outpainting via Diffusion
Tianhao Wu
Chuanxia Zheng
Tat-Jen Cham
DiffM
58
21
0
06 Jul 2023
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang
Fuyang Zhang
Jiacheng Chen
Peng Wang
Yasutaka Furukawa
46
153
0
03 Jul 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
54
52
0
30 May 2023
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
99
381
0
05 Oct 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
92
960
0
07 Sep 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
86
3,830
0
26 Jul 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
158
1,089
0
22 Jun 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
277
585
0
29 May 2022
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian D. Weilbach
Frank Wood
DiffM
BDL
VGen
190
293
0
23 May 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
90
264
0
16 Mar 2022
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
85
664
0
08 Feb 2022
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr
Martin Danelljan
Andrés Romero
Feng Yu
Radu Timofte
Luc Van Gool
DiffM
308
1,385
0
24 Jan 2022
BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning
Chang-Hwan Oh
Wonjune Cho
Daehee Park
Yujeong Chae
Lin Wang
Kuk-Jin Yoon
3DV
MDE
43
17
0
12 Dec 2021
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
444
1,617
0
10 Nov 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
120
1,196
0
30 May 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
283
495
0
20 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
681
28,659
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
327
4,873
0
24 Feb 2021
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
268
6,293
0
26 Nov 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
500
41,106
0
28 May 2020
Matterport3D: Learning from RGB-D Data in Indoor Environments
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
114
1,880
0
18 Sep 2017
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
497
27,231
0
02 Dec 2015