ResearchTrend.AI
Next Patch Prediction for Autoregressive Visual Generation


19 December 2024
Yatian Pang
Peng Jin
Shuo Yang
Bin Lin
Bin Zhu
Zhenyu Tang
Liuhan Chen
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
arXiv: 2412.15321

Papers citing "Next Patch Prediction for Autoregressive Visual Generation"

9 / 9 papers shown
MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning
Jinhua Zhang
Wei Long
Minghao Han
Weiyi You
Shuhang Gu
BDL
17
0
0
19 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Jiahui Geng
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
Zhiyuan Yan
Junyan Ye
Weijia Li
Zilong Huang
Shenghai Yuan
Xiangyang He
Kaiqing Lin
Jun-Jian He
Conghui He
Li Yuan
MLLM
EGVM
96
12
0
03 Apr 2025
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations
Zhenyu Tang
Chaoran Feng
Xinhua Cheng
Wangbo Yu
Junwu Zhang
Yuan Liu
Xiaoxiao Long
Wenping Wang
Li Yuan
3DGS
66
1
0
29 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
Hongyu Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe-nan Lin
Marios Savvides
62
0
0
11 Mar 2025
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
Yuwei Niu
Munan Ning
Mengren Zheng
Bin Lin
Peng Jin
Jiaqi Liao
Kunpeng Ning
Bin Zhu
Li Yuan
EGVM
66
14
0
10 Mar 2025
Frequency Autoregressive Image Generation with Continuous Tokens
Hu Yu
Hao Luo
Hangjie Yuan
Yu Rong
Feng Zhao
VGen
52
3
0
07 Mar 2025
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning
Peng Jin
Yiming Li
Li Yuan
Shuicheng Yan
Jie Chen
66
1
0
31 Dec 2024
Autoregressive Video Generation without Vector Quantization
Haoge Deng
Ting Pan
Haiwen Diao
Zhengxiong Luo
Yufeng Cui
Huchuan Lu
Shiguang Shan
Yonggang Qi
Xinlong Wang
VGen
DiffM
91
18
0
18 Dec 2024