ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

19 August 2021
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
    DiffM

Papers citing "ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis"

50 / 112 papers shown
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Xuzhi Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Weinan Jia
Mengqi Huang
Nan Chen
Lei Zhang
Zhendong Mao
29
0
0
13 Apr 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
135
2
0
10 Mar 2025
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Nuwan T. Attygalle
M. Kljun
Aaron Quigley
Klen Copic Pucihar
Jens Grubert
...
Juri Yoneyama
Alice Toniolo
Angela Miguel
Hirokazu Kato
M. Weerasinghe
DiffM
83
0
0
28 Jan 2025
[MASK] is All You Need
Vincent Tao Hu
Bjorn Ommer
DiffM
137
2
0
09 Dec 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
M. Zhang
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
48
9
0
08 Nov 2024
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
Xiaoxiao He
Ligong Han
Quan Dao
Song Wen
Minhao Bai
...
Hongdong Li
Junzhou Huang
Faez Ahmed
Akash Srivastava
Dimitris Metaxas
DiffM
SyDa
40
5
0
10 Oct 2024
Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
Huaxi Huang
Xin Yuan
Qiyu Liao
Dadong Wang
Tongliang Liu
DiffM
32
0
0
05 Sep 2024
Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey
V. T. Truong
Luan Ba Dang
Long Bao Le
DiffM
MedIm
56
16
0
06 Aug 2024
Zero-shot Text-guided Infinite Image Synthesis with LLM guidance
Soyeong Kwon
Taegyeong Lee
Taehwan Kim
DiffM
26
2
0
17 Jul 2024
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Jianyi Zhang
Yufan Zhou
Jiuxiang Gu
Curtis Wigington
Tong Yu
Yiran Chen
Tong Sun
Ruiyi Zhang
77
0
0
17 Jun 2024
Latent Denoising Diffusion GAN: Faster sampling, Higher image quality
Luan Thanh Trinh
T. Hamagami
DiffM
40
5
0
17 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
42
1
0
15 Jun 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation
PengFei Zheng
Yonggang Zhang
Zhen Fang
Tongliang Liu
Defu Lian
Bo Han
DiffM
34
8
0
13 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
41
10
0
07 Mar 2024
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
45
7
0
02 Mar 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
35
0
0
08 Feb 2024
Generative Human Motion Stylization in Latent Space
Chuan Guo
Yuxuan Mu
Wei Ji
Peng Dai
Youliang Yan
Juwei Lu
Li Cheng
VGen
38
10
0
24 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
35
3
0
21 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
43
35
0
07 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Tengjiao Wang
DiffM
46
33
0
04 Jan 2024
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
54
9
0
31 Dec 2023
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
43
11
0
31 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
YoungJoon Yoo
Jongwon Choi
BDL
26
2
0
15 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation
Chuanxia Zheng
Andrea Vedaldi
3DV
42
48
0
07 Dec 2023
Diffusion Models Without Attention
Jing Nathan Yan
Jiatao Gu
Alexander M. Rush
29
61
0
30 Nov 2023
Formulating Discrete Probability Flow Through Optimal Transport
Pengze Zhang
Hubery Yin
Chen Li
Xiaohua Xie
OT
52
5
0
07 Nov 2023
Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
Jincheng Zhang
Jingjing Tang
C. Saitis
Gyorgy Fazekas
DiffM
30
3
0
21 Oct 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
25
17
0
10 Oct 2023
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
Shiyue Cao
Yueqin Yin
Lianghua Huang
Yu Liu
Xin Zhao
Deli Zhao
Kaiqi Huang
ViT
24
14
0
09 Oct 2023
Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks
Meng Zhou
Matthias W. Wagner
U. Tabori
C. Hawkins
B. Ertl-Wagner
Farzad Khalvati
MedIm
19
5
0
02 Oct 2023
On quantifying and improving realism of images generated with diffusion
Yunzhu Chen
Naveed Akhtar
Nur Al Hasan Haldar
Ajmal Saeed Mian
27
4
0
26 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
45
132
0
20 Sep 2023
A 3M-Hybrid Model for the Restoration of Unique Giant Murals: A Case Study on the Murals of Yongle Palace
Jing Yang
Nur Intan Raihana Ruhaiyem
Chichun Zhou
32
1
0
12 Sep 2023
Text-to-feature diffusion for audio-visual few-shot learning
Otniel-Bogdan Mercea
Thomas Hummel
A. Sophia Koepke
Zeynep Akata
VLM
27
2
0
07 Sep 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Cuican Yu
Guansong Lu
Yihan Zeng
Jian Sun
Xiaodan Liang
Huibin Li
Zongben Xu
Songcen Xu
Wei Zhang
Hang Xu
47
14
0
31 Aug 2023
Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Liyan Wang
Qinyu Yang
Cong Wang
Wen Wang
Jin-shan Pan
Zhixun Su
DiffM
37
2
0
17 Aug 2023
Photorealistic and Identity-Preserving Image-Based Emotion Manipulation with Latent Diffusion Models
Ioannis Pikoulis
P. Filntisis
Petros Maragos
32
2
0
06 Aug 2023
Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Zheng Ding
Mengqi Zhang
Jiajun Wu
Z. Tu
DiffM
30
33
0
02 Aug 2023
Online Clustered Codebook
Chuanxia Zheng
Andrea Vedaldi
37
26
0
27 Jul 2023
Flow Matching in Latent Space
Quan Dao
Hao Phung
Binh Duc Nguyen
Anh Tran
37
60
0
17 Jul 2023
Dynamically Masked Discriminator for Generative Adversarial Networks
Wentian Zhang
Haozhe Liu
Bing Li
Jinheng Xie
Yawen Huang
Yuexiang Li
Yefeng Zheng
Guohao Li
TTA
38
2
0
13 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
Dongdong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
27
34
0
07 Jun 2023
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Guian Fang
Zutao Jiang
Jianhua Han
Guangsong Lu
Hang Xu
Shengcai Liao
Xiaodan Liang
EGVM
29
1
0
31 May 2023
DualVAE: Controlling Colours of Generated and Real Images
Keerth Rathakumar
David Liebowitz
Christian J. Walder
Kristen Moore
S. Kanhere
26
0
0
30 May 2023
BRICS: Bi-level feature Representation of Image CollectionS
Dingdong Yang
Yizhi Wang
Ali Mahdavi-Amiri
Hao Zhang
DiffM
15
0
0
29 May 2023
TD-GEM: Text-Driven Garment Editing Mapper
R. Dadfar
Sanaz Sabzevari
Mårten Björkman
Danica Kragic
DiffM
33
2
0
29 May 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen
DiffM
68
21
0
23 May 2023