2108.08827
Cited By
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
19 August 2021
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
Papers citing
"ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis"
50 / 112 papers shown
Title
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
38
35
0
19 May 2023
T-former: An Efficient Transformer for Image Inpainting
Ye Deng
Siqi Hui
Sanping Zhou
Deyu Meng
Jinjun Wang
ViT
19
30
0
12 May 2023
Collaborative Diffusion for Multi-Modal Face Generation and Editing
Ziqi Huang
Kelvin C. K. Chan
Yuming Jiang
Ziwei Liu
DiffM
49
103
0
20 Apr 2023
Binary Latent Diffusion
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
32
13
0
10 Apr 2023
ReVersion: Diffusion-Based Relation Inversion from Images
Ziqi Huang
Tianxing Wu
Yuming Jiang
Kelvin C. K. Chan
Ziwei Liu
39
65
0
23 Mar 2023
DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery
Chaofan Ma
Yu-Hao Yang
Chen Ju
Feifan Zhang
Jinxian Liu
Yu Wang
Ya Zhang
Yanfeng Wang
DiffM
32
37
0
17 Mar 2023
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
Weixin Chen
D. Song
Bo-wen Li
DiffM
31
74
0
10 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
24
4
0
07 Mar 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
34
3
0
22 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
41
57
0
11 Feb 2023
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Dongchao Yang
Songxiang Liu
Rongjie Huang
Chao Weng
Helen Meng
DiffM
VLM
31
85
0
31 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
521
0
02 Jan 2023
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
Yasheng Sun
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Zhibin Hong
Jingtuo Liu
Errui Ding
Jingdong Wang
Ziwei Liu
Koike Hideki
35
34
0
09 Dec 2022
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
Yuchao Gu
Xintao Wang
Yixiao Ge
Ying Shan
Xiaohu Qie
Mike Zheng Shou
DiffM
32
20
0
06 Dec 2022
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
Zutao Jiang
Guangsong Lu
Xiaodan Liang
Jihua Zhu
Wei Zhang
Xiaojun Chang
Hang Xu
DiffM
21
8
0
02 Dec 2022
Wavelet Diffusion Models are fast and scalable Image Generators
Hao Phung
Quan Dao
Anh Tran
DiffM
33
87
0
29 Nov 2022
Dimensionality-Varying Diffusion Process
Han Zhang
Ruili Feng
Zhantao Yang
Lianghua Huang
Yu Liu
Yifei Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
Fan Cheng
DiffM
30
10
0
29 Nov 2022
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat-Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
DiffM
20
23
0
27 Nov 2022
Exploring Discrete Diffusion Models for Image Captioning
Zixin Zhu
Yixuan Wei
Jianfeng Wang
Zhe Gan
Zheng-Wei Zhang
Le Wang
G. Hua
Lijuan Wang
Zicheng Liu
Han Hu
DiffM
VLM
28
17
0
21 Nov 2022
A Structure-Guided Diffusion Model for Large-Hole Image Completion
Daichi Horita
Jiaolong Yang
Dong Chen
Yuki Koyama
Kiyoharu Aizawa
N. Sebe
DiffM
28
2
0
18 Nov 2022
A Unified Pyramid Recurrent Network for Video Frame Interpolation
Xin Jin
Longhai Wu
Jie Chen
Youxin Chen
Jayoon Koo
Cheul-hee Hahm
33
35
0
07 Nov 2022
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Minheng Ni
Zitong Huang
Kai-Hua Feng
W. Zuo
VLM
19
15
0
13 Oct 2022
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq
Naveed Akhtar
MedIm
32
60
0
07 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
89
4
0
05 Oct 2022
Diffusion Models for Graphs Benefit From Discrete State Spaces
K. Haefeli
Karolis Martinkus
Nathanael Perraudin
Roger Wattenhofer
DiffM
103
52
0
04 Oct 2022
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
Chuanxia Zheng
L. Vuong
Jianfei Cai
Dinh Q. Phung
MQ
71
72
0
19 Sep 2022
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffM
VLM
MedIm
197
1,149
0
10 Sep 2022
Improved Masked Image Generation with Token-Critic
José Lezama
Huiwen Chang
Lu Jiang
Irfan Essa
DiffM
188
43
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
29
1
0
08 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
29
90
0
29 Aug 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
36
296
0
20 Jul 2022
DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection
W. G. C. Bandara
Nithin Gopalakrishnan Nair
Vishal M. Patel
DiffM
29
5
0
23 Jun 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
28
47
0
15 Jun 2022
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
21
28
0
09 Jun 2022
Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
W. H. Pinaya
M. Graham
Robert J. Gray
P. F. D. Costa
Petru-Daniel Tudosiu
...
D. Werring
Geraint Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffM
MedIm
32
102
0
07 Jun 2022
Wavelet Prior Attention Learning in Axial Inpainting Network
Chenjie Cao
Chengrong Wang
Yuntao Zhang
Yanwei Fu
43
2
0
07 Jun 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
59
373
0
06 Jun 2022
Modeling Image Composition for Complex Scene Generation
Zuopeng Yang
Daqing Liu
Chaoyue Wang
J. Yang
Dacheng Tao
ViT
36
50
0
02 Jun 2022
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
181
63
0
31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
116
46
0
31 May 2022
A Continuous Time Framework for Discrete Denoising Models
Andrew Campbell
Joe Benton
Valentin De Bortoli
Tom Rainforth
George Deligiannidis
Arnaud Doucet
DiffM
194
134
0
30 May 2022
ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions
Difan Liu
Sandesh Shetty
Tobias Hinz
Matthew Fisher
Richard Y. Zhang
Taesung Park
E. Kalogerakis
ViT
27
30
0
24 May 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
32
8
0
03 May 2022
Semi-Parametric Neural Image Synthesis
A. Blattmann
Robin Rombach
Kaan Oktay
Jonas Muller
Bjorn Ommer
DiffM
33
28
0
25 Apr 2022
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Chunjing Xu
Yanwei Fu
33
17
0
09 Apr 2022
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
175
330
0
03 Mar 2022
NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN
Minheng Ni
Chenfei Wu
Haoyang Huang
Daxin Jiang
W. Zuo
Nan Duan
30
19
0
10 Feb 2022
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
40
622
0
08 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
29
48
0
27 Dec 2021
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
150
14,683
0
20 Dec 2021