2206.10789
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
22 June 2022
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
Zirui Wang
Vijay Vasudevan
Alexander Ku
Yinfei Yang
Burcu Karagol Ayan
Ben Hutchinson
Wei Han
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
Papers citing
"Scaling Autoregressive Models for Content-Rich Text-to-Image Generation"
50 / 899 papers shown
Variational Distribution Learning for Unsupervised Text-to-Image Generation
Minsoo Kang
Doyup Lee
Jiseob Kim
Saehoon Kim
Bohyung Han
DRL
OOD
67
4
0
28 Mar 2023
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
136
56
0
28 Mar 2023
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
109
100
0
27 Mar 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
119
116
0
27 Mar 2023
Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu
Chuan Wen
Weirui Ye
Jiaming Song
Yang Gao
DiffM
VGen
64
43
0
27 Mar 2023
Equivariant Similarity for Vision-Language Foundation Models
Tan Wang
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
CoGe
83
51
0
25 Mar 2023
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li Song
Wenjun Zhang
DiffM
68
67
0
25 Mar 2023
High Fidelity Image Synthesis With Deep VAEs In Latent Space
Troy Luhman
Eric Luhman
DRL
3DV
65
7
0
23 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
75
201
0
23 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
118
228
0
23 Mar 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
81
13
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
88
581
0
23 Mar 2023
TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision
Jiacheng Wei
Hao Wang
Jiashi Feng
Guosheng Lin
Kim-Hui Yap
72
30
0
23 Mar 2023
A Word is Worth a Thousand Pictures: Prompts as AI Design Material
Chinmay Kulkarni
Stefania Druga
Minsuk Chang
Alexander J. Fiannaca
Carrie J. Cai
Michael Terry
3DV
59
31
0
22 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
83
137
0
22 Mar 2023
The Prompt Artists
Minsuk Chang
Stefania Druga
Alexander J. Fiannaca
P. Vergani
Chinmay Kulkarni
Carrie J. Cai
Michael Terry
56
66
0
22 Mar 2023
MAGVLT: Masked Generative Vision-and-Language Transformer
Sungwoong Kim
DaeJin Jo
Donghoon Lee
Jongmin Kim
VLM
58
12
0
21 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
87
239
0
21 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
186
170
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
114
140
0
21 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
95
120
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
129
88
0
20 Mar 2023
Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and Model Lineage Analysis
Sergey Sinitsa
Ohad Fried
52
17
0
19 Mar 2023
IRGen: Generative Modeling for Image Retrieval
Yidan Zhang
Ting Zhang
Dong Chen
Yujing Wang
Qi Chen
...
Qi Zhang
Fan Yang
Mao Yang
Q. Liao
B. Guo
3DV
VLM
139
15
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
129
21
0
17 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
93
116
0
16 Mar 2023
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model
Zipeng Xu
E. Sangineto
N. Sebe
DiffM
86
13
0
16 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
120
280
0
14 Mar 2023
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Fan Bao
Shen Nie
Kaiwen Xue
Chongxuan Li
Shiliang Pu
Yaole Wang
Gang Yue
Yue Cao
Hang Su
Jun Zhu
DiffM
271
163
0
12 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
96
478
0
09 Mar 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
Zhiheng Liu
Ruili Feng
Kai Zhu
Yifei Zhang
Kecheng Zheng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
155
129
0
09 Mar 2023
disco: a toolkit for Distributional Control of Generative Models
Germán Kruszewski
Jos Rozen
Marc Dymetman
59
4
0
08 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
219
221
0
08 Mar 2023
Vector Quantized Time Series Generation with a Bidirectional Prior Model
Daesoo Lee
Sara Malacarne
Erlend Aune
BDL
88
29
0
08 Mar 2023
A Prompt Log Analysis of Text-to-Image Generation Systems
Yutong Xie
Zhaoying Pan
Jing Ma
Jie Luo
Qiaozhu Mei
DiffM
166
43
0
08 Mar 2023
ELODIN: Naming Concepts in Embedding Spaces
Rodrigo Mello
Filipe Calegario
Geber Ramalho
DiffM
132
1
0
07 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
48
4
0
07 Mar 2023
A Complete Recipe for Diffusion Generative Models
Kushagra Pandey
Stephan Mandt
DiffM
67
9
0
03 Mar 2023
A Pathway Towards Responsible AI Generated Content
Chen Chen
Jie Fu
Lingjuan Lyu
109
72
0
02 Mar 2023
X&Fuse: Fusing Visual Information in Text-to-Image Generation
Yuval Kirstain
Omer Levy
Adam Polyak
DiffM
50
6
0
02 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation
Diederik P. Kingma
Ruiqi Gao
DiffM
120
144
0
01 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
114
12
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
104
2
0
01 Mar 2023
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
105
11
0
28 Feb 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
161
7
0
28 Feb 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei
Yabo Zhang
Zhilong Ji
Jinfeng Bai
Lei Zhang
W. Zuo
DiffM
107
329
0
27 Feb 2023
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
Rinon Gal
Moab Arar
Yuval Atzmon
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
141
200
0
23 Feb 2023
Aligning Text-to-Image Models using Human Feedback
Kimin Lee
Hao Liu
Moonkyung Ryu
Olivia Watkins
Yuqing Du
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
S. Gu
EGVM
136
285
0
23 Feb 2023
Teaching CLIP to Count to Ten
Roni Paiss
Ariel Ephrat
Omer Tov
Shiran Zada
Inbar Mosseri
Michal Irani
Tali Dekel
VLM
CLIP
97
107
0
23 Feb 2023
Controlled and Conditional Text to Image Generation with Diffusion Prior
Pranav Aggarwal
Hareesh Ravi
Naveen Marri
Sachin Kelkar
F. Chen
...
Alvin Ghouas
Sarah Saber
Malavika Ramprasad
Baldo Faieta
Ajinkya Kale
DiffM
102
7
0
23 Feb 2023