Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
Text-To-4D Dynamic Scene Generation
Uriel Singer
Shelly Sheynin
Adam Polyak
Oron Ashual
Iurii Makarov
...
Naman Goyal
Andrea Vedaldi
Devi Parikh
Justin Johnson
Yaniv Taigman
DiffM
39
147
0
26 Jan 2023
Simple diffusion: End-to-end diffusion for high resolution images
Emiel Hoogeboom
Jonathan Heek
Tim Salimans
33
251
0
26 Jan 2023
Imitating Human Behaviour with Diffusion Models
Tim Pearce
Tabish Rashid
Anssi Kanervisto
David Bignell
Mingfei Sun
...
Sergio Valcarcel Macua
Shan Zheng Tan
Ida Momennejad
Katja Hofmann
Sam Devlin
DiffM
48
207
0
25 Jan 2023
Towards Arbitrary Text-driven Image Manipulation via Space Alignment
Yun-Hao Bai
Zi-Qi Zhong
Chao Dong
Weichen Zhang
Guowei Xu
Chun Yuan
40
0
0
25 Jan 2023
Bipartite Graph Diffusion Model for Human Interaction Generation
Baptiste Chopin
Hao Tang
Mohamed Daoudi
DiffM
24
12
0
24 Jan 2023
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Axel Sauer
Tero Karras
S. Laine
Andreas Geiger
Timo Aila
37
209
0
23 Jan 2023
Regeneration Learning: A Learning Paradigm for Data Generation
Xu Tan
Tao Qin
Jiang Bian
Tie-Yan Liu
Yoshua Bengio
GAN
40
15
0
21 Jan 2023
A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs
Fabian Falck
Christopher Williams
D. Danks
George Deligiannidis
C. Yau
Chris Holmes
Arnaud Doucet
M. Willetts
43
8
0
19 Jan 2023
MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer
Junde Wu
Rao Fu
Huihui Fang
Min Xu
Yu Zhang
Yanwu Xu
DiffM
MedIm
49
160
0
19 Jan 2023
Using Large Text-to-Image Models with Structured Prompts for Skin Disease Identification: A Case Study
Sajith Rajapaksa
Jean M. Uwabeza Vianney
Renell Castro
Farzad Khalvati
Shubhra Aich
LM&MA
14
1
0
17 Jan 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
80
570
1
17 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
32
5
0
16 Jan 2023
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang
Zan Wang
Puhao Li
Baoxiong Jia
Tengyu Liu
Yixin Zhu
Wei Liang
Song-Chun Zhu
DiffM
73
202
0
15 Jan 2023
A survey and taxonomy of loss functions in machine learning
Lorenzo Ciampiconi
A. Elwood
Marco Leonardi
A. Mohamed
A. Rozza
MU
FaML
14
27
0
13 Jan 2023
A Residual Diffusion Model for High Perceptual Quality Codec Augmentation
Noor Fathima Ghouse
Jens Petersen
Auke Wiggers
Tianlin Xu
Guillaume Sautière
DiffM
32
22
0
13 Jan 2023
Open-vocabulary Object Segmentation with Diffusion Models
Ziyi Li
Qinye Zhou
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
Weidi Xie
VLM
35
65
0
12 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
Branislav Kveton
Patrick Blobaum
DiffM
40
7
0
12 Jan 2023
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
49
5
0
12 Jan 2023
Street-View Image Generation from a Bird's-Eye View Layout
Alexander Swerdlow
Runsheng Xu
Bolei Zhou
30
65
0
11 Jan 2023
ChatGPT is not all you need. A State of the Art Review of large Generative AI models
Roberto Gozalo-Brizuela
E.C. Garrido-Merchán
27
261
0
11 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
24
35
0
10 Jan 2023
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
29
5
0
05 Jan 2023
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion
Martin Pernuš
Clinton Fookes
Vitomir Štruc
Simon Dobrišek
DiffM
28
27
0
05 Jan 2023
Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems
Michael Cahyadi
M. Rafi
William Shan
Jurike V. Moniaga
Henry Lucky
35
4
0
05 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
68
11
0
04 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
526
0
02 Jan 2023
TeViS:Translating Text Synopses to Video Storyboards
Xu Gu
Yuchong Sun
Feiyue Ni
Shizhe Chen
Xihua Wang
Ruihua Song
Yangqiu Song
Xiang Cao
DiffM
38
4
0
31 Dec 2022
Image Embedding for Denoising Generative Models
Andrea Asperti
David Evangelista
Samuele Marro
Fabio Merizzi
DiffM
25
9
0
30 Dec 2022
Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models
Mischa Dombrowski
Hadrien Reynaud
Matthew Baugh
Bernhard Kainz
DiffM
30
3
0
29 Dec 2022
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu
Xintao Wang
Weihao Cheng
Yan-Pei Cao
Ying Shan
Xiaohu Qie
Shenghua Gao
188
161
0
28 Dec 2022
Multi-Realism Image Compression with a Conditional Generator
E. Agustsson
David C. Minnen
G. Toderici
Fabian Mentzer
17
68
0
28 Dec 2022
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
Lefei Zhang
44
10
0
28 Dec 2022
Exploring Transformer Backbones for Image Diffusion Models
Princy Chahal
13
3
0
27 Dec 2022
DiffFace: Diffusion-based Face Swapping with Facial Guidance
Kihong Kim
Yunho Kim
Seokju Cho
Junyoung Seo
Jisu Nam
Kychul Lee
Seung Wook Kim
Kwanghee Lee
DiffM
32
53
0
27 Dec 2022
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
39
5
0
25 Dec 2022
Do DALL-E and Flamingo Understand Each Other?
Hang Li
Jindong Gu
Rajat Koner
Sahand Sharifzadeh
Volker Tresp
MLLM
21
12
0
23 Dec 2022
When are Lemons Purple? The Concept Association Bias of Vision-Language Models
Yutaro Yamada
Yingtian Tang
Yoyo Zhang
Ilker Yildirim
CoGe
26
14
0
22 Dec 2022
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise
Zheng-Wen Lin
Yeyun Gong
Yelong Shen
Tong Wu
Zhihao Fan
Chen Lin
Nan Duan
Weizhu Chen
AI4CE
DiffM
VLM
40
61
0
22 Dec 2022
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
64
694
0
22 Dec 2022
Multi-Lingual DALL-E Storytime
Noga Mudrik
Adam S. Charles
14
0
0
22 Dec 2022
Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing
Shengchao Liu
Weili Nie
Chengpeng Wang
Jiarui Lu
Zhuoran Qiao
Ling Liu
Jian Tang
Chaowei Xiao
Anima Anandkumar
48
155
0
21 Dec 2022
Character-Aware Models Improve Visual Text Rendering
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
31
71
0
20 Dec 2022
Benchmarking Spatial Relationships in Text-to-Image Generation
Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
EGVM
51
66
0
20 Dec 2022
Are Deep Neural Networks SMARTer than Second Graders?
A. Cherian
Kuan-Chuan Peng
Suhas Lohit
Kevin A. Smith
J. Tenenbaum
AAML
LRM
ReLM
35
29
0
20 Dec 2022
MetaCLUE: Towards Comprehensive Visual Metaphors Research
Arjun Reddy Akula
Brenda S. Driscoll
P. Narayana
Soravit Changpinyo
Zhi-xuan Jia
...
Sugato Basu
Leonidas J. Guibas
William T. Freeman
Yuanzhen Li
Varun Jampani
CLIP
VLM
21
24
0
19 Dec 2022
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
40
2,052
0
19 Dec 2022
Optimizing Prompts for Text-to-Image Generation
Y. Hao
Zewen Chi
Li Dong
Furu Wei
55
140
0
19 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan
Yi Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM
VGen
46
174
0
19 Dec 2022
Latent Diffusion for Language Generation
Justin Lovelace
Varsha Kishore
Chao-gang Wan
Eliot Shekhtman
Kilian Q. Weinberger
DiffM
29
71
0
19 Dec 2022
Face Generation and Editing with StyleGAN: A Survey
Andrew Melnik
Maksim Miasayedzenkau
Dzianis Makaravets
Dzianis Pirshtuk
Eren Akbulut
Dennis Holzmann
Tarek Renusch
Gustav Reichert
Helge J. Ritter
CVBM
34
40
0
18 Dec 2022
Previous
1
2
3
...
79
80
81
...
85
86
87
Next