arXiv: 2205.11487
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,338 papers shown
Categorical SDEs with Simplex Diffusion
Pierre Harvey Richemond
Sander Dieleman
Arnaud Doucet
DiffM
19
24
0
26 Oct 2022
Full-band General Audio Synthesis with Score-based Diffusion
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serrà
DiffM
30
33
0
26 Oct 2022
Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker
Simon Damm
Thorsten Holz
Asja Fischer
DiffM
35
102
0
26 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Yufan Zhou
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
32
11
0
25 Oct 2022
Vitruvio: 3D Building Meshes via Single Perspective Sketches
Alberto Tono
Heyaojing Huang
Ashwin Agrawal
Martin Fischer
29
5
0
24 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
35
2
0
24 Oct 2022
High-Resolution Image Editing via Multi-Stage Blended Diffusion
J. Ackermann
Minjun Li
DiffM
27
15
0
24 Oct 2022
Instance-Aware Image Completion
Ji-Ho Cho
Minguk Kang
Vibhav Vineet
Jaesik Park
ISeg
VLM
20
2
0
22 Oct 2022
Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks
A. S. Bansal
Yoonjin Lee
Kyle Hilburn
I. Ebert‐Uphoff
AI4TS
38
2
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
41
22
0
21 Oct 2022
Conditional Diffusion with Less Explicit Guidance via Model Predictive Control
Max W. Shen
Ehsan Hajiramezanali
Gabriele Scalia
Alex Tseng
N. Diamant
Tommaso Biancalani
Andreas Loukas
34
1
0
21 Oct 2022
Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi
P. Mayer
Josue Casco-Rodriguez
Ali Siahkoohi
Richard G. Baraniuk
DiffM
32
20
0
21 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
33
119
0
20 Oct 2022
Composing Ensembles of Pre-trained Models via Iterative Consensus
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
Igor Mordatch
MoMe
19
23
0
20 Oct 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
65
483
0
20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
38
18
0
19 Oct 2022
Language Models Understand Us, Poorly
Jared Moore
LRM
19
4
0
19 Oct 2022
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models
Royi Rassin
Shauli Ravfogel
Yoav Goldberg
29
60
0
19 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein
Cristina Luna Jiménez
Fernando Fernández-Martínez
DiffM
20
3
0
19 Oct 2022
Differentially Private Diffusion Models
Tim Dockhorn
Tianshi Cao
Arash Vahdat
Karsten Kreis
DiffM
32
91
0
18 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
27
11
0
18 Oct 2022
UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image
Dani Valevski
Matan Kalman
Eyal Molad
Eyal Segalis
Yossi Matias
Yaniv Leviathan
DiffM
24
39
0
17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
11
1,054
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
44
308
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
90
3,276
0
16 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion
Matthew Baas
Kevin Eloff
Herman Kamper
DiffM
22
4
0
14 Oct 2022
Is synthetic data from generative models ready for image recognition?
Ruifei He
Shuyang Sun
Xin Yu
Chuhui Xue
Wenqing Zhang
Philip Torr
Song Bai
Xiaojuan Qi
52
288
0
14 Oct 2022
MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
28
374
0
13 Oct 2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning
Mahmoud Assran
Randall Balestriero
Quentin Duval
Florian Bordes
Ishan Misra
Piotr Bojanowski
Pascal Vincent
Michael G. Rabbat
Nicolas Ballas
SSL
52
47
0
13 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
28
116
0
13 Oct 2022
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Minheng Ni
Zitong Huang
Kai-Hua Feng
W. Zuo
VLM
19
15
0
13 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
31
47
0
13 Oct 2022
Self-Guided Diffusion Models
Vincent Tao Hu
David W. Zhang
Yuki M. Asano
Gertjan J. Burghouts
Cees G. M. Snoek
36
32
0
12 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
73
489
0
12 Oct 2022
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation
Chaerin Kong
D. Jeon
Oh-Hun Kwon
Nojun Kwak
DiffM
27
16
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
74
32
0
11 Oct 2022
A generic diffusion-based approach for 3D human pose prediction in the wild
Saeed Saadatnejad
Ali-Ahmad Rasekh
Mohammadreza Mofayezi
Yasamin Medghalchi
Sara Rajabzadeh
Taylor Mordan
Alexandre Alahi
DiffM
19
34
0
11 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
33
67
0
11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
49
105
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Matthew Baas
Herman Kamper
DiffM
40
8
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
38
4
0
11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
53
26
0
10 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
34
167
0
10 Oct 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM
DiffM
32
11
0
10 Oct 2022
Bridging CLIP and StyleGAN through Latent Alignment for Image Editing
Wanfeng Zheng
Qiang Li
Xiaoyan Guo
Pengfei Wan
Zhong-ming Wang
75
14
0
10 Oct 2022
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation
Chieh-Hsin Lai
Yuhta Takida
Naoki Murata
Toshimitsu Uesaka
Yuki Mitsufuji
Stefano Ermon
DiffM
26
28
0
09 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffM
MedIm
LM&MA
30
111
0
09 Oct 2022
Can Artificial Intelligence Reconstruct Ancient Mosaics?
Fernando Moral-Andrés
Elena Merino-Gómez
Pedro Reviriego
Fabrizio Lombardi
24
7
0
07 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Qing Guo
Miguel P. Eckstein
William Yang Wang
82
37
0
07 Oct 2022
Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via Simulation-based Synthetic Data Augmentation and Multitask Learning
Riccardo Finotello
D. L’hermite
Celine Quéré
Benjamin Rouge
M. Tamaazousti
J. Sirven
22
1
0
07 Oct 2022