ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
151
69
0
11 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
267
4,192
1
10 Feb 2023
MaskSketch: Unpaired Structure-guided Masked Image Generation
MaskSketch: Unpaired Structure-guided Masked Image Generation
D. Bashkirova
José Lezama
Kihyuk Sohn
Kate Saenko
Irfan Essa
DiffM
60
25
0
10 Feb 2023
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of
  Diffusion Models
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Wenliang Zhao
Lujia Bai
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
143
226
0
09 Feb 2023
A Text-guided Protein Design Framework
A Text-guided Protein Design Framework
Shengchao Liu
Yanjing Li
Zhuoxinran Li
A. Gitter
Yutao Zhu
...
Arvind Ramanathan
Chaowei Xiao
Jian Tang
Hongyu Guo
Anima Anandkumar
130
70
0
09 Feb 2023
Information-Theoretic Diffusion
Information-Theoretic Diffusion
Xianghao Kong
Rob Brekelmans
Greg Ver Steeg
DiffM
59
16
0
07 Feb 2023
Effective Data Augmentation With Diffusion Models
Effective Data Augmentation With Diffusion Models
Brandon Trabucco
Kyle Doherty
Max Gurinas
Ruslan Salakhutdinov
DiffMVLM
116
257
0
07 Feb 2023
Mixture of Diffusers for scene composition and high resolution image
  generation
Mixture of Diffusers for scene composition and high resolution image generation
Á. Jiménez
DiffM
73
46
0
05 Feb 2023
Dreamix: Video Diffusion Models are General Video Editors
Dreamix: Video Diffusion Models are General Video Editors
Eyal Molad
Eliahu Horwitz
Dani Valevski
Alex Rav-Acha
Yossi Matias
Yael Pritch
Yaniv Leviathan
Yedid Hoshen
DiffMVGen
131
188
0
02 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINNLM&Ro
135
264
0
31 Jan 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image
  Diffusion Models
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
170
519
0
31 Jan 2023
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models
Tao Yang
Yuwang Wang
Yan Lv
Nanning Zh
DiffM
144
24
0
31 Jan 2023
What Makes Good Examples for Visual In-Context Learning?
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLMVPVLMVLMLRM
106
117
0
31 Jan 2023
Improved machine learning algorithm for predicting ground state
  properties
Improved machine learning algorithm for predicting ground state properties
Laura Lewis
Hsin-Yuan Huang
Viet-Trung Tran
Sebastian Lehner
R. Kueng
J. Preskill
AI4CE
89
49
0
30 Jan 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion
  Models
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
238
344
0
30 Jan 2023
Minimizing Trajectory Curvature of ODE-based Generative Models
Minimizing Trajectory Curvature of ODE-based Generative Models
Sangyun Lee
Beomsu Kim
Jong Chul Ye
145
60
0
27 Jan 2023
Diffusion Models as Artists: Are we Closing the Gap between Humans and
  Machines?
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?
Victor Boutin
Thomas Fel
Lakshya Singhal
Rishav Mukherji
Akash Nagaraj
Julien Colin
Thomas Serre
DiffM
72
6
0
27 Jan 2023
3DShape2VecSet: A 3D Shape Representation for Neural Fields and
  Generative Diffusion Models
3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models
Biao Zhang
Jiapeng Tang
Matthias Niessner
Peter Wonka
DiffM
175
220
0
26 Jan 2023
MusicLM: Generating Music From Text
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
152
451
0
26 Jan 2023
Simple diffusion: End-to-end diffusion for high resolution images
Simple diffusion: End-to-end diffusion for high resolution images
Emiel Hoogeboom
Jonathan Heek
Tim Salimans
108
268
0
26 Jan 2023
A Multi-Resolution Framework for U-Nets with Applications to
  Hierarchical VAEs
A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs
Fabian Falck
Christopher Williams
D. Danks
George Deligiannidis
C. Yau
Chris Holmes
Arnaud Doucet
M. Willetts
98
9
0
19 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
85
6
0
16 Jan 2023
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang
Zan Wang
Puhao Li
Baoxiong Jia
Tengyu Liu
Yixin Zhu
Wei Liang
Song-Chun Zhu
DiffM
128
218
0
15 Jan 2023
A survey and taxonomy of loss functions in machine learning
A survey and taxonomy of loss functions in machine learning
Lorenzo Ciampiconi
A. Elwood
Marco Leonardi
A. Mohamed
A. Rozza
MUFaML
57
29
0
13 Jan 2023
A Residual Diffusion Model for High Perceptual Quality Codec
  Augmentation
A Residual Diffusion Model for High Perceptual Quality Codec Augmentation
Noor Fathima Ghouse
Jens Petersen
Auke Wiggers
Tianlin Xu
Guillaume Sautière
DiffM
103
22
0
13 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
Branislav Kveton
Patrick Blobaum
DiffM
60
7
0
12 Jan 2023
Predictive World Models from Real-World Partial Observations
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
91
5
0
12 Jan 2023
Street-View Image Generation from a Bird's-Eye View Layout
Street-View Image Generation from a Bird's-Eye View Layout
Alexander Swerdlow
Runsheng Xu
Bolei Zhou
138
73
0
11 Jan 2023
ChatGPT is not all you need. A State of the Art Review of large
  Generative AI models
ChatGPT is not all you need. A State of the Art Review of large Generative AI models
Roberto Gozalo-Brizuela
E.C. Garrido-Merchán
91
267
0
11 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffMVGen
104
36
0
10 Jan 2023
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion
Martin Pernuš
Clinton Fookes
Vitomir Štruc
Simon Dobrišek
DiffM
84
29
0
05 Jan 2023
Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based
  Image Generation Systems
Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems
Michael Cahyadi
M. Rafi
William Shan
Jurike V. Moniaga
Henry Lucky
66
6
0
05 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
Attribute-Centric Compositional Text-to-Image Generation
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
114
13
0
04 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
278
560
0
02 Jan 2023
Image Embedding for Denoising Generative Models
Image Embedding for Denoising Generative Models
Andrea Asperti
David Evangelista
Samuele Marro
Fabio Merizzi
DiffM
65
10
0
30 Dec 2022
Foreground-Background Separation through Concept Distillation from
  Generative Image Foundation Models
Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models
Mischa Dombrowski
Hadrien Reynaud
Matthew Baugh
Bernhard Kainz
DiffM
89
3
0
29 Dec 2022
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and
  Text-to-Image Diffusion Models
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu
Xintao Wang
Weihao Cheng
Yan-Pei Cao
Ying Shan
Xiaohu Qie
Shenghua Gao
260
165
0
28 Dec 2022
Multi-Realism Image Compression with a Conditional Generator
Multi-Realism Image Compression with a Conditional Generator
E. Agustsson
David C. Minnen
G. Toderici
Fabian Mentzer
101
75
0
28 Dec 2022
Exploring Vision Transformers as Diffusion Learners
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
Lefei Zhang
83
10
0
28 Dec 2022
Exploring Transformer Backbones for Image Diffusion Models
Exploring Transformer Backbones for Image Diffusion Models
Princy Chahal
42
3
0
27 Dec 2022
DiffFace: Diffusion-based Face Swapping with Facial Guidance
DiffFace: Diffusion-based Face Swapping with Facial Guidance
Kihong Kim
Yunho Kim
Seokju Cho
Junyoung Seo
Jisu Nam
Kychul Lee
Seung Wook Kim
Kwanghee Lee
DiffM
99
59
0
27 Dec 2022
Neural Shape Compiler: A Unified Framework for Transforming between
  Text, Point Cloud, and Program
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
92
5
0
25 Dec 2022
Do DALL-E and Flamingo Understand Each Other?
Do DALL-E and Flamingo Understand Each Other?
Hang Li
Jindong Gu
Rajat Koner
Sahand Sharifzadeh
Volker Tresp
MLLM
79
12
0
23 Dec 2022
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for
  Text-to-Video Generation
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
171
752
0
22 Dec 2022
Multi-Lingual DALL-E Storytime
Multi-Lingual DALL-E Storytime
Noga Mudrik
Adam S. Charles
24
0
0
22 Dec 2022
Multi-modal Molecule Structure-text Model for Text-based Retrieval and
  Editing
Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing
Shengchao Liu
Weili Nie
Chengpeng Wang
Jiarui Lu
Zhuoran Qiao
Ling Liu
Jian Tang
Chaowei Xiao
Anima Anandkumar
135
167
0
21 Dec 2022
Character-Aware Models Improve Visual Text Rendering
Character-Aware Models Improve Visual Text Rendering
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
117
74
0
20 Dec 2022
Benchmarking Spatial Relationships in Text-to-Image Generation
Benchmarking Spatial Relationships in Text-to-Image Generation
Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
EGVM
116
72
0
20 Dec 2022
Are Deep Neural Networks SMARTer than Second Graders?
Are Deep Neural Networks SMARTer than Second Graders?
A. Cherian
Kuan-Chuan Peng
Suhas Lohit
Kevin A. Smith
J. Tenenbaum
AAMLLRMReLM
112
31
0
20 Dec 2022
MetaCLUE: Towards Comprehensive Visual Metaphors Research
MetaCLUE: Towards Comprehensive Visual Metaphors Research
Arjun Reddy Akula
Brenda S. Driscoll
P. Narayana
Soravit Changpinyo
Zhi-xuan Jia
...
Sugato Basu
Leonidas Guibas
William T. Freeman
Yuanzhen Li
Varun Jampani
CLIPVLM
56
26
0
19 Dec 2022
Previous
123...212223...262728
Next