ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,340 papers shown
Title
Accountable Textual-Visual Chat Learns to Reject Human Instructions in
  Image Re-creation
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
30
0
0
10 Mar 2023
GECCO: Geometrically-Conditioned Point Diffusion Models
GECCO: Geometrically-Conditioned Point Diffusion Models
M. Tyszkiewicz
Pascal Fua
Eduard Trulls
DiffM
26
21
0
10 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Scaling up GANs for Text-to-Image Synthesis
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
51
450
0
09 Mar 2023
3DGen: Triplane Latent Diffusion for Textured Mesh Generation
3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Anchit Gupta
Wenhan Xiong
Yixin Nie
Anchit Gupta
Barlas Oğuz
DiffM
106
158
0
09 Mar 2023
Natural scene reconstruction from fMRI signals using generative latent
  diffusion
Natural scene reconstruction from fMRI signals using generative latent diffusion
Furkan Ozcelik
Rufin VanRullen
DiffM
105
42
0
09 Mar 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
Cones: Concept Neurons in Diffusion Models for Customized Generation
Zhiheng Liu
Ruili Feng
Kai Zhu
Yifei Zhang
Kecheng Zheng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
111
122
0
09 Mar 2023
Identification of Systematic Errors of Image Classifiers on Rare
  Subgroups
Identification of Systematic Errors of Image Classifiers on Rare Subgroups
J. H. Metzen
Robin Hutmacher
N. G. Hua
Valentyn Boreiko
Dan Zhang
AAML
VLM
61
19
0
09 Mar 2023
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion
  Models
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu
Sifei Liu
Arash Vahdat
Wonmin Byeon
Xiaolong Wang
Shalini De Mello
VLM
223
321
0
08 Mar 2023
Vector Quantized Time Series Generation with a Bidirectional Prior Model
Vector Quantized Time Series Generation with a Bidirectional Prior Model
Daesoo Lee
Sara Malacarne
Erlend Aune
BDL
45
25
0
08 Mar 2023
Transformer-based Image Generation from Scene Graphs
Transformer-based Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
59
15
0
08 Mar 2023
A Prompt Log Analysis of Text-to-Image Generation Systems
A Prompt Log Analysis of Text-to-Image Generation Systems
Yutong Xie
Zhaoying Pan
Jing Ma
Jie Luo
Qiaozhu Mei
DiffM
125
40
0
08 Mar 2023
TRACT: Denoising Diffusion Models with Transitive Closure
  Time-Distillation
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot
Arnaud Autef
Jierui Lin
Dian Ang Yap
Shuangfei Zhai
Siyuan Hu
Daniel Zheng
Walter Talbot
Eric Gu
DiffM
36
81
0
07 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
38
509
0
07 Mar 2023
ELODIN: Naming Concepts in Embedding Spaces
ELODIN: Naming Concepts in Embedding Spaces
Rodrigo Mello
Filipe Calegario
Geber Ramalho
DiffM
35
1
0
07 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
31
4
0
07 Mar 2023
Deep Learning for Inertial Positioning: A Survey
Deep Learning for Inertial Positioning: A Survey
Changhao Chen
Xianfei Pan
24
49
0
07 Mar 2023
DLT: Conditioned layout generation with Joint Discrete-Continuous
  Diffusion Layout Transformer
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer
Elad Levi
Eli Brosh
Mykola Mykhailych
Meir Perez
DiffM
69
16
0
07 Mar 2023
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic
  Analysis For DDIM-Type Samplers
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers
Sitan Chen
Giannis Daras
A. Dimakis
DiffM
26
62
0
06 Mar 2023
ChatGPT is on the Horizon: Could a Large Language Model be Suitable for
  Intelligent Traffic Safety Research and Applications?
ChatGPT is on the Horizon: Could a Large Language Model be Suitable for Intelligent Traffic Safety Research and Applications?
Ou Zheng
Mohamed Abdel-Aty
Dongdong Wang
Zijin Wang
Shengxuan Ding
LM&MA
35
14
0
06 Mar 2023
StyO: Stylize Your Face in Only One-Shot
StyO: Stylize Your Face in Only One-Shot
Bonan li
Zicheng Zhang
Xuecheng Nie
Congying Han
Yinhan Hu
Tiande Guo
DiffM
39
6
0
06 Mar 2023
Learning multi-scale local conditional probability models of images
Learning multi-scale local conditional probability models of images
Zahra Kadkhodaie
Florentin Guth
S. Mallat
Eero P. Simoncelli
DiffM
37
17
0
06 Mar 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and
  Artificial Scenes
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Xu Ju
Ailing Zeng
Jianan Wang
Qian Xu
Lei Zhang
3DH
40
45
0
05 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
163
218
0
03 Mar 2023
Bi-parametric prostate MR image synthesis using pathology and
  sequence-conditioned stable diffusion
Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion
Shaheer U. Saeed
Tom Syer
Wen Yan
Qianye Yang
M. Emberton
S. Punwani
Matthew J. Clarkson
D. Barratt
Yipeng Hu
DiffM
MedIm
29
9
0
03 Mar 2023
Word-As-Image for Semantic Typography
Word-As-Image for Semantic Typography
Shira Iluz
Yael Vinker
Amir Hertz
Daniel Berio
Daniel Cohen-Or
Ariel Shamir
DiffM
60
61
0
03 Mar 2023
A Complete Recipe for Diffusion Generative Models
A Complete Recipe for Diffusion Generative Models
Kushagra Pandey
Stephan Mandt
DiffM
46
8
0
03 Mar 2023
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of
  Pneumothorax
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax
Zachary Huemann
Xin Tie
Junjie Hu
Tyler Bradshaw
22
14
0
02 Mar 2023
Counterfactual Edits for Generative Evaluation
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
26
0
0
02 Mar 2023
Consistency Models
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLM
DiffM
74
875
0
02 Mar 2023
A Pathway Towards Responsible AI Generated Content
A Pathway Towards Responsible AI Generated Content
Chen Chen
Jie Fu
Lingjuan Lyu
49
71
0
02 Mar 2023
DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural
  Network Worry-Free?
DSD2^22: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
Victor Quétu
Enzo Tartaglione
37
7
0
02 Mar 2023
X&Fuse: Fusing Visual Information in Text-to-Image Generation
X&Fuse: Fusing Visual Information in Text-to-Image Generation
Yuval Kirstain
Omer Levy
Adam Polyak
DiffM
27
5
0
02 Mar 2023
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse
  Proposal Generation and Goal-Conditioned Policy
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy
Yinzhen Xu
Weikang Wan
Jialiang Zhang
Haoran Liu
Zikang Shan
...
Yijia Weng
Jiayi Chen
Tengyu Liu
Li Yi
He Wang
71
115
0
02 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data
  Augmentation
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation
Diederik P. Kingma
Ruiqi Gao
DiffM
21
128
0
01 Mar 2023
Continuous-Time Functional Diffusion Processes
Continuous-Time Functional Diffusion Processes
Giulio Franzese
Dario Rossi
Simone Rossi
Markus Heinonen
Maurizio Filippone
Pietro Michiardi
44
24
0
01 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
50
10
0
01 Mar 2023
Unlimited-Size Diffusion Restoration
Unlimited-Size Diffusion Restoration
Yinhuai Wang
Jiwen Yu
Runyi Yu
Jian Zhang
61
15
0
01 Mar 2023
Monocular Depth Estimation using Diffusion Models
Monocular Depth Estimation using Diffusion Models
Saurabh Saxena
Abhishek Kar
Mohammad Norouzi
David J. Fleet
DiffM
VLM
MDE
45
84
0
28 Feb 2023
Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Synthesizing Mixed-type Electronic Health Records using Diffusion Models
T. Ceritli
Ghadeer O. Ghosheh
V. Chauhan
T. Zhu
Andrew P. Creagh
David Clifton
MedIm
DiffM
51
19
0
28 Feb 2023
Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?
Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?
Hyemin Ahn
Esteve Valls Mascaro
Dongheui Lee
VGen
DiffM
16
22
0
28 Feb 2023
Benchmarking Deepart Detection
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
36
11
0
28 Feb 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
34
7
0
28 Feb 2023
AVscript: Accessible Video Editing with Audio-Visual Scripts
AVscript: Accessible Video Editing with Audio-Visual Scripts
Mina Huh
Saelyne Yang
Yi-Hao Peng
Xiang Ánthony' Chen
Young-Ho Kim
Amy Pavel
41
32
0
27 Feb 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized
  Text-to-Image Generation
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei
Yabo Zhang
Zhilong Ji
Jinfeng Bai
Lei Zhang
W. Zuo
DiffM
30
314
0
27 Feb 2023
MetaAID 2.0: An Extensible Framework for Developing Metaverse
  Applications via Human-controllable Pre-trained Models
MetaAID 2.0: An Extensible Framework for Developing Metaverse Applications via Human-controllable Pre-trained Models
Hongyin Zhu
25
6
0
25 Feb 2023
Directed Diffusion: Direct Control of Object Placement through Attention
  Guidance
Directed Diffusion: Direct Control of Object Placement through Attention Guidance
W. Ma
J. P. Lewis
Avisek Lahiri
Thomas Leung
W. Kleijn
DiffM
21
66
0
25 Feb 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
21
24
0
24 Feb 2023
To the Noise and Back: Diffusion for Shared Autonomy
To the Noise and Back: Diffusion for Shared Autonomy
Takuma Yoneda
Luzhe Sun
Ge Yang
Bradly C. Stadie
Matthew R. Walter
DiffM
35
27
0
23 Feb 2023
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising
  Diffusion Models
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
Jamie M. Wynn
Daniyar Turmukhambetov
DiffM
AI4CE
23
110
0
23 Feb 2023
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image
  Models
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
Rinon Gal
Moab Arar
Yuval Atzmon
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
48
198
0
23 Feb 2023
Previous
123...767778...858687
Next