ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,346 papers shown
Title
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
Wei Xiao
Tsun-Hsuan Wang
Chuang Gan
Daniela Rus
DiffM
40
30
0
31 May 2023
MuseCoco: Generating Symbolic Music from Text
MuseCoco: Generating Symbolic Music from Text
Peiling Lu
Xin Xu
C. Kang
Botao Yu
Chengyi Xing
Xuejiao Tan
Jiang Bian
39
40
0
31 May 2023
GANDiffFace: Controllable Generation of Synthetic Datasets for Face
  Recognition with Realistic Variations
GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations
Pietro Melzi
Christian Rathgeb
Ruben Tolosana
R. Vera-Rodríguez
Dominik Lawatsch
Florian Domin
Maxim Schaubert
DiffM
31
51
0
31 May 2023
A Geometric Perspective on Diffusion Models
A Geometric Perspective on Diffusion Models
Defang Chen
Zhenyu Zhou
Jianhan Mei
Chunhua Shen
Chun-Yen Chen
C. Wang
DiffM
36
19
0
31 May 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
Fei Ni
Jianye Hao
Yao Mu
Yifu Yuan
Yan Zheng
Bin Wang
Zhixuan Liang
DiffM
OffRL
67
44
0
31 May 2023
Direct Diffusion Bridge using Data Consistency for Inverse Problems
Direct Diffusion Bridge using Data Consistency for Inverse Problems
Hyungjin Chung
Jeongsol Kim
Jong Chul Ye
DiffM
28
52
0
31 May 2023
A Unified Framework for U-Net Design and Analysis
A Unified Framework for U-Net Design and Analysis
Christopher Williams
Fabian Falck
George Deligiannidis
Chris Holmes
Arnaud Doucet
Saifuddin Syed
SSeg
AI4CE
57
35
0
31 May 2023
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine
  Semantic Re-alignment
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Guian Fang
Zutao Jiang
Jianhua Han
Guangsong Lu
Hang Xu
Shengcai Liao
Xiaodan Liang
EGVM
34
1
0
31 May 2023
Improving Handwritten OCR with Training Samples Generated by Glyph
  Conditional Denoising Diffusion Probabilistic Model
Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model
Haisong Ding
Bozhi Luan
Dongnan Gui
Kai Chen
Qiang Huo
DiffM
23
7
0
31 May 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Cones 2: Customizable Image Synthesis with Multiple Subjects
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
65
80
0
30 May 2023
Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Giannis Daras
Kulin Shah
Y. Dagan
Aravind Gollakota
A. Dimakis
Adam R. Klivans
DiffM
62
68
0
30 May 2023
AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
Thu Nguyen-Phuoc
Gabriel Schwartz
Yuting Ye
Stephen Lombardi
Lei Xiao
48
6
0
30 May 2023
Translation-Enhanced Multilingual Text-to-Image Generation
Translation-Enhanced Multilingual Text-to-Image Generation
Yaoyiran Li
Ching-Yun Chang
Stephen Rawls
Ivan Vulić
Anna Korhonen
34
8
0
30 May 2023
Likelihood-Based Diffusion Language Models
Likelihood-Based Diffusion Language Models
Ishaan Gulrajani
Tatsunori B. Hashimoto
DiffM
37
53
0
30 May 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
38
50
0
30 May 2023
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video
  Translation Using Conditional Image Diffusion Models
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models
Ernie Chu
Shuohao Lin
Jun-Cheng Chen
DiffM
27
21
0
30 May 2023
Nested Diffusion Processes for Anytime Image Generation
Nested Diffusion Processes for Anytime Image Generation
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
37
4
0
30 May 2023
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity
  3D Avatar Generation
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Chi Zhang
Yiwen Chen
Yijun Fu
Zheng-Yang Zhou
YU Gang
Billzb Wang
Bin-Bin Fu
Tao Chen
Guosheng Lin
Chunhua Shen
DiffM
52
27
0
30 May 2023
DiffSketching: Sketch Control Image Synthesis with Diffusion Models
DiffSketching: Sketch Control Image Synthesis with Diffusion Models
Qiang Wang
Di Kong
Fengyin Lin
Yonggang Qi
DiffM
54
14
0
30 May 2023
HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion
  Guidance
HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance
Junzhe Zhu
Peiye Zhuang
Oluwasanmi Koyejo
DiffM
34
73
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
38
34
0
30 May 2023
Diffusion-Stego: Training-free Diffusion Generative Steganography via
  Message Projection
Diffusion-Stego: Training-free Diffusion Generative Steganography via Message Projection
Daegyu Kim
Chaehun Shin
Jooyoung Choi
Dahuin Jung
Sung-Hoon Yoon
DiffM
35
10
0
30 May 2023
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li
Qinxuan Huang
Yikang Ding
Zhiheng Li
DiffM
41
36
0
30 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for
  Text-driven Video Editing
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
19
5
0
30 May 2023
BRICS: Bi-level feature Representation of Image CollectionS
BRICS: Bi-level feature Representation of Image CollectionS
Dingdong Yang
Yizhi Wang
Ali Mahdavi-Amiri
Hao Zhang
DiffM
31
0
0
29 May 2023
Controllable Text-to-Image Generation with GPT-4
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Eric Wang
DiffM
41
42
0
29 May 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Zeyue Xue
Guanglu Song
Qiushan Guo
Boxiao Liu
Zhuofan Zong
Yu Liu
Ping Luo
DiffM
73
133
0
29 May 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept
  Customization of Diffusion Models
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
66
168
0
29 May 2023
Photoswap: Personalized Subject Swapping in Images
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-Jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
31
37
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal
  Co-Denoising
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
58
90
0
29 May 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
38
78
0
29 May 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
51
46
0
29 May 2023
Image Captioning with Multi-Context Synthetic Data
Image Captioning with Multi-Context Synthetic Data
Feipeng Ma
Y. Zhou
Fengyun Rao
Yueyi Zhang
Xiaoyan Sun
DiffM
45
7
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image
  Editing With User Instructions
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
35
32
0
29 May 2023
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Hyunsoo Lee
Minsoo Kang
Bohyung Han
DiffM
24
14
0
29 May 2023
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jia-Bin Huang
Yi Ren
Rongjie Huang
Dongchao Yang
Zhenhui Ye
Chen Zhang
Jinglin Liu
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
37
60
0
29 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for
  Multi-Task Reinforcement Learning
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
43
93
0
29 May 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From
  Pre-trained Diffusion Models
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo
Tianyang Hu
Shifeng Zhang
Jiacheng Sun
Zhenguo Li
Zhihua Zhang
40
118
0
29 May 2023
Cognitively Inspired Cross-Modal Data Generation Using Diffusion Models
Cognitively Inspired Cross-Modal Data Generation Using Diffusion Models
Zizhao Hu
Mohammad Rostami
DiffM
14
0
0
28 May 2023
AIMS: All-Inclusive Multi-Level Segmentation
AIMS: All-Inclusive Multi-Level Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Jiuxiang Gu
Zhe Lin
Bo Du
Yu-Syuan Xu
Ming-Hsuan Yang
VLM
39
6
0
28 May 2023
Mitigating Inappropriateness in Image Generation: Can there be Value in
  Reflecting the World's Ugliness?
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
EGVM
18
13
0
28 May 2023
Learning to Jump: Thinning and Thickening Latent Counts for Generative
  Modeling
Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling
Tianqi Chen
Mingyuan Zhou
DiffM
67
8
0
28 May 2023
Text-to-image Editing by Image Information Removal
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
42
12
0
27 May 2023
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Zicheng Zhang
Bonan li
Xuecheng Nie
Congying Han
Tiande Guo
Luoqi Liu
DiffM
31
25
0
27 May 2023
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion
  Inference
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference
Zihao Yu
Haoyang Li
Fangcheng Fu
Xupeng Miao
Tengjiao Wang
DiffM
38
8
0
27 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas Griffiths
N. Jha
LRM
MLLM
39
1
0
26 May 2023
COMCAT: Towards Efficient Compression and Customization of
  Attention-Based Vision Models
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
58
9
0
26 May 2023
Generating Images with Multimodal Language Models
Generating Images with Multimodal Language Models
Jing Yu Koh
Daniel Fried
Ruslan Salakhutdinov
MLLM
46
243
0
26 May 2023
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain
  Activities
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities
Jingyuan Sun
Mingxiao Li
Zijiao Chen
Yunhao Zhang
Shaonan Wang
Marie-Francine Moens
DiffM
52
30
0
26 May 2023
Functional Flow Matching
Functional Flow Matching
Gavin Kerrigan
Giosue Migliorini
Padhraic Smyth
50
14
0
26 May 2023
Previous
123...676869...858687
Next