ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
288
1
0
17 Jul 2024
Quantised Global Autoencoder: A Holistic Approach to Representing Visual
  Data
Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data
Tim Elsner
Paula Usinger
Victor Czech
Gregor Kobsik
Yanjiang He
I. Lim
Leif Kobbelt
64
2
0
16 Jul 2024
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel
Ron Shapira Weber
Nir Mualem
Shahaf E. Finder
Oren Freifeld
171
2
0
16 Jul 2024
InsertDiffusion: Identity Preserving Visualization of Objects through a
  Training-Free Diffusion Architecture
InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Architecture
Phillip Mueller
Jannik Wiese
Ioan Crăciun
Lars Mikelsons
83
4
0
15 Jul 2024
Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception
Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception
Phillip Mueller
Lars Mikelsons
AI4CE
118
3
0
15 Jul 2024
PersonificationNet: Making customized subject act like a person
PersonificationNet: Making customized subject act like a person
Tianchu Guo
Pengyu Li
Biao Wang
Xiansheng Hua
46
0
0
12 Jul 2024
Surgical Text-to-Image Generation
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
150
4
0
12 Jul 2024
Controlling Space and Time with Diffusion Models
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
163
32
0
10 Jul 2024
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
Yu Cao
Shaogang Gong
DiffM
87
2
0
09 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and
  Editing
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLMDiffM
132
40
0
08 Jul 2024
Timestep-Aware Correction for Quantized Diffusion Models
Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao
Feng Tian
Jun Chen
Haonan Lin
Guang Dai
Yong Liu
Jingdong Wang
DiffMMQ
97
5
0
04 Jul 2024
Improved Noise Schedule for Diffusion Training
Improved Noise Schedule for Diffusion Training
Tiankai Hang
Shuyang Gu
DiffM
87
18
0
03 Jul 2024
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation
Xiang Gao
Zhengbo Xu
Junhan Zhao
Jiaying Liu
DiffM
95
8
0
03 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual
  Conditional Diffusion Models
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
136
40
0
02 Jul 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
Zhenyu Yang
VLMDiffM
195
10
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Li
Fan Ma
Zongxin Yang
Yue Yang
159
11
0
02 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
91
14
0
02 Jul 2024
StyleShot: A Snapshot on Any Style
StyleShot: A Snapshot on Any Style
Junyao Gao
Yanchen Liu
Yanan Sun
Yinhao Tang
Yanhong Zeng
Kai Chen
Cairong Zhao
TTA3DHVLM
180
19
0
01 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGenDiffM
194
6
0
01 Jul 2024
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su
Man Luo
Kris W Pan
Tien Pei Chou
Vasudev Lal
Phillip Howard
116
4
0
28 Jun 2024
ScoreFusion: Fusing Score-based Generative Models via Kullback-Leibler Barycenters
ScoreFusion: Fusing Score-based Generative Models via Kullback-Leibler Barycenters
Hao Liu
Junze Tony Ye
Ye
Jose H. Blanchet
DiffMFedML
116
1
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLMDiffM
160
3
0
28 Jun 2024
Text-Animator: Controllable Visual Text Video Generation
Text-Animator: Controllable Visual Text Video Generation
Lin Liu
Quande Liu
Shengju Qian
Yuan Zhou
Wengang Zhou
Houqiang Li
Lingxi Xie
Qi Tian
VGen
96
1
0
25 Jun 2024
Director3D: Real-world Camera Trajectory and 3D Scene Generation from
  Text
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Xinyang Li
Zhangyu Lai
Linning Xu
Yansong Qu
Liujuan Cao
Shengchuan Zhang
Bo Dai
Rongrong Ji
VGen
131
10
0
25 Jun 2024
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for
  Efficient Scanned Document Annotation
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation
Ahmad Mohammadshirazi
Ali Nosrati Firoozsalari
Mengxi Zhou
Dheeraj Kulshrestha
R. Ramnath
85
0
0
25 Jun 2024
Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient
  Consistency Modeling
Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling
Min-Seop Kwak
Donghoon Ahn
Ines Hyeonsu Kim
Jin-Hwa Kim
Seungryong Kim
55
2
0
24 Jun 2024
Prompt-Consistency Image Generation (PCIG): A Unified Framework
  Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun
Zhixuan Chu
Zhan Qin
Kui Ren
DiffM
86
1
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
185
39
0
24 Jun 2024
EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation
EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation
Tianyu Wei
Shanmin Pang
Qi Guo
Yizhuo Ma
Yihao Huang
Ming-Ming Cheng
Qing Guo
399
2
0
22 Jun 2024
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D
  Gaussian Generation
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
Chubin Zhang
Hongliang Song
Yi Wei
Yu Chen
Jiwen Lu
Yansong Tang
3DGS3DV
103
19
0
21 Jun 2024
Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning
Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning
Xu Han
Fangfang Fan
Jingzhao Rong
Xiaofeng Liu
Georges El Fakhri
Qingyu Chen
Xiaofeng Liu
MedIm
74
2
0
21 Jun 2024
A3D: Does Diffusion Dream about 3D Alignment?
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev
Nina Konovalova
Daniil Selikhanovych
Nikolay Patakin
Nikolay Patakin
...
Anton Konushin
Peter Wonka
Alexander Filippov
Peter Wonka
Evgeny Burnaev
DiffM
182
1
0
21 Jun 2024
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
Matthew Zheng
Enis Simsar
Hidir Yesiltepe
Federico Tombari
Joel Simon
Pinar Yanardag
136
4
0
20 Jun 2024
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Eyal Michaeli
Ohad Fried
114
1
0
20 Jun 2024
Evaluating Numerical Reasoning in Text-to-Image Models
Evaluating Numerical Reasoning in Text-to-Image Models
Ivana Kajić
Olivia Wiles
Isabela Albuquerque
Matthias Bauer
Su Wang
Jordi Pont-Tuset
Aida Nematzadeh
EGVMReLM
201
2
0
20 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffMVLM
149
5
0
17 Jun 2024
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
Robert Honig
Javier Rando
Nicholas Carlini
Florian Tramèr
WIGMAAML
137
21
0
17 Jun 2024
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao
Masatoshi Uehara
Gabriele Scalia
Tommaso Biancalani
Sergey Levine
Ehsan Hajiramezanali
Ehsan Hajiramezanali
AI4CE
177
7
0
17 Jun 2024
Diffusion Models in Low-Level Vision: A Survey
Diffusion Models in Low-Level Vision: A Survey
Chunming He
Yuqi Shen
Chengyu Fang
Fengyang Xiao
Longxiang Tang
Yulun Zhang
W. Zuo
Zhenhua Guo
Xiu Li
VLMDiffMMedIm
221
42
0
17 Jun 2024
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional
  Flow Matching
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
Chaeyoung Jung
Suyeon Lee
Ji-Hoon Kim
Joon Son Chung
DiffM
89
7
0
13 Jun 2024
C3DAG: Controlled 3D Animal Generation using 3D pose guidance
C3DAG: Controlled 3D Animal Generation using 3D pose guidance
Sandeep Mishra
Oindrila Saha
A. Bovik
76
0
0
11 Jun 2024
Zero-shot Image Editing with Reference Imitation
Zero-shot Image Editing with Reference Imitation
Xi Chen
Yutong Feng
Mengting Chen
Yiyang Wang
Shilong Zhang
Yu Liu
Yujun Shen
Hengshuang Zhao
DiffM
88
27
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
101
2
0
11 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
131
53
0
11 Jun 2024
Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional
  SSMs
Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs
Shentong Mo
Mamba
88
6
0
07 Jun 2024
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction
Eduard Poesina
Adriana Valentina Costache
Adrian-Gabriel Chifu
Josiane Mothe
Radu Tudor Ionescu
VLM
145
1
0
07 Jun 2024
How to Strategize Human Content Creation in the Era of GenAI?
How to Strategize Human Content Creation in the Era of GenAI?
Seyed A. Esmaeili
Kshipra Bhawalkar
Zhe Feng
Di Wang
Di Wang
Haifeng Xu
132
3
0
07 Jun 2024
DiffuSyn Bench: Evaluating Vision-Language Models on Real-World
  Complexities with Diffusion-Generated Synthetic Benchmarks
DiffuSyn Bench: Evaluating Vision-Language Models on Real-World Complexities with Diffusion-Generated Synthetic Benchmarks
Haokun Zhou
Yipeng Hong
VLMEGVM
74
1
0
06 Jun 2024
TexIm FAST: Text-to-Image Representation for Semantic Similarity
  Evaluation using Transformers
TexIm FAST: Text-to-Image Representation for Semantic Similarity Evaluation using Transformers
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
ViT
70
0
0
06 Jun 2024
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D
  Data
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data
Qihao Liu
Yi Zhang
Song Bai
Adam Kortylewski
Alan Yuille
99
11
0
06 Jun 2024
Previous
123...101112...262728
Next