ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
Skyeyes: Ground Roaming using Aerial View Images
Skyeyes: Ground Roaming using Aerial View Images
Zhiyuan Gao
Wenbin Teng
Gonglin Chen
Jinsen Wu
Ningli Xu
R. Qin
Andrew Feng
Yajie Zhao
VGen
90
2
0
25 Sep 2024
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
Phillip Mueller
Sebastian Mueller
Lars Mikelsons
112
2
0
25 Sep 2024
PRESTO: Fast Motion Planning Using Diffusion Models Based on Key-Configuration Environment Representation
PRESTO: Fast Motion Planning Using Diffusion Models Based on Key-Configuration Environment Representation
Mingyo Seo
Yoonyoung Cho
Yoonchang Sung
Peter Stone
Yuke Zhu
Beomjoon Kim
DiffM
150
0
0
24 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLMDiffM
190
14
0
23 Sep 2024
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
Yehonathan Litman
Or Patashnik
Kangle Deng
Aviral Agrawal
Rushikesh Zawar
Fernando de la Torre
Shubham Tulsiani
146
7
0
23 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
160
5
0
22 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation
Imagine yourself: Tuning-Free Personalized Image Generation
Zecheng He
Bo Sun
Felix Juefei-Xu
Haoyu Ma
Ankit Ramchandani
...
Ning Zhang
Peizhao Zhang
Roshan Sumbaly
Peter Vajda
Animesh Sinha
DiffM
100
19
0
20 Sep 2024
Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances
Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances
Shuo Lu
YingSheng Wang
Lijun Sheng
Lingxiao He
A. Zheng
Jian Liang
OODD
179
7
0
18 Sep 2024
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Tianqi Chen
Shujian Zhang
Mingyuan Zhou
DiffM
189
5
0
17 Sep 2024
Sub-graph Based Diffusion Model for Link Prediction
Sub-graph Based Diffusion Model for Link Prediction
Hang Li
Wei Jin
Geri Skenderi
Harry Shomer
Wenzhuo Tang
Wenqi Fan
Jiliang Tang
DiffM
60
0
0
13 Sep 2024
Click2Mask: Local Editing with Dynamic Mask Generation
Click2Mask: Local Editing with Dynamic Mask Generation
Omer Regev
Omri Avrahami
Dani Lischinski
DiffM
118
2
0
12 Sep 2024
What to align in multimodal contrastive learning?
What to align in multimodal contrastive learning?
Benoit Dufumier
J. Castillo-Navarro
D. Tuia
Jean-Philippe Thiran
156
4
0
11 Sep 2024
Generative Hierarchical Materials Search
Generative Hierarchical Materials Search
Sherry Yang
Simon L. Batzner
Ruiqi Gao
Muratahan Aykol
Alexander L. Gaunt
Brendan McMorrow
Danilo J. Rezende
Dale Schuurmans
Igor Mordatch
E. D. Cubuk
AI4CE
94
7
0
10 Sep 2024
pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
Jiahao Lai
Jiaqiang Li
Jian Xu
Yanru Wu
Boshi Tang
Siqi Chen
Yongfeng Huang
Wenbo Ding
Yang Li
FedML
191
0
0
09 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
146
0
0
07 Sep 2024
Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography
Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography
Jiahao Zhu
Zixuan Chen
Lingxiao Yang
Xiaohua Xie
Yi Zhou
DiffM
88
0
0
07 Sep 2024
OPAL: Outlier-Preserved Microscaling Quantization Accelerator for
  Generative Large Language Models
OPAL: Outlier-Preserved Microscaling Quantization Accelerator for Generative Large Language Models
Jahyun Koo
Dahoon Park
Sangwoo Jung
Jaeha Kung
MQ
45
2
0
06 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGenDiffM
170
6
0
06 Sep 2024
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Qianlong Xiang
Miao Zhang
Yuzhang Shang
Jianlong Wu
Yan Yan
Liqiang Nie
DiffM
125
10
0
05 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
105
7
0
02 Sep 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
182
3
0
31 Aug 2024
RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model
RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model
Zhuan Shi
Jing Yan
Xiaoli Tang
Lingjuan Lyu
Boi Faltings
76
1
0
29 Aug 2024
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Xiaoyu Jin
Zunnan Xu
Mingwen Ou
Wenming Yang
DiffM
89
7
0
29 Aug 2024
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
Fan Liu
Wenqiang Sun
Hanyang Wang
Yikai Wang
Haowen Sun
Junliang Ye
Jun Zhang
Yueqi Duan
VGen
117
41
0
29 Aug 2024
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Haozhuo Zhang
B. Zhu
Yu Cao
Y. Hao
VLM
132
3
0
28 Aug 2024
Constrained Diffusion Models via Dual Training
Constrained Diffusion Models via Dual Training
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
109
4
0
27 Aug 2024
Diffusion Models Are Real-Time Game Engines
Diffusion Models Are Real-Time Game Engines
Dani Valevski
Yaniv Leviathan
Moab Arar
Shlomi Fruchter
DiffMVGenAI4CE
139
91
0
27 Aug 2024
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
Bojun Xiong
Si-Tong Wei
Xin-Yang Zheng
Yan-Pei Cao
Zhouhui Lian
Peng-Shuai Wang
96
10
0
27 Aug 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion
  Models
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
93
2
0
26 Aug 2024
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
Lazar Atanackovic
Xi Zhang
Brandon Amos
Mathieu Blanchette
Leo J. Lee
Yoshua Bengio
Alexander Tong
Kirill Neklyudov
183
12
0
26 Aug 2024
Atlas Gaussians Diffusion for 3D Generation
Atlas Gaussians Diffusion for 3D Generation
Haitao Yang
Yuan Dong
Hanwen Jiang
Dejia Xu
Georgios Pavlakos
Qixing Huang
3DGS
189
3
0
23 Aug 2024
AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion
AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion
Yunfang Niu
Lingxiang Wu
Dong Yi
Jie Peng
Ning Jiang
Haiying Wu
Jinqiao Wang
DiffM
75
1
0
21 Aug 2024
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
Haitao Zhou
Chuang Wang
Rui Nie
Jinxiao Lin
Dongdong Yu
Qian Yu
Changhu Wang
VGenDiffM
162
15
0
21 Aug 2024
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Oz Zafar
Yuval Cohen
Lior Wolf
Idan Schwartz
VLM
89
4
0
21 Aug 2024
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
Liyao Jiang
Negar Hassanpour
Mohammad Salameh
Mohan Sai Singamsetti
Fengyu Sun
Wei Lu
Di Niu
DiffM
150
2
0
21 Aug 2024
Perception-guided Jailbreak against Text-to-Image Models
Perception-guided Jailbreak against Text-to-Image Models
Yihao Huang
Le Liang
Tianlin Li
Xiaojun Jia
Run Wang
Weikai Miao
G. Pu
Yang Liu
122
11
0
20 Aug 2024
Moonshine: Distilling Game Content Generators into Steerable Generative Models
Moonshine: Distilling Game Content Generators into Steerable Generative Models
Yuhe Nie
Michael Middleton
Tim Merino
Nidhushan Kanagaraja
Ashutosh Kumar
Zhan Zhuang
Julian Togelius
91
0
0
18 Aug 2024
Deep Generative Classification of Blood Cell Morphology
Deep Generative Classification of Blood Cell Morphology
Simon Deltadahl
J. Gilbey
C. V. Laer
Nancy Boeckx
M. Leers
...
Nicholas S. Gleadall
Carola-Bibiane Schönlieb
S. Sivapalaratnam
Michael Roberts
P. Nachev
DiffMMedIm
78
0
0
16 Aug 2024
Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding
Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding
Bing Hu
Anita Layton
Helen Chen
MedIm
72
2
0
14 Aug 2024
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Utkarsh Nath
Rajeev Goel
Eun Som Jeon
Changhoon Kim
Kyle Min
Yezhou Yang
Yingzhen Yang
Pavan Turaga
167
1
0
12 Aug 2024
LaWa: Using Latent Space for In-Generation Image Watermarking
LaWa: Using Latent Space for In-Generation Image Watermarking
Ahmad Rezaei
Mohammad Akbari
Saeed Ranjbar Alvar
Arezou Fatemi
Yong Zhang
WIGM
98
17
0
11 Aug 2024
ZePo: Zero-Shot Portrait Stylization with Faster Sampling
ZePo: Zero-Shot Portrait Stylization with Faster Sampling
Jin Liu
Huaibo Huang
Jie Cao
Ran He
DiffM
80
2
0
10 Aug 2024
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?
Xinyu Liu
Shuyu Shen
Boyan Li
Peixian Ma
Runzhi Jiang
Yuxin Zhang
Ju Fan
Guoliang Li
Nan Tang
Yuyu Luo
76
32
0
09 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
166
59
0
05 Aug 2024
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy
  Curvature of Attention
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention
Mengkang Hu
DiffM
115
10
0
01 Aug 2024
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models
Yingkai Dong
Xiangtao Meng
Ning Yu
Zheng Li
Shanqing Guo
LLMAG
119
17
0
01 Aug 2024
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Manuel Kansy
Jacek Naruniec
Christopher Schroers
Markus Gross
Romann M. Weber
DiffMVGen
127
4
0
01 Aug 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
96
0
0
29 Jul 2024
Diffusion Models as Data Mining Tools
Diffusion Models as Data Mining Tools
Ioannis Siglidis
Aleksander Holynski
Alexei A. Efros
Mathieu Aubry
Shiry Ginosar
DiffMMedIm
95
3
0
20 Jul 2024
Connecting Consistency Distillation to Score Distillation for Text-to-3D
  Generation
Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation
Zong-Han Li
Minghui Hu
Qian Zheng
Xudong Jiang
DiffM
80
7
0
18 Jul 2024
Previous
123...91011...262728
Next