ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
Bayesian Power Steering: An Effective Approach for Domain Adaptation of
  Diffusion Models
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
84
1
0
06 Jun 2024
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Rishit Dagli
Shivesh Prakash
Robert Wu
H. Khosravani
141
6
0
06 Jun 2024
Tiny models from tiny data: Textual and null-text inversion for few-shot distillation
Tiny models from tiny data: Textual and null-text inversion for few-shot distillation
Erik Landolsi
Fredrik Kahl
DiffM
122
1
0
05 Jun 2024
A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
Zicheng Zhang
H. Wu
Chunyi Li
Yingjie Zhou
Wei Sun
Xiongkuo Min
Zijian Chen
Xiaohong Liu
Weisi Lin
Guangtao Zhai
EGVM
145
18
0
05 Jun 2024
TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems
TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems
Ryo Yonetani
139
1
0
05 Jun 2024
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Hao Wen
Zehuan Huang
Yaohui Wang
Xinyuan Chen
Yu Qiao
159
9
0
05 Jun 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Kengo Uchida
Takashi Shibuya
Yuhta Takida
Naoki Murata
Shusuke Takahashi
Shusuke Takahashi
Yuki Mitsufuji
VGen
141
5
0
04 Jun 2024
Turning Text and Imagery into Captivating Visual Video
Turning Text and Imagery into Captivating Visual Video
Mingming Wang
Elijah Miller
VGen
66
0
0
03 Jun 2024
pOps: Photo-Inspired Diffusion Operators
pOps: Photo-Inspired Diffusion Operators
Elad Richardson
Yuval Alaluf
Ali Mahdavi-Amiri
Daniel Cohen-Or
VLM
88
3
0
03 Jun 2024
Convergence of the denoising diffusion probabilistic models for general noise schedules
Convergence of the denoising diffusion probabilistic models for general noise schedules
Yumiharu Nakano
DiffM
155
1
0
03 Jun 2024
Information Theoretic Text-to-Image Alignment
Information Theoretic Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Massimo Gallo
Pietro Michiardi
174
0
0
31 May 2024
TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes
TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes
Minghao Guo
Bohan Wang
Kaiming He
Wojciech Matusik
3DGS
167
7
0
30 May 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
131
2
0
30 May 2024
Improved Emotional Alignment of AI and Humans: Human Ratings of Emotions
  Expressed by Stable Diffusion v1, DALL-E 2, and DALL-E 3
Improved Emotional Alignment of AI and Humans: Human Ratings of Emotions Expressed by Stable Diffusion v1, DALL-E 2, and DALL-E 3
J. Lomas
Willem van der Maden
Sohhom Bandyopadhyay
Giovanni Lion
Nirmal Patel
Gyanesh Jain
Yanna Litowsky
Haian Xue
Pieter M. A. Desmet
90
1
0
28 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
158
20
0
28 May 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
Yunchao Wei
DiffM
207
7
0
27 May 2024
Glauber Generative Model: Discrete Diffusion Models via Binary Classification
Glauber Generative Model: Discrete Diffusion Models via Binary Classification
Harshit Varma
Dheeraj M. Nagaraj
Karthikeyan Shanmugam
VLM
209
3
0
27 May 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
127
6
0
27 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
155
14
0
27 May 2024
Towards Black-Box Membership Inference Attack for Diffusion Models
Towards Black-Box Membership Inference Attack for Diffusion Models
Jingwei Li
Jingyi Dong
Tianxing He
Jingzhao Zhang
93
5
0
25 May 2024
Data Reconstruction: When You See It and When You Don't
Data Reconstruction: When You See It and When You Don't
Edith Cohen
Haim Kaplan
Yishay Mansour
Shay Moran
Kobbi Nissim
Uri Stemmer
Eliad Tsfadia
AAML
76
3
0
24 May 2024
Challenges and Opportunities in 3D Content Generation
Challenges and Opportunities in 3D Content Generation
Ke Zhao
Andreas Larsen
106
0
0
24 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
Masayoshi Tomizuka
Kurt Keutzer
Diana Marculescu
DiffMVGen
197
9
0
24 May 2024
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
Yongliang Wu
Shiji Zhou
Mingzhuo Yang
Lianzhe Wang
Wenbo Zhu
Heng Chang
Xiao Zhou
Xu Yang
Xu Yang
144
21
0
24 May 2024
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Run Luo
Yunshui Li
Longze Chen
Wanwei He
Ting-En Lin
...
Zikai Song
Xiaobo Xia
Tongliang Liu
Min Yang
Binyuan Hui
VLMDiffM
188
22
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
143
127
0
23 May 2024
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
152
9
0
23 May 2024
TerDiT: Ternary Diffusion Models with Transformers
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
120
3
0
23 May 2024
AdjointDEIS: Efficient Gradients for Diffusion Models
AdjointDEIS: Efficient Gradients for Diffusion Models
Zander W. Blasingame
Chen Liu
DiffM
158
5
0
23 May 2024
Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization
Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization
Zexi Li
Lingzhi Gao
Chao Wu
AI4CEDiffM
131
4
0
23 May 2024
Enhanced Creativity and Ideation through Stable Video Synthesis
Enhanced Creativity and Ideation through Stable Video Synthesis
Elijah Miller
Thomas Dupont
Mingming Wang
VGen
59
1
0
22 May 2024
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
N. Sebe
Mubarak Shah
EGVM
203
7
0
22 May 2024
LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting
LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting
Jia Gong
Shenyu Ji
Lin Geng Foo
Kang Chen
Hossein Rahmani
Jun Liu
3DGS
120
6
0
21 May 2024
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory
  Score Matching
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching
Xingyu Miao
Haoran Duan
Varun Ojha
Jun Song
Tejal Shah
Yang Long
R. Ranjan
131
4
0
18 May 2024
ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation
ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation
Pengzhi Li
Chengshuai Tang
Qinxuan Huang
Zhiheng Li
3DGS
78
12
0
17 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards
  Biomedical Healthcare
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CEMedImLM&MA
114
12
0
10 May 2024
Automated Virtual Product Placement and Assessment in Images using
  Diffusion Models
Automated Virtual Product Placement and Assessment in Images using Diffusion Models
Mohammad Mahmudul Alam
Negin Sokhandan
Emmett Goodman
DiffM
66
0
0
02 May 2024
Streamlining Image Editing with Layered Diffusion Brushes
Streamlining Image Editing with Layered Diffusion Brushes
Peyman Gholami
Robert Xiao
DiffM
89
1
0
01 May 2024
DOCCI: Descriptions of Connected and Contrasting Images
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe
Sunayana Rane
Zachary Berger
Yonatan Bitton
Jaemin Cho
...
Zarana Parekh
Jordi Pont-Tuset
Garrett Tanzer
Su Wang
Jason Baldridge
114
63
0
30 Apr 2024
Probing Unlearned Diffusion Models: A Transferable Adversarial Attack
  Perspective
Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective
Xiaoxuan Han
Songlin Yang
Wei Wang
Yang Li
Jing Dong
DiffMAAML
100
7
0
30 Apr 2024
X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models
X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models
Emmanuelle Bourigault
Abdullah Hamdi
Amir Jamaludin
MedIm
126
2
0
30 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
135
16
0
28 Apr 2024
MuseumMaker: Continual Style Customization without Catastrophic
  Forgetting
MuseumMaker: Continual Style Customization without Catastrophic Forgetting
Chenxi Liu
Gan Sun
Wenqi Liang
Jiahua Dong
Can Qin
Yang Cong
DiffM
125
4
0
25 Apr 2024
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and
  Lexical Alterations
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
VLMCoGe
106
0
0
25 Apr 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
Jordi Pont-Tuset
Aida Nematzadeh
EGVM
257
22
0
25 Apr 2024
Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging
  Perturbations That Efficiently Fool Customized Diffusion Models
Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models
Jingyao Xu
Yuetong Lu
Yandong Li
Siyang Lu
Dongdong Wang
Xiang Wei
AAMLDiffM
77
11
0
23 Apr 2024
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang
Pengfei Liu
Min Zhou
Ming Zeng
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
126
5
0
22 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
241
29
0
22 Apr 2024
Iteratively Prompting Multimodal LLMs to Reproduce Natural and
  AI-Generated Images
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
90
7
0
21 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
Yueting Zhuang
138
2
0
21 Apr 2024
Previous
123...111213...262728
Next