ResearchTrend.AI
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,324 papers shown
Title
MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark
Florinel-Alin Croitoru
Vlad Hondru
Marius Popescu
Radu Tudor Ionescu
F. Khan
Mubarak Shah
12
0
0
16 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
12
0
0
16 May 2025
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai
Qihang Fan
Xuefeng Hu
Zhenheng Yang
Ran He
Huaibo Huang
DiffM
12
0
0
16 May 2025
Score-based diffusion nowcasting of GOES imagery
Randy J. Chase
Katherine Haynes
Lander Ver Hoef
Imme Ebert-Uphoff
DiffM
26
0
0
15 May 2025
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
Amritanshu Tiwari
Cherish Puniani
Kaustubh Sharma
Ojasva Nema
DiffM
12
0
0
15 May 2025
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Yiwen Liu
Jessica Bader
Jae Myung Kim
DiffM
16
0
0
15 May 2025
Parallel Scaling Law for Language Models
Mouxiang Chen
Binyuan Hui
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
Jianling Sun
Junyang Lin
Zhongxin Liu
MoE
LRM
37
0
0
15 May 2025
Don't Forget your Inverse DDIM for Image Editing
Guillermo Gomez-Trenado
Pablo Mesejo
Ó. Cordón
Stéphane Lathuilière
DiffM
28
0
0
14 May 2025
Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation
Guan Gui
Bin-Bin Gao
Xiaozhong Liu
Chengjie Wang
Yongpeng Wu
DiffM
31
0
0
14 May 2025
Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models
Lhuqita Fazry
Valentino Vito
DiffM
40
0
0
13 May 2025
Controllable Image Colorization with Instance-aware Texts and Masks
Yanru An
Ling Gui
Qiang Hu
Chunlei Cai
Tianxiao Ye
Xiaoyun Zhang
Yanfeng Wang
DiffM
34
0
0
13 May 2025
Generative AI for Urban Planning: Synthesizing Satellite Imagery via Diffusion Models
Qingyi Wang
Y. Liang
Yunhan Zheng
Kaiyuan Xu
Jinhua Zhao
Shenhao Wang
21
0
0
13 May 2025
Addressing degeneracies in latent interpolation for diffusion models
Erik Landolsi
Fredrik Kahl
DiffM
45
0
0
12 May 2025
TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis
Longtian Wang
Xiaofei Xie
Tianlin Li
Yuhan Zhi
Chao Shen
19
0
0
11 May 2025
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffM
VGen
31
0
0
11 May 2025
MMiC: Mitigating Modality Incompleteness in Clustered Federated Learning
L. Yang
W. Zhang
Quan Z. Sheng
Weitong Chen
L. Yao
A. Shakeri
26
0
0
11 May 2025
Unsupervised Learning for Class Distribution Mismatch
Pan Du
Wangbo Zhao
Xinai Lu
Nian Liu
ZeLin Li
...
Suyun Zhao
H. Chen
Cuiping Li
Kai Wang
Yang You
26
0
0
11 May 2025
StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation
Ziyi Wang
Haipeng Li
Lin Sui
Tianhao Zhou
Hai Jiang
Lang Nie
Shuaicheng Liu
DiffM
VGen
49
0
0
10 May 2025
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation
Hang Wang
Zhi-Qi Cheng
Chenhao Lin
Chao Shen
Lei Zhang
DiffM
35
0
0
10 May 2025
Learning Graph Representation of Agent Diffusers
Youcef Djenouri
Nassim Belmecheri
Tomasz Michalak
Jan Dubiński
Ahmed Nabil Belbachir
Anis Yazidi
AI4CE
31
0
0
10 May 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
Xianghao Kong
Qiaosong Qi
Yuanbin Wang
Anyi Rao
Biaolong Chen
Aixi Zhang
Si Liu
Hao Jiang
DiffM
VGen
25
0
0
10 May 2025
Demystifying Diffusion Policies: Action Memorization and Simple Lookup Table Alternatives
Chengyang He
Xu Liu
Gadiel Sznaier Camps
Guillaume Sartoretti
Mac Schwager
28
0
0
09 May 2025
Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review
Abdullah
Tao Huang
Ickjai Lee
E. Ahn
MedIm
26
0
0
09 May 2025
Automated Learning of Semantic Embedding Representations for Diffusion Models
Limai Jiang
Yunpeng Cai
DiffM
31
0
0
09 May 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
Hanxun Huang
Sarah Monazam Erfani
Yige Li
Xingjun Ma
James Bailey
AAML
44
0
0
08 May 2025
Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models
Mikhail Chaichuk
Sushant Gautam
Steven A. Hicks
Elena Tutubalina
DiffM
MedIm
52
0
0
08 May 2025
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
Hongyang Zhu
Haipeng Liu
Bo Fu
Yang Wang
DiffM
35
0
0
08 May 2025
PIDiff: Image Customization for Personalized Identities with Diffusion Models
Jinyu Gu
Haipeng Liu
M. Y. Wang
Y. Wang
68
0
0
08 May 2025
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu
Gongye Liu
Jiajun Liang
Yongqian Li
Jiaheng Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Wanli Ouyang
AI4CE
68
0
0
08 May 2025
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Sagnik Bhattacharya
Abhiram Gorle
Ahmed Mohsin
Ahsan Bilal
Connor Ding
Amit Kumar Singh Yadav
Tsachy Weissman
DiffM
45
0
0
08 May 2025
InstanceGen: Image Generation with Instance-level Instructions
Etai Sella
Yanir Kleiman
Hadar Averbuch-Elor
33
0
0
08 May 2025
Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Pengfei Guo
Can Zhao
Dong Yang
Yufan He
V. Nath
...
Zongwei Zhou
Benjamin D. Simon
Stephanie Harmon
B. Turkbey
Daguang Xu
DiffM
MedIm
40
0
0
07 May 2025
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
Zhiping Qiu
Yitong Jin
Y. Wang
Yi Shi
C. Wang
Chao Tan
Xiaobing Li
Feng Yu
Tao Yu
Qionghai Dai
29
0
0
07 May 2025
CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion
Yongqian Li
Pencheng Wan
Liang Han
Yaowei Wang
Liqiang Nie
Min Zhang
43
0
0
07 May 2025
Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients
Stefano Bruno
Sotirios Sabanis
DiffM
50
0
0
06 May 2025
Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators
Will Hawkins
Chris Russell
Brent Mittelstadt
DiffM
129
0
0
06 May 2025
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
Mingcheng Li
Xiaolu Hou
Ziyang Liu
Dingkang Yang
Ziyun Qian
Jiawei Chen
Jinjie Wei
Y. Jiang
Qingyao Xu
Li Zhang
DiffM
150
0
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Xuzhi Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
Cheng Chen
Sijie Zhu
DiffM
81
1
0
05 May 2025
PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach
Nitin Rai
Arnold W. Schumann
Nathan Boyd
MedIm
39
0
0
03 May 2025
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao
Haoran Duan
Yang Long
J. Han
46
0
0
03 May 2025
Provable Efficiency of Guidance in Diffusion Models for General Data Distribution
Gen Li
Yuchen Jiao
48
0
0
02 May 2025
Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation
Daniele Molino
Francesco Di Feola
Linlin Shen
Paolo Soda
V. Guarrasi
MedIm
LM&MA
67
0
0
02 May 2025
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Andreas Sochopoulos
Nikolay Malkin
Nikolaos Tsagkas
João Moura
Michael Gienger
S. Vijayakumar
47
1
0
02 May 2025
GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution
Aditya Arora
Z. Tu
Y. Wang
Ruizheng Bai
Jian Wang
Sizhuo Ma
DiffM
61
0
0
01 May 2025
InstructAttribute: Fine-grained Object Attributes editing with Instruction
Xingxi Yin
Jingfeng Zhang
Zhi Li
Y. Li
Yuhang Zhang
DiffM
165
0
0
01 May 2025
Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection
Liqin Wang
Qianyue Hu
Wei Lu
Xiangyang Luo
DiffM
AAML
PICV
70
0
0
30 Apr 2025
IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing
Shijun Zhou
Yi Liu
Chunhui Hao
Zhiyuan Liu
Jiandong Tian
DiffM
39
0
0
30 Apr 2025
Capturing Conditional Dependence via Auto-regressive Diffusion Models
Xunpeng Huang
Yujin Han
Difan Zou
Yian Ma
Tong Zhang
DiffM
61
0
0
30 Apr 2025
Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions
Ziyi Dong
Chengxing Zhou
Weijian Deng
Pengxu Wei
Xiangyang Ji
Liang Lin
MQ
53
0
0
30 Apr 2025