ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,337 papers shown
Title
Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets
H. Kniesel
Pedro Hermosilla
Timo Ropinski
71
0
0
12 Mar 2025
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
Yiming Zhong
Qi Jiang
Jingyi Yu
Yuexin Ma
66
2
0
11 Mar 2025
Controlling Latent Diffusion Using Latent CLIP
Jason Becker
Chris Wendler
Peter Baylies
Robert West
Christian Wressnegger
DiffM
VLM
68
0
0
11 Mar 2025
NullFace: Training-Free Localized Face Anonymization
Han-Wei Kung
Tuomas Varanka
Terence Sim
N. Sebe
DiffM
PICV
68
0
0
11 Mar 2025
Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation
Mingkang Zhu
Xi Chen
Zihan Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
MoMe
57
0
0
11 Mar 2025
MVD-HuGaS: Human Gaussians from a Single Image via 3D Human Multi-view Diffusion Prior
K. Xiong
Ying Feng
Qi Zhang
Jianbo Jiao
Yang Zhao
Zhihao Liang
Huachen Gao
Ronggang Wang
3DGS
68
0
0
11 Mar 2025
Exploring Bias in over 100 Text-to-Image Generative Models
J. Vice
Naveed Akhtar
Richard I. Hartley
Ajmal Mian
EGVM
67
3
0
11 Mar 2025
MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution
X. Li
Jianlong Wu
Xinchuan Huang
C. L. Philip Chen
Weili Guan
Xian-Sheng Hua
Liqiang Nie
DiffM
56
0
0
11 Mar 2025
Identity Preserving Latent Diffusion for Brain Aging Modeling
Gexin Huang
Zhangsihao Yang
Yalin Wang
Guido Gerig
Mengwei Ren
Xiaoxiao Li
MedIm
DiffM
74
0
0
11 Mar 2025
MGHanD: Multi-modal Guidance for authentic Hand Diffusion
Taehyeon Eum
Jieun Choi
Tae-Kyun Kim
52
0
0
11 Mar 2025
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
Jialv Zou
Bencheng Liao
Qian Zhang
Wenyu Liu
Xinggang Wang
Mamba
MLLM
82
1
0
11 Mar 2025
Reconstruct Anything Model: a lightweight foundation model for computational imaging
Reconstruct Anything Model: a lightweight foundation model for computational imaging
M. Terris
Samuel Hurault
Maxime Song
Julian Tachella
MedIm
DiffM
78
2
0
11 Mar 2025
GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing
Yuanhao Wang
Cheng Zhang
Gonçalo Frazão
Jinlong Yang
Alexandru-Eugen Ichim
Thabo Beeler
Fernando de la Torre
DiffM
85
0
0
11 Mar 2025
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
Yuwei Niu
Munan Ning
Mengren Zheng
Bin Lin
Peng Jin
Jiaqi Liao
Kunpeng Ning
Bin Zhu
Li Yuan
EGVM
64
13
0
10 Mar 2025
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity
Kwanyoung Kim
Byeongsu Sim
DiffM
VLM
58
0
0
10 Mar 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang
Yiwei Guo
Yanbo Ding
Kunchang Li
Xinyuan Chen
Yaohui Wang
Fangyikang Wang
Ying Zhang
Chen Li
Yijiao Wang
45
0
0
10 Mar 2025
Efficient Distillation of Classifier-Free Guidance using Adapters
Cristian Perez Jensen
Seyedmorteza Sadat
53
1
0
10 Mar 2025
Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion
Yongle Zhang
Yimin Liu
Qiang Wu
DiffM
38
0
0
10 Mar 2025
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
Jiacheng Liu
Chang Zou
Yuanhuiyi Lyu
Junjie Chen
Linfeng Zhang
DiffM
63
1
0
10 Mar 2025
Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation
Sihao Lin
Daqi Liu
Ruochong Fu
Dongrui Liu
A. Song
Hongwei Xie
Zhihui Li
Bing Wang
Xiaojun Chang
74
0
0
10 Mar 2025
AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis
Zhangyu Lai
Yilin Lu
Xinyang Li
Jianghang Lin
Yansong Qu
Liujuan Cao
Ming Li
Rongrong Ji
DiffM
182
0
0
10 Mar 2025
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Chenglu Pan
Xiaogang Xu
Ganggui Ding
Yunke Zhang
Wenbo Li
Jiarong Xu
Qingbiao Wu
60
0
0
10 Mar 2025
LatexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending
Jian Jin
Zhenbo Yu
Yang Shen
Zhenyong Fu
Jian Yang
DiffM
63
0
0
10 Mar 2025
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
Shuhe Wang
Xiaoya Li
Jiwei Li
G. Wang
Xiaofei Sun
...
Han Qiu
Mo Yu
Shengjie Shen
Tianwei Zhang
Eduard H. Hovy
VLM
63
0
0
10 Mar 2025
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
Ruidong Chen
Honglin Guo
Lanjun Wang
Chenyu Zhang
Weizhi Nie
An-an Liu
DiffM
66
1
0
10 Mar 2025
Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation
Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation
Tianyu Chen
Yasi Zhang
Zihan Wang
Ying Nian Wu
Oscar Leong
Mingyuan Zhou
DiffM
50
1
0
10 Mar 2025
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
Xirui Hu
Jiahao Wang
Hao Chen
Weizhan Zhang
Benqi Wang
Yangfu Li
Haishun Nan
DiffM
67
0
0
09 Mar 2025
Color Alignment in Diffusion
Ka Chun Shum
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
65
0
0
09 Mar 2025
NaviDet: Efficient Input-level Backdoor Detection on Text-to-Image Synthesis via Neuron Activation Variation
Shengfang Zhai
Jiajun Li
Yue Liu
Huanran Chen
Zhihua Tian
Wenjie Qu
Qingni Shen
Ruoxi Jia
Yinpeng Dong
Jiaheng Zhang
AAML
49
0
0
09 Mar 2025
Conceptrol: Concept Control of Zero-shot Personalized Image Generation
Qiyuan He
Angela Yao
DiffM
41
0
0
09 Mar 2025
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
48
0
0
09 Mar 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Xiang Gao
Shuai Yang
Jiaying Liu
DiffM
51
0
0
08 Mar 2025
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma
Qirong Peng
Xu Guo
Chen Chen
H. Lu
Zhenyu Yang
VLM
72
1
0
08 Mar 2025
BioMoDiffuse: Physics-Guided Biomechanical Diffusion for Controllable and Authentic Human Motion Synthesis
Zixi Kang
Xinghan Wang
Yadong Mu
VGen
91
0
0
08 Mar 2025
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Zengqun Zhao
Ziquan Liu
Yu Cao
Shaogang Gong
Ioannis Patras
50
0
0
07 Mar 2025
Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving
Kalle Kujanpää
Daulet Baimukashev
Farzeen Munir
Shoaib Azam
Tomasz Piotr Kucner
Joni Pajarinen
Ville Kyrki
43
0
0
07 Mar 2025
Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces
Souhail Hadgi
Luca Moschella
Andrea Santilli
Diego Gomez
Qixing Huang
Emanuele Rodolà
Simone Melzi
M. Ovsjanikov
45
0
0
07 Mar 2025
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
55
0
0
06 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
98
0
0
06 Mar 2025
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach
Soumyadeep Ro
Sanapala Satwika
Pamarthi Yasoda Gayathri
Mohmmad Ghaith Balsha
Aysegul Ucar
VLM
ObjD
56
0
0
06 Mar 2025
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models
Rui Jiang
Xinghe Fu
Guangcong Zheng
Teng Li
Taiping Yao
Xi Li
DiffM
70
0
0
06 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
Xuelong Li
DiffM
VLM
52
0
0
06 Mar 2025
GenColor: Generative Color-Concept Association in Visual Design
Yihan Hou
Xingchen Zeng
Yusong Wang
Manling Yang
Xiaojiao Chen
Wei Zeng
DiffM
78
0
0
05 Mar 2025
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
Songlong Xing
Zhengyu Zhao
N. Sebe
AAML
62
1
0
05 Mar 2025
ProReflow: Progressive Reflow with Decomposed Velocity
Lei Ke
Haohang Xu
Xuefei Ning
Yong Li
Jiajun Li
Haoling Li
Yuxuan Lin
Dongsheng Jiang
Yuqing Yang
Linfeng Zhang
DiffM
62
1
0
05 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
66
0
0
05 Mar 2025
WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models
Tao Feng
Jie Zhang
Xiangjian Li
Rong Huang
Huashan Liu
Zhijie Wang
FedML
62
0
0
05 Mar 2025
Generative Tools for Graphical Assets: Empirical Guidelines based on Game Designers' and Developers' Preferences
Kaisei Fukaya
Damon Daylamani-Zad
Harry Agius
43
0
0
04 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
41
0
0
04 Mar 2025
Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting
Rong Zhang
Jun Wang
Zhiwen Zuo
Jianfeng Dong
W. Li
Chi-Yin Wang
Wenyuan Xu
Xun Wang
DiffM
69
0
0
03 Mar 2025
Previous
123...678...858687
Next